Topic outline
-
Most of the programming concepts presented so far can be found in any programming language. Constructs such as variable definitions, operators, basic input and output, and control flow via conditional statements and loops are fundamental to what it means to compute. In this unit, we begin studying how data is structured within Python so we can program efficiently. Specifically, you will be introduced to lists and also immersed more deeply in the subject of strings. Upcoming units will introduce even more powerful data structures.
Completing this unit should take you approximately 6 hours.
-
-
A list is a data structure capable of storing different types of data. Run this set of commands in Repl.it:
a=2 b=3 alist_examp=[1,3.4,'asdf',96, True, 9.6,'zxcv',False, 2>5,a+b] print(alist_examp) print(type(alist_examp)) print('This list contains ', len(alist_examp), ' elements')
You should see the list output to the screen as well as the data type of the variable
alist_examp
. Notice that the list contains bool, int, str, and float data. It also contains the result of relational and arithmetic operations. Any valid data type in Python can be placed within a list.The data contained within the list are called elements. The list you printed contains 10 elements. You can figure this out by counting them by hand. There is also a command called len that will give you this information, which you can see in the example. len is a very useful command.
Pay very close attention to the syntax used to build the list. The left square bracket/right square bracket is known as a "container". Elements must be separated by a comma when building the list.
Pay very close attention to this example. Add this code on the line after
print('This list contains ', len(alist_examp), ' elements')
.tmp=alist_examp print(tmp) print(alist_examp) tmp[3]='vbnm' print(tmp) print(alist_examp)
Notice that changing the value in the variable tmp also changed the value in the variable alist_examp. Initially, this appears to go against what we have learned about the assignment operation (=). For reasons that will be discussed in a later unit, when it comes to lists, Python views both variables as occupying the same space in memory. They are different names that refer to the same data. How then can we truly assign the list to a new variable that occupies a different place in memory and can be modified independently of the variable
alist_examp
? This set of commands will accomplish this:tmp=alist_examp.copy() print(tmp) print(alist_examp) tmp[3]='vbnm' print(tmp) print(alist_examp)
This example introduces some new syntax that will soon become second nature. The command "alist_examp.copy()" is your first introduction to object-oriented syntax. The "copy()" portion of this command is what is known as a method. It is connected to the variable alist_exmp using the period or 'dot notation'. This command makes a true hard copy of the variable alist_examp so that modifications of tmp do not affect alist_examp.
Every variable type in Python is what is known as an "object". You will learn an immense amount about objects as we delve deeper into the course. The main point to realize is that there are several methods available for use with lists. The
copy
method we just introduced is one of them. Some other important ones worth mentioning at this point arepop
,append
,remove
, andinsert
. Pay attention to the sections that describe these methods in detail. -
-
It is often the case that a program requires referencing a group of elements within a list (instead of a single element using a single value of the index). This can be accomplished by slicing. Consider running this example:
x=[101,-45,34,-300,8,9,-3,22,5] print() print(x) print(x[0:9])
The colon operator can be used to index multiple elements with the list. The index on the left side of the colon (:) is the starting index. By Python's indexing convention, the value on the right side of the colon (:) indexes that value minus one (be careful with the right index). Since the variable
x
has 9 elements indexed from 0 to 8,x[0:9]
references all indices from 0 to 8. Here are some more examples you should try:print() print(x[3:4]) print(x[3:5]) print(x[3:6]) print(x[3:7]) print(x[3:8]) print(x[3:9])
Again, you should be careful when using the right index since Python will sequence up to that value minus one. For the sake of convenience, there are also shorthand slicing techniques that do not require specifying the start or ending index when it assumed the complete rest of the list is being referenced:
print() print(x[3:]) print(x[:4])
In addition to specifying the start and stop indices, you can also specify the "step". Here is an example that will count by twos from element 3 to element 7 (every other element starting at index 3). The step size is given by the index value after the second colon.
print() print(x[3:8:2])
Finally, Python allows for negative indices where, by convention, the index -1 implies starting from the end of the list.
print() print(x[-1]) print(x[-1:-3:-1]) print(x[-1:-len(x)-1:-1])
In these examples, the step size of -1 means to count backwards in steps of -1.
This video reviews many of the list concepts we have discussed so far. At 6:11, it discusses and elaborates on the rules for slicing in Python. Be sure to follow along and practice the examples in Repl.it.
-
Practice these examples using Repl.it to become more familiar with some methods commonly applied to lists. As you go through these examples, you should begin to see how powerful Python can be as a programming language.
-
Python offers many opportunities for creating efficient programming structures. When it comes to writing loops, 'list comprehension' allows for very compact code. Using lists, it is possible to pack quite a bit of power using just one line. This is an optional topic that requires understanding how to write loops using lists. Practice the examples to expand your ability to write loops.
-
-
-
-
There are a host of methods available for processing string objects. Here are a few examples. At this point it is sensible to introduce the comment character, #. The comment character allows you to put comments into your code for the purpose of documentation. While comments are not considered executable code, they are extremely useful for making your code readable and understandable.
#explore changing to uppercase and lowercase a='good' c=a.upper() d=c.lower() print(c) print(d) #join a list of strings together with a space in between the strings b='morning' e=' '.join([a,b,'today']) print(e) #find a string within a string #find method returns the first index where string was found x='a picture is worth a thousand words' x1=x.find('picture') print(x1) x2=x.find('worth') print(x2) x3=x.find('words') print(x3) #split up a string into a list of smaller strings #use the ' ' space character as the boundary (delimiter) to split up the string y=x.split(' ') print(y) print(type(y)) #try the replace method ... z=x.replace('thousand', 'million') print(x) print(z)
Take some time to explore your own versions of these examples.
-
-
In this video, course designer Eric Sakk walks through the major topics we covered in Unit 4. As you watch, work through the exercises to try them out yourself.
-
Take this assessment to see how well you understood this unit.
- This assessment does not count towards your grade. It is just for practice!
- You will see the correct answers when you submit your answers. Use this to help you study for the final exam!
- You can take this assessment as many times as you want, whenever you want.
-