This material is based on Google's Python Class licensed under Creative Commons Attribution 3.0 License, and code samples are licensed under the Apache 2.0 License.
Python is a dynamic, interpreted (bytecode-compiled) language. No type declarations are required for variables, parameters, functions, or methods in source code. This makes the code short and flexible, but you lose the compile-time type checking of the source code. Python tracks the types of all values at runtime and flags code that does not make sense as it runs.
An excellent way to see how Python code works is to run the Python interpreter and type code right into it. If you ever have a question like, "What happens if I add an int to a list?", just typing it into the Python interpreter is a fast and likely the best way to see what happens. (See below to see what really happens!)
$ python ## Run the Python interpreter
Python 2.7.9 (default, Dec 30 2014, 03:41:42)
[GCC 4.1.2 20080704 (Red Hat 4.1.2-55)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> a = 6
>>> a
6
>>> a + 2
8
>>> a = 'hi'
>>> a
'hi'
>>> len(a)
2
>>> a + len(a) ## try something that doesn't work
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: cannot concatenate 'str' and 'int' objects
>>> a + str(len(a))
'hi2'
>>> foo ## try something else that doesn't work
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'foo' is not defined
As you can see above, it's easy to experiment with variables and operators. Also, the interpreter throws, or "raises" in Python parlance, a runtime error if the code tries to read a variable that has not been assigned a value. Like C++ and Java, Python is case sensitive so "a" and "A" are different variables. The end of a line marks the end of a statement, so unlike C++ and Java, Python does not require a semicolon at the end of each statement. Comments begin with a '#' and extend to the end of the line.
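The int-plus-list question posed earlier can also be answered with a short script. The following is a minimal sketch, assuming Python 3 (the exact wording of error messages differs between Python versions); the helper name try_add is made up for illustration and is not part of the original material.

```python
def try_add(left, right):
    """Return left + right, or the name of the exception raised instead."""
    try:
        return left + right
    except TypeError as err:
        # Operand types are only checked when the + actually runs.
        return type(err).__name__


print(try_add(6, 2))          # two ints add normally
print(try_add(6, [1, 2, 3]))  # int + list is rejected at runtime
print(try_add('hi', 2))       # str + int is rejected too
print(try_add('hi', str(2)))  # converting the int first makes it work
```

Catching the exception here is only a convenience for printing; in the interactive interpreter you would simply see the traceback.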
Python source files use the ".py" extension and are called "modules."
With a Python module hello.py, the easiest way to run it is with the shell command "python hello.py Alice" which calls the Python interpreter to execute the code in hello.py, passing it the command line argument "Alice". See the official docs page on all the different options you have when running Python from the command-line.
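As a rough illustration of how a module can pick up that command line argument, here is a hypothetical hello.py. It is a sketch, assuming Python 3; the greeting() helper and the 'World' fallback are inventions for this example, not something the text specifies.

```python
import sys


def greeting(argv):
    """Build a greeting from a command-line argument list.

    argv[0] is the script name; argv[1], if present, is the first argument.
    """
    name = argv[1] if len(argv) > 1 else 'World'  # fallback is an assumption
    return 'Hello there ' + name


if __name__ == '__main__':
    # "python hello.py Alice" makes sys.argv == ['hello.py', 'Alice']
    print(greeting(sys.argv))
```

Keeping the logic in a function that takes the argument list makes it easy to try out from the interpreter, for example greeting(['hello.py', 'Alice']).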
Functions in Python are defined like this:
# Defines a "repeat" function that takes 2 arguments.
def repeat(s, exclaim):
    """
    Returns the string 's' repeated 3 times.
    If exclaim is true, add exclamation marks.
    """
    result = s + s + s  # can also use "s * 3" which is faster (Why?)
    if exclaim:
        result = result + '!!!'
    return result
Notice also how the lines that make up the function or if-statement are grouped by all having the same level of indentation. We also presented 2 different ways to repeat strings: the + operator, which is more user-friendly, and *, which also works because it's Python's "repeat" operator, meaning that '-' * 10 gives '----------', a neat way to create an onscreen "line." In the code comment, we hinted that * works faster than +, the reason being that * calculates the size of the resulting object once, whereas with +, that calculation is made each time + is called. Both + and * are called "overloaded" operators because they mean different things for numbers vs. for strings (and other data types).
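To make the "overloaded" idea concrete, the sketch below applies the same two operators to numbers, strings, and lists. It assumes Python 3, and the variable names are arbitrary.

```python
# + and * on numbers: ordinary arithmetic
total = 3 + 4
product = 3 * 4

# + and * on strings: concatenation and repetition
joined = 'Yay' + '!'
line = '-' * 10

# + and * on lists: the same idea applied to sequences of items
combined = [1, 2] + [3]
zeros = [0] * 4

print(total, product)
print(joined, line)
print(combined, zeros)
```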
The def keyword defines the function with its parameters within parentheses and its code indented. The first line of a function can be a documentation string ("docstring") that describes what the function does. The docstring can be a single line, or a multi-line description as in the example above. (Yes, those are "triple quotes," a distinctive Python feature!) Variables defined in the function are local to that function, so the "result" in the above function is separate from a "result" variable in another function. The return statement can take an argument, in which case that is the value returned to the caller.
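The points about docstrings, local variables, and return can be seen together in a small sketch. It assumes Python 3; shout(), whisper(), and no_return() are hypothetical functions made up for this illustration.

```python
def shout(s):
    """Return s in upper case followed by an exclamation mark."""
    result = s.upper() + '!'    # local to shout()
    return result


def whisper(s):
    """Return s in lower case followed by an ellipsis."""
    result = s.lower() + '...'  # a different, unrelated "result"
    return result


def no_return():
    pass  # a function that ends without a return statement gives back None


print(shout.__doc__)            # the docstring is stored on the function object
print(shout('hi'), whisper('HI'))
print(no_return())
```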
Here is code that calls the above repeat() function, printing what it returns:
print(repeat('Yay', False))
print(repeat('Woo Hoo', True))
At run time, functions must be defined by the execution of a "def" before they are called.
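One consequence of this rule is that the order of defs in a file matters less than the order in which code actually runs. The following sketch, assuming Python 3 and using made-up function names, shows both a failing and a working case.

```python
def call_too_early():
    """Try to call a function for which no def has been executed."""
    try:
        return not_yet_defined()   # hypothetical name, never defined
    except NameError as err:
        return type(err).__name__


def main():
    # helper() is defined further down the file, but that is fine:
    # the name is only looked up when main() actually runs.
    return helper()


def helper():
    return 'helper ran'


print(call_too_early())
print(main())   # by now the def for helper() has been executed
```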
One unusual Python feature is that the whitespace indentation of a piece of code affects its meaning. A logical block of statements such as the ones that make up a function should all have the same indentation, set in from the indentation of their parent function or "if" or whatever. If one of the lines in a group has a different indentation, it is flagged as a syntax error.
Python's use of whitespace feels a little strange at first, but it's logical and I found I got used to it very quickly. Avoid using TABs as they greatly complicate the indentation scheme (not to mention TABs may mean different things on different platforms). Set your editor to insert spaces instead of TABs for Python code.
A common question beginners ask is, "How many spaces should I indent?" According to the official Python style guide (PEP 8), you should indent with 4 spaces.
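To see the indentation rule enforced without breaking a real file, the snippet below hands two small pieces of source text to the built-in compile() function. This is a sketch assuming Python 3; the check() helper and the sample strings are inventions for this example.

```python
GOOD = (
    "if True:\n"
    "    x = 1\n"
    "    y = 2\n"
)

BAD = (
    "if True:\n"
    "    x = 1\n"
    "  y = 2\n"      # indented differently from the line above
)


def check(source):
    """Compile source without running it; report the error class, if any."""
    try:
        compile(source, '<example>', 'exec')
        return 'ok'
    except SyntaxError as err:   # IndentationError is a kind of SyntaxError
        return type(err).__name__


print(check(GOOD))
print(check(BAD))
```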
Python does very little checking at compile time, deferring almost all type, name, etc. checks on each line until that line runs.
if name == 'Guido':
    print(repeeeet(name) + '!!!')
else:
    print(repeat(name, False))
The if-statement contains an obvious error, where the repeat() function is accidentally typed in as repeeeet(). The funny thing in Python ... this code compiles and runs fine so long as the name at runtime is not 'Guido'. Only when a run actually tries to execute the repeeeet() will it notice that there is no such function and raise an error. This just means that when you first run a Python program, some of the first errors you see will be simple typos like this. This is one area where languages with a more verbose type system, like Java, have an advantage ... they can catch such errors at compile time (but of course you have to maintain all that type information ... it's a tradeoff).
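The behavior described above can be reproduced in a self-contained form. This sketch assumes Python 3 and wraps the if-statement in a hypothetical greet() function so that both branches can be exercised.

```python
def repeat(s, exclaim):
    result = s * 3
    if exclaim:
        result = result + '!!!'
    return result


def greet(name):
    if name == 'Guido':
        return repeeeet(name) + '!!!'   # deliberate typo from the text
    else:
        return repeat(name, False)


print(greet('Alice'))        # the typo line never runs, so no error

try:
    greet('Guido')           # now the misspelled name is looked up
except NameError as err:
    print('NameError:', err)
```

The def for greet() executes without complaint; the misspelled name is only noticed once that branch is reached.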
Since Python variables don't have any type spelled out in the source code, it's extra helpful to give meaningful names to your variables to remind yourself of what's going on. So use "name" if it's a single name, and "names" if it's a list of names, and "tuples" if it's a list of tuples. Many basic Python errors result from forgetting what type of value is in each variable, so use your variable names (all you have really) to help keep things straight.
As far as actual naming goes, some languages prefer underscored_parts for variable names made up of "more than one word," but other languages prefer camelCasing. In general, Python prefers the underscore method but guides developers to defer to camelCasing if integrating into existing Python code that already uses that style. Readability counts. Read more in the section on naming conventions in PEP 8.
As you can guess, keywords like 'if' and 'while' cannot be used as variable names; you'll get a syntax error if you do. Built-ins are a different matter: they are not keywords, so Python will let you reuse their names, and new Python developers often do so inadvertently. For example, while 'str' and 'list' may seem like good variable names, assigning to them would override those built-in names for the rest of that scope.
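The difference between a keyword and a built-in can be checked directly. The sketch below assumes Python 3 and uses the standard keyword module; shadowing_demo() is a made-up helper that confines the damage to one function.

```python
import keyword

print(keyword.iskeyword('while'))  # True: reserved by the language
print(keyword.iskeyword('list'))   # False: just a built-in name


def shadowing_demo():
    """Reuse 'list' as a variable name, then try to call the built-in."""
    list = [1, 2, 3]        # legal, but hides the built-in inside this function
    try:
        return list('abc')  # now calls the [1, 2, 3] object, not the built-in
    except TypeError as err:
        return str(err)


print(shadowing_demo())
print(list('abc'))          # outside the function the built-in is intact
```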
With the statement import sys you can then access the definitions in the sys module and make them available by their fully-qualified name, e.g. sys.exit(). (Yes, 'sys' has a namespace too!)
import sys
# Now can refer to sys.xxx facilities
sys.version
There is another import form that looks like this: "from sys import version". That makes version available as version without the module prefix; however, we recommend the original form with the fully-qualified names because it's a lot easier to determine where a function or attribute came from.
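Both import forms can sit side by side, which makes the trade-off easy to see. A short sketch, assuming Python 3:

```python
import sys                 # names are reached as sys.version, sys.argv, ...
from sys import version    # copies one name into the current namespace

print(sys.version == version)   # both refer to the same string

# With the qualified form, the origin of a name is visible at the call site.
print(type(sys.argv).__name__)
```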
There are many modules and packages which are bundled with a standard installation of the Python interpreter, so you don't have to do anything extra to use them. These are collectively known as the "Python Standard Library." Commonly used modules/packages include:
- sys: access to version, exit(), argv, stdin, stdout, ...
- re: regular expressions
- os: operating system interface, file system
You can find the documentation of all the Standard Library modules and packages at https://docs.python.org/3/library/.
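As a quick taste of these three modules, the sketch below uses one facility from each. It assumes Python 3; the sample string and the file name are made up for illustration.

```python
import os
import re
import sys

# sys: interpreter details and command-line arguments
major = sys.version_info[0]

# re: pull the first run of digits out of a string
match = re.search(r'\d+', 'room 42, floor 7')
digits = match.group() if match else None

# os: build a path using the separator of the current platform
path = os.path.join('data', 'notes.txt')   # hypothetical file name

print(major, digits, path)
```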
There are a variety of ways to get help for Python.
- Do a Google search, starting with the word "python", like "python list" or "python string lowercase". The first hit is often the answer. This technique seems to work better for Python than it does for other languages for some reason.
- The official Python docs site — docs.python.org — has high quality docs. Nonetheless, I often find a Google search of a couple words to be quicker.
- There is also an official Tutor mailing list specifically designed for those who are new to Python and/or programming!
- Many questions (and answers) can be found on StackOverflow and Quora.
- Use the help() and dir() functions (see below).
Inside the Python interpreter, the help() function pulls up documentation strings for various modules, functions, and methods. These doc strings are similar to Java's javadoc. The dir() function tells you what the attributes of an object are. Below are some ways to call help() and dir() from the interpreter:
- help(len): help string for the built-in len() function; note that it's "len" not "len()", which is a call to the function, which we don't want
- help(sys): help string for the sys module (must do an import sys first)
- dir(sys): dir() is like help() but just gives a quick list of its defined symbols, or "attributes"
- help(sys.exit): help string for the exit() function in the sys module
- help('xyz'.split): help string for the split() method for string objects. You can call help() with that object itself or an example of that object, plus its attribute. For example, calling help('xyz'.split) is the same as calling help(str.split).
- help(list): help string for list objects
- dir(list): displays list object attributes, including its methods
- help(list.append): help string for the append() method for list objects
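Outside the interactive interpreter, the same information can be reached from a script, since dir() returns an ordinary list and docstrings are stored as attributes. A minimal sketch, assuming Python 3:

```python
import sys

# dir() returns a plain list of attribute names, so it can be searched.
print('append' in dir(list))
print('exit' in dir(sys))

# help(len) displays text built from the function's docstring,
# which is also available directly as an attribute.
print(len.__doc__)

# String methods whose names start with "s".
s_methods = [name for name in dir(str) if name.startswith('s')]
print(s_methods)
```

Filtering the output of dir() like this is a handy way to look for a method whose exact name you have forgotten.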