Version 58 (modified by 6 years ago) ( diff ) | ,
---|
Python 2/3 Compatibility
This guideline documents coding conventions to achieve hybrid Python-2.7/Python-3.5 compatibility in Sahana.
It's a working document that will be added to as we move towards full Python-3 compatibility.
Syntax
No print statements
Don't use the print statement anywhere:
print "example" # deprecated
You shouldn't use the print-function either, because it can clash with uWSGI:
print("example") # not good
...but it can be tolerated in CLI scripts which don't run in the WSGI environment.
Best option for debug output is to use sys.stderr.write:
import sys sys.stderr.write("example\n") # better
...or the logger (as it can be configured globally to write to a log file instead of the system console):
current.log.debug("example") # even better
Use as-syntax for catching exceptions
When catching exceptions in a variable, don't use the comma-syntax:
try: ... except Exception, e: # deprecated ...
Instead, use the as-keyword:
try: ... except Exception as e: # new standard ...
No exec statements
In Python-3, the exec
statement has become a function exec()
. Python-2.7 accepts the function-syntax as well, so we use it throughout:
# Deprecated: exec pyexpr # New Standard: exec(pyexpr)
NB the Python-2.7 exec
statement also accepts a 3-tuple as parameter exec(expr, globals, locals)
which syntax is equivalent to the exec-function in Python-3.
No cmp-parameter in sort/sorted
Python-3 does no longer support the cmp
parameter for x.sort()
and sorted(x)
. We use the key
parameter instead.
For locale-sensitive sorting, use the s3compat alternative sorted_locale
:
# Python-2 pattern, not working in Python-3: x = sorted(x, cmp=locale.strcoll) # Python-3 pattern, not working with unicode in Python-2: x = sorted(x, key=locale.strxfrm) # Compatible pattern: from s3compat import sorted_locale x = sorted_locale(x)
Raise Exceptions Instances
Python-3 does no longer support the raise E, V, T
syntax - the new syntax is raise E(V).with_traceback(T)
.
However, the traceback is rarely required - and where E is an Exception class (rather than a string), we can use the raise E(V)
syntax which works in all Python versions.
# Works in Py2, but not in Py3: raise SyntaxError, "Error Message" # Works in all Python versions, hence our Standard: raise SyntaxError("Error Message")
If a traceback object must be passed (which is rarely needed), then we must use the PY2 constant to implement alternative statements.
Usage
No implicit package-relative imports
Python-3 does not search for modules relative to the current module in the same package - unless explicitly indicated by leading .
or ..
in the module path.
from s3datetime import s3_format_datetime # inside modules/s3, not working in Python-3
Python-2.7 would search relative to the current module, but on the other hand, it supports the explicit-relative syntax as well.
So we decide that only explicit paths shall be used in imports.
To import a module in the same package (e.g. within s3), either use explicit-relative syntax:
from .s3datetime import s3_format_datetime # inside modules/s3, preferred variant
...or an absolute path relative to modules (or the global python path):
from s3.s3datetime import s3_format_datetime # inside modules/s3, acceptable alternative
Outside of modules/s3, you should always import from the top-level of the s3 package (because the package structure may change over time):
from s3 import s3_format_datetime # outside modules/s3
Alternative Imports
As the locations and names of some libraries have changed in Python-3, we use the compatibility module (modules/s3compat.py
) to implement suitable alternatives. Similarily, the compat module provides alternatives for other objects such as types, functions and certain common patterns.
Where an object is provided by modules/s3compat.py, it MUST be imported from there if used.
The following objects are provided by s3compat:
Constants
Name | Type | Comments/Caveats |
---|---|---|
PY2 | boolean | Constant indicating whether we're currently running on Py2, should only be used if alternatives cannot be generalized |
Libraries
Name | Type | Comments/Caveats |
---|---|---|
Cookie | module | maps to http.cookies in Py3 |
pickle | module | replaces cPickle in Py2 |
urlparse | module | maps to urllib.parse in Py3 |
urllib2 | module | maps to urllib.requests in Py3, which contains only part of Py2's urllib2 - some urllib2 objects therefore need to be imported separately (see below) |
Functions
Name | Type | Comments/Caveats |
---|---|---|
reduce | function | |
sorted_locale | lambda i | locale-sensitive sorting, sorted(i, cmp=locale.strcoll) in Py2, sorted(i, key=locale.strxfrm) in Py3
|
unichr | function | |
urlencode | function | replaces urllib.urlencode in Py2 |
urllib_quote | function | replaces urllib.quote in Py2 |
urlopen | function | replaces urllib.urlopen and urllib2.urlopen in Py2 |
xrange | function | maps to range in Py3, since xrange does no longer exist (but range behaves like it)
|
Types
Name | Type | Comments/Caveats |
---|---|---|
basestring | type | |
long | type | same as int in Py3, so can occasionally lead to redundancy
|
unicodeT | type | for type checking, instead of unicode in Py2, maps to str in Py3
|
ClassType | type | replaces types.ClassType (old-style classes) in Py2, maps to type in Py3 (old-style classes do no longer exist)
|
Type Tuples for isinstance()
Name | Type | Comments/Caveats |
---|---|---|
CLASS_TYPES | tuple | maps to tuple of all known class types: (type,types.ClassType) in Py2, just (type,) in Py3
|
INTEGER_TYPES | tuple | maps to tuple of all known integer types: (int,long) in Py2, (int,) in Py3 |
STRING_TYPES | tuple | maps to tuple of all known string types: (str,unicode) in Py2, (str,) in Py3 |
Other Classes and Exceptions
Name | Type | Comments/Caveats |
---|---|---|
HTTPError | Exception | replaces urllib2.HTTPError in Py2, NB HTTPError is a subclass of URLError, so must be caught first in order to differentiate |
HTMLParser | class | replaces HTMLParser.HTMLParser in Py2 |
StringIO | class/function | maps to cStringIO.StringIO in Py2 (which is a function rather than a class), so can't use this for type checking |
URLError | Exception | replaces urllib2.URLError in Py2 |
No dict.iteritems, iterkeys or itervalues
Python-3 does no longer have dict.iteritems()
. We use dict.items()
instead:
# Deprecated: for x, y in d.iteritems(): # Compatible: for x, y in d.items():
NB In Python-2, dict.items()
returns a list of tuples, but in Python-3 it returns a dict view object that is sensitive to changes of the dict. If a list is required, or the dict is changed inside the loop, the result of dict.items()
must be converted into a list explicitly.
The same applies to dict.iterkeys()
(use dict.keys()
instead) and dict.itervalues()
(use dict.values()
instead).
No dict.has_key
The dict.has_key()
method has been removed in Python-3 in favor of the x in y
pattern, which is also available (and equivalent) in Python-2.7:
# Deprecated: if d.has_key(k): # New Standard: if k in d:
Map, Filter and Zip return generators
In Python-3, the map()
, filter
and zip()
functions return generator objects rather than lists. This is fine when we want to iterate over the result, especially when there is a chance to break out of the loop early.
But where lists are required, the return value must be converted explicitly using the list constructor.
# This is fine: for item in map(func, values): # This could be wrong: result = map(func, values) # This is better: result = list(map(func, values)) # This could be even better: result = [func(v) for v in values]
NB For building a list from a single iterable argument, we prefer list comprehensions over map()
or filter()
for readability and speed.
NB Generator objects (unlike lists, tuples or sets) can only be iterated over once, and they cannot be accessed by index
Don't unicode.encode
Since there is no difference between unicode
and str
, using the encode()
method will produce bytes
rather than str
. A bytes
object differs from string in that it is an array of integers rather than an array of characters. It will also give a distorted result with any later str()
or s3_str()
.
if isinstance(x, unicodeT): # unicodeT maps to str in Py3 x = x.encode("utf-8") # x becomes a bytes-object in Py3, unlike in Py2 where it becomes a str str(x) # thus, in Py3, this results in something like "b'example'" instead of the expected "example" x[1] # is "e" in Py2, but 120 (an integer!) in Py3
If you just want to encode a potential unicode
instance as an utf-8 encoded str
, use s3_str
rather than unicode.encode
:
# Do this instead (without type-check): x = s3_str(x)