3 """fixdiv - tool to fix division operators.
5 To use this tool, first run `python -Qwarnall yourscript.py 2>warnings'.
6 This runs the script `yourscript.py' while writing warning messages
7 about all uses of the classic division operator to the file
8 `warnings'. The warnings look like this:
10 <file>:<line>: DeprecationWarning: classic <type> division
12 The warnings are written to stderr, so you must use `2>' for the I/O
13 redirect. I know of no way to redirect stderr on Windows in a DOS
14 box, so you will have to modify the script to set sys.stderr to some
15 kind of log file if you want to do this on Windows.
17 The warnings are not limited to the script; modules imported by the
18 script may also trigger warnings. In fact a useful technique is to
19 write a test script specifically intended to exercise all code in a
20 particular module or set of modules.
22 Then run `python fixdiv.py warnings'. This first reads the warnings,
23 looking for classic division warnings, and sorts them by file name and
24 line number. Then, for each file that received at least one warning,
25 it parses the file and tries to match the warnings up to the division
26 operators found in the source code. If it is successful, it writes
27 its findings to stdout, preceded by a line of dashes and a line of the
32 If the only findings found are suggestions to change a / operator into
33 a // operator, the output is acceptable input for the Unix 'patch'
36 Here are the possible messages on stdout (N stands for a line number):
38 - A plain-diff-style change ('NcN', a line marked by '<', a line
39 containing '---', and a line marked by '>'):
41 A / operator was found that should be changed to //. This is the
42 recommendation when only int and/or long arguments were seen.
44 - 'True division / operator at line N' and a line marked by '=':
46 A / operator was found that can remain unchanged. This is the
47 recommendation when only float and/or complex arguments were seen.
49 - 'Ambiguous / operator (..., ...) at line N', line marked by '?':
51 A / operator was found for which int or long as well as float or
52 complex arguments were seen. This is highly unlikely; if it occurs,
53 you may have to restructure the code to keep the classic semantics,
54 or maybe you don't care about the classic semantics.
56 - 'No conclusive evidence on line N', line marked by '*':
58 A / operator was found for which no warnings were seen. This could
59 be code that was never executed, or code that was only executed with
60 with user-defined objects as arguments. You will have to
61 investigate further. Note that // can be overloaded separately from
62 /, using __floordiv__. True division can also be separately
63 overloaded, using __truediv__. Classic division should be the same
64 as either of those. (XXX should I add a warning for division on
65 user-defined objects, to disambiguate this case from code that was
68 - 'Phantom ... warnings for line N', line marked by '*':
70 A warning was seen for a line not containing a / operator. The most
71 likely cause is a warning about code executed by 'exec' or eval()
72 (see note below), or an indirect invocation of the / operator, for
73 example via the div() function in the operator module. It could
74 also be caused by a change to the file between the time the test
75 script was run to collect warnings and the time fixdiv was run.
77 - 'More than one / operator in line N'; or
78 'More than one / operator per statement in lines N-N':
80 The scanner found more than one / operator on a single line, or in a
81 statement split across multiple lines. Because the warnings
82 framework doesn't (and can't) show the offset within the line, and
83 the code generator doesn't always give the correct line number for
84 operations in a multi-line statement, we can't be sure whether all
85 operators in the statement were executed. To be on the safe side,
86 by default a warning is issued about this case. In practice, these
87 cases are usually safe, and the -m option suppresses these warning.
89 - 'Can't find the / operator in line N', line marked by '*':
91 This really shouldn't happen. It means that the tokenize module
92 reported a '/' operator but the line it returns didn't contain a '/'
93 character at the indicated position.
95 - 'Bad warning for line N: XYZ', line marked by '*':
97 This really shouldn't happen. It means that a 'classic XYZ
98 division' warning was read with XYZ being something other than
99 'int', 'long', 'float', or 'complex'.
103 - The augmented assignment operator /= is handled the same way as the
106 - This tool never looks at the // operator; no warnings are ever
107 generated for use of this operator.
109 - This tool never looks at the / operator when a future division
110 statement is in effect; no warnings are generated in this case, and
111 because the tool only looks at files for which at least one classic
112 division warning was seen, it will never look at files containing a
113 future division statement.
115 - Warnings may be issued for code not read from a file, but executed
116 using an exec statement or the eval() function. These may have
117 <string> in the filename position, in which case the fixdiv script
118 will attempt and fail to open a file named '<string>' and issue a
119 warning about this failure; or these may be reported as 'Phantom'
120 warnings (see above). You're on your own to deal with these. You
121 could make all recommended changes and add a future division
122 statement to all affected files, and then re-run the test script; it
123 should not issue any warnings. If there are any, and you have a
124 hard time tracking down where they are generated, you can use the
125 -Werror option to force an error instead of a first warning,
126 generating a traceback.
128 - The tool should be run from the same directory as that from which
129 the original script was run, otherwise it won't be able to open
130 files given by relative pathnames.
142 opts
, args
= getopt
.getopt(sys
.argv
[1:], "hm")
143 except getopt
.error
, msg
:
154 usage("at least one file argument is required")
157 sys
.stderr
.write("%s: extra file arguments ignored\n", sys
.argv
[0])
158 warnings
= readwarnings(args
[0])
161 files
= warnings
.keys()
163 print "No classic division warnings read from", args
[0]
168 x
= process(file, warnings
[file])
173 sys
.stderr
.write("%s: %s\n" % (sys
.argv
[0], msg
))
174 sys
.stderr
.write("Usage: %s [-m] warnings\n" % sys
.argv
[0])
175 sys
.stderr
.write("Try `%s -h' for more information.\n" % sys
.argv
[0])
177 PATTERN
= ("^(.+?):(\d+): DeprecationWarning: "
178 "classic (int|long|float|complex) division$")
180 def readwarnings(warningsfile
):
181 prog
= re
.compile(PATTERN
)
183 f
= open(warningsfile
)
185 sys
.stderr
.write("can't open: %s\n" % msg
)
194 if line
.find("division") >= 0:
195 sys
.stderr
.write("Warning: ignored input " + line
)
197 file, lineno
, what
= m
.groups()
198 list = warnings
.get(file)
200 warnings
[file] = list = []
201 list.append((int(lineno
), intern(what
)))
205 def process(file, list):
207 assert list # if this fails, readwarnings() is broken
211 sys
.stderr
.write("can't open: %s\n" % msg
)
216 index
= 0 # list[:index] has been processed, list[index:] is still to do
217 g
= tokenize
.generate_tokens(f
.readline
)
219 startlineno
, endlineno
, slashes
= lineinfo
= scanline(g
)
220 if startlineno
is None:
222 assert startlineno
<= endlineno
is not None
224 while index
< len(list) and list[index
][0] < startlineno
:
225 orphans
.append(list[index
])
228 reportphantomwarnings(orphans
, f
)
230 while index
< len(list) and list[index
][0] <= endlineno
:
231 warnings
.append(list[index
])
233 if not slashes
and not warnings
:
235 elif slashes
and not warnings
:
236 report(slashes
, "No conclusive evidence")
237 elif warnings
and not slashes
:
238 reportphantomwarnings(warnings
, f
)
244 for (row
, col
), line
in slashes
:
251 print "*** More than one / operator in line", rows
[0]
253 print "*** More than one / operator per statement",
254 print "in lines %d-%d" % (rows
[0], rows
[-1])
258 for lineno
, what
in warnings
:
259 if what
in ("int", "long"):
261 elif what
in ("float", "complex"):
262 floatcomplex
.append(what
)
266 for (row
, col
), line
in slashes
:
271 if line
[col
:col
+1] != "/":
272 print "*** Can't find the / operator in line %d:" % row
276 print "*** Bad warning for line %d:" % row
, bad
278 elif intlong
and not floatcomplex
:
279 print "%dc%d" % (row
, row
)
282 print ">", line
[:col
] + "/" + line
[col
:]
283 elif floatcomplex
and not intlong
:
284 print "True division / operator at line %d:" % row
286 elif intlong
and floatcomplex
:
287 print "*** Ambiguous / operator (%s, %s) at line %d:" % (
288 "|".join(intlong
), "|".join(floatcomplex
), row
)
292 def reportphantomwarnings(warnings
, f
):
296 for row
, what
in warnings
:
299 blocks
.append(lastblock
)
300 lastblock
.append(what
)
303 whats
= "/".join(block
[1:])
304 print "*** Phantom %s warnings for line %d:" % (whats
, row
)
305 f
.report(row
, mark
="*")
307 def report(slashes
, message
):
309 for (row
, col
), line
in slashes
:
311 print "*** %s on line %d:" % (message
, row
)
312 print "*", chop(line
)
316 def __init__(self
, fp
, window
=5, lineno
=1):
320 self
.eoflookahead
= 0
324 while len(self
.lookahead
) < self
.window
and not self
.eoflookahead
:
325 line
= self
.fp
.readline()
327 self
.eoflookahead
= 1
329 self
.lookahead
.append(line
)
332 if not self
.lookahead
:
334 line
= self
.lookahead
.pop(0)
335 self
.buffer.append(line
)
339 del self
.buffer[-window
:]
340 def __getitem__(self
, index
):
342 bufstart
= self
.lineno
- len(self
.buffer)
343 lookend
= self
.lineno
+ len(self
.lookahead
)
344 if bufstart
<= index
< self
.lineno
:
345 return self
.buffer[index
- bufstart
]
346 if self
.lineno
<= index
< lookend
:
347 return self
.lookahead
[index
- self
.lineno
]
349 def report(self
, first
, last
=None, mark
="*"):
352 for i
in range(first
, last
+1):
356 line
= "<missing line>"
357 print mark
, chop(line
)
363 for type, token
, start
, end
, line
in g
:
365 if startlineno
is None:
366 startlineno
= endlineno
367 if token
in ("/", "/="):
368 slashes
.append((start
, line
))
369 if type == tokenize
.NEWLINE
:
371 return startlineno
, endlineno
, slashes
374 if line
.endswith("\n"):
379 if __name__
== "__main__":