7 This document is a work in progress!
16 Aggressive Dead Code Elimination
21 Due to Clang's influence (mostly the fact that parsing and semantic
22 analysis are so intertwined for C and especially C++), the typical
23 working definition of AST in the LLVM community is roughly "the
24 compiler's first complete symbolic (as opposed to textual)
25 representation of an input program".
26 As such, an "AST" might be a more general graph instead of a "tree"
27 (consider the symbolic representation for the type of a typical "linked
28 list node"). This working definition is closer to what some authors
29 call an "annotated abstract syntax tree".
31 Consult your favorite compiler book or search engine for more details.
36 .. _lexicon-bb-vectorization:
39 Basic-Block Vectorization
42 Bit-tracking dead code elimination. Some bit-wise instructions (shifts,
43 ands, ors, etc.) "kill" some of their input bits -- that is, they make it
44 such that those bits can be either zero or one without affecting control or
45 data flow of a program. The BDCE pass removes instructions that only
46 compute these dead bits.
49 Bottom Up Rewriting System --- A method of instruction selection for code
50 generation. An example is the `BURG
51 <http://www.program-transformation.org/Transform/BURG>`_ tool.
57 Call Frame Information. Used in DWARF debug info and in C++ unwind info
58 to show how the function prolog lays out the stack frame.
61 Common Information Entry. A kind of CFI used to reduce the size of FDEs.
62 The compiler creates a CIE which contains the information common across all
63 the FDEs. Each FDE then points to its CIE.
66 Common Subexpression Elimination. An optimization that removes common
67 subexpression computation. For example ``(a+b)*(a+b)`` has two
68 subexpressions that are the same: ``(a+b)``. This optimization would
69 perform the addition only once and then perform the multiply (but only if
70 it's computationally correct/safe).
76 Directed Acyclic Graph
82 A pointer to the interior of an object, such that a garbage collector is
83 unable to use the pointer for reachability analysis. While a derived pointer
84 is live, the corresponding object pointer must be kept in a root, otherwise
85 the collector might free the referenced object. With copying collectors,
86 derived pointers pose an additional hazard that they may be invalidated at
87 any `safe point`_. This term is used in opposition to `object pointer`_.
90 Data Structure Analysis
93 Dead Store Elimination
99 This namespace houses the
100 `Clang Static Analyzer <https://clang.llvm.org/docs/ClangStaticAnalyzer.html>`_.
101 It is an abbreviation of `entomology <https://en.wikipedia.org/wiki/Entomology>`_.
103 *"Entomology is the scientific study of insects."*
105 In the past, this namespace had not only the name `GR` (aka. Graph Reachability)
112 First Class Aggregate
115 Frame Description Entry. A kind of CFI used to describe the stack frame of
122 Garbage Collection. The practice of using reachability analysis instead of
123 explicit memory management to reclaim unused memory.
126 ``GetElementPtr``. An LLVM IR instruction that is used to get the address
127 of a subelement of an aggregate data structure. It is documented in detail
128 `here <https://llvm.org/docs/GetElementPtr.html>`_.
131 Global Value Numbering. GVN is a pass that partitions values computed by a
132 function into congruence classes. Values ending up in the same congruence
133 class are guaranteed to be the same for every execution of the program.
134 In that respect, congruency is a compile-time approximation of equivalence
135 of values at runtime.
143 In garbage collection, the region of memory which is managed using
144 reachability analysis.
150 Internal Compiler Error. This abbreviation is used to describe errors
151 that occur in LLVM or Clang as they are compiling source code. For example,
152 if a valid C++ source program were to trigger an assert in Clang when
153 compiled, that could be referred to as an "ICE".
156 Inter-Procedural Analysis. Refers to any variety of code analysis that
157 occurs between procedures, functions or compilation units (modules).
160 Inter-Procedural Optimization. Refers to any variety of code optimization
161 that occurs between procedures, functions or compilation units (modules).
164 Instruction Selection
170 Loop-Closed Static Single Assignment Form
173 "Looks Good To Me". In a review thread, this indicates that the
174 reviewer thinks that the patch is okay to commit.
177 Loop Invariant Code Motion
180 Language Specific Data Area. C++ "zero cost" unwinding is built on top a
181 generic unwinding mechanism. As the unwinder walks each frame, it calls
182 a "personality" function to do language specific analysis. Each function's
183 FDE points to an optional LSDA which is passed to the personality function.
184 For C++, the LSDA contain info about the type and location of catch
185 statements in that function.
191 Link-Time Optimization
204 "No functional change". Used in a commit message to indicate that a patch
205 is a pure refactoring/cleanup.
206 Usually used in the first line, so it is visible without opening the
215 A pointer to an object such that the garbage collector is able to trace
216 references contained within the object. This term is used in opposition to
223 Problem report. A bug filed on `the LLVM Bug Tracking System
224 <https://bugs.llvm.org/enter_bug.cgi>`_.
227 Partial Redundancy Elimination
234 Replace All Uses With. The functions ``User::replaceUsesOfWith()``,
235 ``Value::replaceAllUsesWith()``, and
236 ``Constant::replaceUsesOfWithOnConstant()`` implement the replacement of one
237 Value with another by iterating over its def/use chain and fixing up all of
238 the pointers to point to the new value. See
239 also `def/use chains <ProgrammersManual.html#iterating-over-def-use-use-def-chains>`_.
242 Rearranging associative expressions to promote better redundancy elimination
243 and other optimization. For example, changing ``(A+B-A)`` into ``(B+A-A)``,
244 permitting it to be optimized into ``(B+0)`` then ``(B)``.
247 Request for Comment. An email sent to a project mailing list in order to
248 solicit feedback on a proposed change.
254 In garbage collection, a pointer variable lying outside of the `heap`_ from
255 which the collector begins its reachability analysis. In the context of code
256 generation, "root" almost always refers to a "stack root" --- a local or
257 temporary variable within an executing function.
268 In garbage collection, it is necessary to identify `stack roots`_ so that
269 reachability analysis may proceed. It may be infeasible to provide this
270 information for every instruction, so instead the information is
271 calculated only at designated safe points. With a copying collector,
272 `derived pointers`_ must not be retained across safe points and `object
273 pointers`_ must be reloaded from stack roots.
276 Selection DAG Instruction Selection.
279 Strongly Connected Component
282 Sparse Conditional Constant Propagation
285 Superword-Level Parallelism, same as :ref:`Basic-Block Vectorization
286 <lexicon-bb-vectorization>`.
289 Splat refers to a vector of identical scalar elements.
291 The term is based on the PowerPC Altivec instructions that provided
292 this functionality in hardware. For example, "vsplth" and the corresponding
293 software intrinsic "vec_splat()". Examples of other hardware names for this
294 action include "duplicate" (ARM) and "broadcast" (x86).
297 Scalar Replacement of Aggregates
300 Static Single Assignment
303 In garbage collection, metadata emitted by the code generator which
304 identifies `roots`_ within the stack frame of an executing function.
310 Type-Based Alias Analysis