1 llvm-profdata - Profile data tool
2 =================================
4 .. program:: llvm-profdata
9 :program:`llvm-profdata` *command* [*args...*]
14 The :program:`llvm-profdata` tool is a small utility for working with profile
20 * :ref:`merge <profdata-merge>`
21 * :ref:`show <profdata-show>`
22 * :ref:`overlap <profdata-overlap>`
23 * :ref:`order <profdata-order>`
25 .. program:: llvm-profdata merge
35 :program:`llvm-profdata merge` [*options*] [*filename...*]
40 :program:`llvm-profdata merge` takes several profile data files
41 generated by PGO instrumentation and merges them together into a single
42 indexed profile data file.
44 By default profile data is merged without modification. This means that the
45 relative importance of each input file is proportional to the number of samples
46 or counts it contains. In general, the input from a longer training run will be
47 interpreted as relatively more important than a shorter run. Depending on the
48 nature of the training runs it may be useful to adjust the weight given to each
49 input file by using the ``-weighted-input`` option.
51 Profiles passed in via ``-weighted-input``, ``-input-files``, or via positional
52 arguments are processed once for each time they are seen.
60 Print a summary of command line options.
62 .. option:: --output=<output>, -o
64 Specify the output file name. *Output* cannot be ``-`` as the resulting
65 indexed profile data can't be written to standard output.
67 .. option:: --weighted-input=<weight,filename>
69 Specify an input file name along with a weight. The profile counts of the
70 supplied ``filename`` will be scaled (multiplied) by the supplied
71 ``weight``, where ``weight`` is a decimal integer >= 1.
72 Input files specified without using this option are assigned a default
73 weight of 1. Examples are shown below.
75 .. option:: --input-files=<path>, -f
77 Specify a file which contains a list of files to merge. The entries in this
78 file are newline-separated. Lines starting with '#' are skipped. Entries may
79 be of the form <filename> or <weight>,<filename>.
81 .. option:: --remapping-file=<path>, -r
83 Specify a file which contains a remapping from symbol names in the input
84 profile to the symbol names that should be used in the output profile. The
85 file should consist of lines of the form ``<input-symbol> <output-symbol>``.
86 Blank lines and lines starting with ``#`` are skipped.
88 The :doc:`llvm-cxxmap <llvm-cxxmap>` tool can be used to generate the symbol
91 .. option:: --instr (default)
93 Specify that the input profile is an instrumentation-based profile.
97 Specify that the input profile is a sample-based profile.
99 The format of the generated file can be generated in one of three ways:
101 .. option:: --binary (default)
103 Emit the profile using a binary encoding. For instrumentation-based profile
104 the output format is the indexed binary format.
106 .. option:: --extbinary
108 Emit the profile using an extensible binary encoding. This option can only
109 be used with sample-based profile. The extensible binary encoding can be
110 more compact with compression enabled and can be loaded faster than the
111 default binary encoding.
115 Emit the profile in text mode. This option can also be used with both
116 sample-based and instrumentation-based profile. When this option is used
117 the profile will be dumped in the text format that is parsable by the profile
122 Emit the profile using GCC's gcov format (Not yet supported).
124 .. option:: --sparse[=true|false]
126 Do not emit function records with 0 execution count. Can only be used in
127 conjunction with -instr. Defaults to false, since it can inhibit compiler
128 optimization during PGO.
130 .. option:: --num-threads=<N>, -j
132 Use N threads to perform profile merging. When N=0, llvm-profdata auto-detects
133 an appropriate number of threads to use. This is the default.
135 .. option:: --failure-mode=[any|all]
137 Set the failure mode. There are two options: 'any' causes the merge command to
138 fail if any profiles are invalid, and 'all' causes the merge command to fail
139 only if all profiles are invalid. If 'all' is set, information from any
140 invalid profiles is excluded from the final merged product. The default
141 failure mode is 'any'.
143 .. option:: --prof-sym-list=<path>
145 Specify a file which contains a list of symbols to generate profile symbol
146 list in the profile. This option can only be used with sample-based profile
147 in extbinary format. The entries in this file are newline-separated.
149 .. option:: --compress-all-sections=[true|false]
151 Compress all sections when writing the profile. This option can only be used
152 with sample-based profile in extbinary format.
154 .. option:: --use-md5=[true|false]
156 Use MD5 to represent string in name table when writing the profile.
157 This option can only be used with sample-based profile in extbinary format.
159 .. option:: --gen-partial-profile=[true|false]
161 Mark the profile to be a partial profile which only provides partial profile
162 coverage for the optimized target. This option can only be used with
163 sample-based profile in extbinary format.
165 .. option:: --convert-sample-profile-layout=[nest|flat]
167 Convert the merged profile into a profile with a new layout. Supported
168 layout are ``nest`` (Nested profile, the input should be CS flat profile) and
169 ``flat`` (Profile with nested inlinees flattened out).
171 .. option:: --supplement-instr-with-sample=<file>
173 Supplement an instrumentation profile with sample profile. The sample profile
174 is the input of the flag. Output will be in instrumentation format (only works
177 .. option:: --zero-counter-threshold=<float>
179 For the function which is cold in instr profile but hot in sample profile, if
180 the ratio of the number of zero counters divided by the total number of
181 counters is above the threshold, the profile of the function will be regarded
182 as being harmful for performance and will be dropped.
184 .. option:: --instr-prof-cold-threshold=<int>
186 User specified cold threshold for instr profile which will override the cold
187 threshold got from profile summary.
189 .. option:: --suppl-min-size-threshold=<int>
191 If the size of a function is smaller than the threshold, assume it can be
192 inlined by PGO early inliner and it will not be adjusted based on sample
195 .. option:: --debug-info=<path>
197 Specify the executable or ``.dSYM`` that contains debug info for the raw profile.
198 When ``--debug-info-correlate`` or ``--profile-correlate=debug-info`` was used
199 for instrumentation, use this option to correlate the raw profile.
201 .. option:: --binary-file=<path>
203 Specify the executable that contains profile data and profile name sections for
204 the raw profile. When ``-profile-correlate=binary`` was used for
205 instrumentation, use this option to correlate the raw profile.
207 .. option:: --temporal-profile-trace-reservoir-size
209 The maximum number of temporal profile traces to be stored in the output
210 profile. If more traces are added, we will use reservoir sampling to select
211 which traces to keep. Note that changing this value between different merge
212 invocations on the same indexed profile could result in sample bias. The
213 default value is 100.
215 .. option:: --temporal-profile-max-trace-length
217 The maximum number of functions in a single temporal profile trace. Longer
218 traces will be truncated. The default value is 1000.
224 Merge three profiles:
228 llvm-profdata merge foo.profdata bar.profdata baz.profdata -output merged.profdata
232 The input file ``foo.profdata`` is especially important, multiply its counts by 10:
236 llvm-profdata merge --weighted-input=10,foo.profdata bar.profdata baz.profdata --output merged.profdata
238 Exactly equivalent to the previous invocation (explicit form; useful for programmatic invocation):
242 llvm-profdata merge --weighted-input=10,foo.profdata --weighted-input=1,bar.profdata --weighted-input=1,baz.profdata --output merged.profdata
244 .. program:: llvm-profdata show
254 :program:`llvm-profdata show` [*options*] [*filename*]
259 :program:`llvm-profdata show` takes a profile data file and displays the
260 information about the profile counters for this file and
261 for any of the specified function(s).
263 If *filename* is omitted or is ``-``, then **llvm-profdata show** reads its
264 input from standard input.
269 .. option:: --all-functions
271 Print details for every function.
273 .. option:: --binary-ids
275 Print embedded binary ids in a profile.
279 Print the counter values for the displayed functions.
281 .. option:: --show-format=<text|json|yaml>
283 Emit output in the selected format if supported by the provided profile type.
285 .. option:: --function=<string>
287 Print details for a function if the function's name contains the given string.
291 Print a summary of command line options.
293 .. option:: --output=<output>, -o
295 Specify the output file name. If *output* is ``-`` or it isn't specified,
296 then the output is sent to standard output.
298 .. option:: --instr (default)
300 Specify that the input profile is an instrumentation-based profile.
304 Instruct the profile dumper to show profile counts in the text format of the
305 instrumentation-based profile data representation. By default, the profile
306 information is dumped in a more human readable form (also in text) with
309 .. option:: --topn=<n>
311 Instruct the profile dumper to show the top ``n`` functions with the
312 hottest basic blocks in the summary section. By default, the topn functions
317 Specify that the input profile is a sample-based profile.
319 .. option:: --memop-sizes
321 Show the profiled sizes of the memory intrinsic calls for shown functions.
323 .. option:: --value-cutoff=<n>
325 Show only those functions whose max count values are greater or equal to ``n``.
326 By default, the value-cutoff is set to 0.
328 .. option:: --list-below-cutoff
330 Only output names of functions whose max count value are below the cutoff
333 .. option:: --profile-version
335 Print profile version.
339 Only show context sensitive profile counts. The default is to filter all
340 context sensitive profile counts.
342 .. option:: --show-prof-sym-list=[true|false]
344 Show profile symbol list if it exists in the profile. This option is only
345 meaningful for sample-based profile in extbinary format.
347 .. option:: --show-sec-info-only=[true|false]
349 Show basic information about each section in the profile. This option is
350 only meaningful for sample-based profile in extbinary format.
352 .. option:: --debug-info=<path>
354 Specify the executable or ``.dSYM`` that contains debug info for the raw profile.
355 When ``--debug-info-correlate`` or ``--profile-correlate=debug-info`` was used
356 for instrumentation, use this option to show the correlated functions from the
359 .. option:: --covered
361 Show only the functions that have been executed, i.e., functions with non-zero
364 .. program:: llvm-profdata overlap
366 .. _profdata-overlap:
374 :program:`llvm-profdata overlap` [*options*] [*base profile file*] [*test profile file*]
379 :program:`llvm-profdata overlap` takes two profile data files and displays the
380 *overlap* of counter distribution between the whole files and between any of the
383 In this command, *overlap* is defined as follows:
384 Suppose *base profile file* has the following counts:
385 {c1_1, c1_2, ..., c1_n, c1_u_1, c2_u_2, ..., c2_u_s},
386 and *test profile file* has
387 {c2_1, c2_2, ..., c2_n, c2_v_1, c2_v_2, ..., c2_v_t}.
388 Here c{1|2}_i (i = 1 .. n) are matched counters and c1_u_i (i = 1 .. s) and
389 c2_v_i (i = 1 .. v) are unmatched counters (or counters only existing in)
390 *base profile file* and *test profile file*, respectively.
391 Let sum_1 = c1_1 + c1_2 + ... + c1_n + c1_u_1 + c2_u_2 + ... + c2_u_s, and
392 sum_2 = c2_1 + c2_2 + ... + c2_n + c2_v_1 + c2_v_2 + ... + c2_v_t.
393 *overlap* = min(c1_1/sum_1, c2_1/sum_2) + min(c1_2/sum_1, c2_2/sum_2) + ...
394 + min(c1_n/sum_1, c2_n/sum_2).
396 The result overlap distribution is a percentage number, ranging from 0.0% to
397 100.0%, where 0.0% means there is no overlap and 100.0% means a perfect
400 Here is an example, if *base profile file* has counts of {400, 600}, and
401 *test profile file* has matched counts of {60000, 40000}. The *overlap* is 80%.
406 .. option:: --function=<string>
408 Print details for a function if the function's name contains the given string.
412 Print a summary of command line options.
414 .. option:: --output=<output>, -o
416 Specify the output file name. If *output* is ``-`` or it isn't specified,
417 then the output is sent to standard output.
419 .. option:: --value-cutoff=<n>
421 Show only those functions whose max count values are greater or equal to ``n``.
422 By default, the value-cutoff is set to max of unsigned long long.
426 Only show overlap for the context sensitive profile counts. The default is to show
427 non-context sensitive profile counts.
429 .. program:: llvm-profdata order
439 :program:`llvm-profdata order` [*options*] [*filename*]
444 :program:`llvm-profdata order` uses temporal profiling traces from a profile and
445 finds a function order that reduces the number of page faults for those traces.
446 This output can be directly passed to ``lld`` via ``--symbol-ordering-file=``
447 for ELF or ``-order-file`` for Mach-O. If the traces found in the profile are
448 representative of the real world, then this order should improve startup
456 Print a summary of command line options.
458 .. option:: --output=<output>, -o
460 Specify the output file name. If *output* is ``-`` or it isn't specified,
461 then the output is sent to standard output.
466 :program:`llvm-profdata` returns 1 if the command is omitted or is invalid,
467 if it cannot read input files, or if there is a mismatch between their data.