lldb/docs/use/variable.rst

   1 Variable Formatting
   2 ===================
   3
   4 .. contents::
   5    :local:
   6
   7 LLDB has a data formatters subsystem that allows users to define custom display
   8 options for their variables.
   9
  10 Usually, when you type ``frame variable`` or run some expression LLDB will
  11 automatically choose the way to display your results on a per-type basis, as in
  12 the following example:
  13
  14 ::
  15
  16    (lldb) frame variable
  17    (uint8_t) x = 'a'
  18    (intptr_t) y = 124752287
  19
  20 Note: ``frame variable`` without additional arguments prints the list of
  21 variables of the current frame.
  22
  23 However, in certain cases, you may want to associate a different style to the
  24 display for certain datatypes. To do so, you need to give hints to the debugger
  25 as to how variables should be displayed. The LLDB type command allows you to do
  26 just that.
  27
  28 Using it you can change your visualization to look like this:
  29
  30 ::
  31
  32    (lldb) frame variable
  33    (uint8_t) x = chr='a' dec=65 hex=0x41
  34    (intptr_t) y = 0x76f919f
  35
  36 In addition, some data structures can encode their data in a way that is not
  37 easily readable to the user, in which case a data formatter can be used to
  38 show the data in a human readable way. For example, without a formatter,
  39 printing a ``std::deque<int>`` with the elements ``{2, 3, 4, 5, 6}`` would
  40 result in something like:
  41
  42 ::
  43
  44    (lldb) frame variable a_deque
  45    (std::deque<Foo, std::allocator<int> >) $0 = {
  46       std::_Deque_base<Foo, std::allocator<int> > = {
  47          _M_impl = {
  48             _M_map = 0x000000000062ceb0
  49             _M_map_size = 8
  50             _M_start = {
  51                _M_cur = 0x000000000062cf00
  52                _M_first = 0x000000000062cf00
  53                _M_last = 0x000000000062d2f4
  54                _M_node = 0x000000000062cec8
  55             }
  56             _M_finish = {
  57                _M_cur = 0x000000000062d300
  58                _M_first = 0x000000000062d300
  59                _M_last = 0x000000000062d6f4
  60                _M_node = 0x000000000062ced0
  61             }
  62          }
  63       }
  64    }
  65
  66 which is very hard to make sense of.
  67
  68 Note: ``frame variable <var>`` prints out the variable ``<var>`` in the current
  69 frame.
  70
  71 On the other hand, a proper formatter is able to produce the following output:
  72
  73 ::
  74
  75    (lldb) frame variable a_deque
  76    (std::deque<Foo, std::allocator<int> >) $0 = size=5 {
  77       [0] = 2
  78       [1] = 3
  79       [2] = 4
  80       [3] = 5
  81       [4] = 6
  82    }
  83
  84 which is what the user would expect from a good debugger.
  85
  86 Note: you can also use ``v <var>`` instead of ``frame variable <var>``.
  87
  88 It's worth mentioning that the ``size=5`` string is produced by a summary
  89 provider and the list of children is produced by a synthetic child provider.
  90 More information about these providers is available later in this document.
  91
  92
  93 There are several features related to data visualization: formats, summaries,
  94 filters, synthetic children.
  95
  96 To reflect this, the type command has five subcommands:
  97
  98 ::
  99
 100    type format
 101    type summary
 102    type filter
 103    type synthetic
 104    type category
 105
 106 These commands are meant to bind printing options to types. When variables are
 107 printed, LLDB will first check if custom printing options have been associated
 108 to a variable's type and, if so, use them instead of picking the default
 109 choices.
 110
 111 Each of the commands (except ``type category``) has four subcommands available:
 112
 113 - ``add``: associates a new printing option to one or more types
 114 - ``delete``: deletes an existing association
 115 - ``list``: provides a listing of all associations
 116 - ``clear``: deletes all associations
 117
 118 Type Format
 119 -----------
 120
 121 Type formats enable you to quickly override the default format for displaying
 122 primitive types (the usual basic C/C++/ObjC types: int, float, char, ...).
 123
 124 If for some reason you want all int variables in your program to print out as
 125 hex, you can add a format to the int type.
 126
 127 This is done by typing
 128
 129 ::
 130
 131    (lldb) type format add --format hex int
 132
 133 at the LLDB command line.
 134
 135 The ``--format`` (which you can shorten to -f) option accepts a :doc:`format
 136 name<formatting>`. Then, you provide one or more types to which you want the
 137 new format applied.
 138
 139 A frequent scenario is that your program has a typedef for a numeric type that
 140 you know represents something that must be printed in a certain way. Again, you
 141 can add a format just to that typedef by using type format add with the name
 142 alias.
 143
 144 But things can quickly get hierarchical. Let's say you have a situation like
 145 the following:
 146
 147 ::
 148
 149    typedef int A;
 150    typedef A B;
 151    typedef B C;
 152    typedef C D;
 153
 154 and you want to show all A's as hex, all C's as byte arrays and leave the
 155 defaults untouched for other types (albeit its contrived look, the example is
 156 far from unrealistic in large software systems).
 157
 158 If you simply type
 159
 160 ::
 161
 162    (lldb) type format add -f hex A
 163    (lldb) type format add -f uint8_t[] C
 164
 165 values of type B will be shown as hex and values of type D as byte arrays, as in:
 166
 167 ::
 168
 169    (lldb) frame variable -T
 170    (A) a = 0x00000001
 171    (B) b = 0x00000002
 172    (C) c = {0x03 0x00 0x00 0x00}
 173    (D) d = {0x04 0x00 0x00 0x00}
 174
 175 This is because by default LLDB cascades formats through typedef chains. In
 176 order to avoid that you can use the option -C no to prevent cascading, thus
 177 making the two commands required to achieve your goal:
 178
 179 ::
 180
 181    (lldb) type format add -C no -f hex A
 182    (lldb) type format add -C no -f uint8_t[] C
 183
 184
 185 which provides the desired output:
 186
 187 ::
 188
 189    (lldb) frame variable -T
 190    (A) a = 0x00000001
 191    (B) b = 2
 192    (C) c = {0x03 0x00 0x00 0x00}
 193    (D) d = 4
 194
 195 Note, that qualifiers such as const and volatile will be stripped when matching types for example:
 196
 197 ::
 198
 199    (lldb) frame var x y z
 200    (int) x = 1
 201    (const int) y = 2
 202    (volatile int) z = 4
 203    (lldb) type format add -f hex int
 204    (lldb) frame var x y z
 205    (int) x = 0x00000001
 206    (const int) y = 0x00000002
 207    (volatile int) z = 0x00000004
 208
 209 Two additional options that you will want to look at are --skip-pointers (-p)
 210 and --skip-references (-r). These two options prevent LLDB from applying a
 211 format for type T to values of type T* and T& respectively.
 212
 213 ::
 214
 215    (lldb) type format add -f float32[] int
 216    (lldb) frame variable pointer *pointer -T
 217    (int *) pointer = {1.46991e-39 1.4013e-45}
 218    (int) *pointer = {1.53302e-42}
 219    (lldb) type format add -f float32[] int -p
 220    (lldb) frame variable pointer *pointer -T
 221    (int *) pointer = 0x0000000100100180
 222    (int) *pointer = {1.53302e-42}
 223
 224 While they can be applied to pointers and references, formats will make no
 225 attempt to dereference the pointer and extract the value before applying the
 226 format, which means you are effectively formatting the address stored in the
 227 pointer rather than the pointee value. For this reason, you may want to use the
 228 -p option when defining formats.
 229
 230 If you need to delete a custom format simply type type format delete followed
 231 by the name of the type to which the format applies.Even if you defined the
 232 same format for multiple types on the same command, type format delete will
 233 only remove the format for the type name passed as argument.
 234
 235 To delete ALL formats, use ``type format clear``. To see all the formats
 236 defined, use type format list.
 237
 238 If all you need to do, however, is display one variable in a custom format,
 239 while leaving the others of the same type untouched, you can simply type:
 240
 241 ::
 242
 243    (lldb) frame variable counter -f hex
 244
 245 This has the effect of displaying the value of counter as an hexadecimal
 246 number, and will keep showing it this way until you either pick a different
 247 format or till you let your program run again.
 248
 249 Finally, this is a list of formatting options available out of which you can
 250 pick:
 251
 252 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 253 | **Format name**                               | **Abbreviation** | **Description**                                                          |
 254 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 255 | ``default``                                   |                  | the default LLDB algorithm is used to pick a format                      |
 256 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 257 | ``boolean``                                   | B                | show this as a true/false boolean, using the customary rule that 0 is    |
 258 |                                               |                  | false and everything else is true                                        |
 259 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 260 | ``binary``                                    | b                | show this as a sequence of bits                                          |
 261 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 262 | ``bytes``                                     | y                | show the bytes one after the other                                       |
 263 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 264 | ``bytes with ASCII``                          | Y                | show the bytes, but try to display them as ASCII characters as well      |
 265 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 266 | ``character``                                 | c                | show the bytes as ASCII characters                                       |
 267 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 268 | ``printable character``                       | C                | show the bytes as printable ASCII characters                             |
 269 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 270 | ``complex float``                             | F                | interpret this value as the real and imaginary part of a complex         |
 271 |                                               |                  | floating-point number                                                    |
 272 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 273 | ``c-string``                                  | s                | show this as a 0-terminated C string                                     |
 274 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 275 | ``decimal``                                   | d                | show this as a signed integer number (this does not perform a cast, it   |
 276 |                                               |                  | simply shows the bytes as  an integer with sign)                         |
 277 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 278 | ``enumeration``                               | E                | show this as an enumeration, printing the                                |
 279 |                                               |                  | value's name if available or the integer value otherwise                 |
 280 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 281 | ``hex``                                       | x                | show this as in hexadecimal notation (this does                          |
 282 |                                               |                  | not perform a cast, it simply shows the bytes as hex)                    |
 283 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 284 | ``float``                                     | f                | show this as a floating-point number (this does not perform a cast, it   |
 285 |                                               |                  | simply interprets the bytes as an IEEE754 floating-point value)          |
 286 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 287 | ``octal``                                     | o                | show this in octal notation                                              |
 288 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 289 | ``OSType``                                    | O                | show this as a MacOS OSType                                              |
 290 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 291 | ``unicode16``                                 | U                | show this as UTF-16 characters                                           |
 292 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 293 | ``unicode32``                                 |                  | show this as UTF-32 characters                                           |
 294 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 295 | ``unsigned decimal``                          | u                | show this as an unsigned integer number (this does not perform a cast,   |
 296 |                                               |                  | it simply shows the bytes as unsigned integer)                           |
 297 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 298 | ``pointer``                                   | p                | show this as a native pointer (unless this is really a pointer, the      |
 299 |                                               |                  | resulting address will probably be invalid)                              |
 300 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 301 | ``char[]``                                    |                  | show this as an array of characters                                      |
 302 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 303 | ``int8_t[], uint8_t[]``                       |                  | show this as an array of the corresponding integer type                  |
 304 | ``int16_t[], uint16_t[]``                     |                  |                                                                          |
 305 | ``int32_t[], uint32_t[]``                     |                  |                                                                          |
 306 | ``int64_t[], uint64_t[]``                     |                  |                                                                          |
 307 | ``uint128_t[]``                               |                  |                                                                          |
 308 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 309 | ``float32[], float64[]``                      |                  | show this as an array of the corresponding                               |
 310 |                                               |                  |                       floating-point type                                |
 311 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 312 | ``complex integer``                           | I                | interpret this value as the real and imaginary part of a complex integer |
 313 |                                               |                  | number                                                                   |
 314 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 315 | ``character array``                           | a                | show this as a character array                                           |
 316 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 317 | ``address``                                   | A                | show this as an address target (symbol/file/line + offset), possibly     |
 318 |                                               |                  | also the string this address is pointing to                              |
 319 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 320 | ``hex float``                                 |                  | show this as hexadecimal floating point                                  |
 321 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 322 | ``instruction``                               | i                | show this as an disassembled opcode                                      |
 323 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 324 | ``void``                                      | v                | don't show anything                                                      |
 325 +-----------------------------------------------+------------------+--------------------------------------------------------------------------+
 326
 327 Type Summary
 328 ------------
 329
 330 Type formats work by showing a different kind of display for the value of a
 331 variable. However, they only work for basic types. When you want to display a
 332 class or struct in a custom format, you cannot do that using formats.
 333
 334 A different feature, type summaries, works by extracting information from
 335 classes, structures, ... (aggregate types) and arranging it in a user-defined
 336 format, as in the following example:
 337
 338 before adding a summary...
 339
 340 ::
 341
 342    (lldb) frame variable -T one
 343    (i_am_cool) one = {
 344       (int) x = 3
 345       (float) y = 3.14159
 346       (char) z = 'E'
 347    }
 348
 349 after adding a summary...
 350
 351 ::
 352
 353    (lldb) frame variable one
 354    (i_am_cool) one = int = 3, float = 3.14159, char = 69
 355
 356 There are two ways to use type summaries: the first one is to bind a summary
 357 string to the type; the second is to write a Python script that returns the
 358 string to be used as summary. Both options are enabled by the type summary add
 359 command.
 360
 361 The command to obtain the output shown in the example is:
 362
 363 ::
 364
 365 (lldb) type summary add --summary-string "int = ${var.x}, float = ${var.y}, char = ${var.z%u}" i_am_cool
 366
 367 Initially, we will focus on summary strings, and then describe the Python
 368 binding mechanism.
 369
 370 Summary Strings
 371 ---------------
 372
 373 Summary strings are written using a simple control language, exemplified by the
 374 snippet above. A summary string contains a sequence of tokens that are
 375 processed by LLDB to generate the summary.
 376
 377 Summary strings can contain plain text, control characters and special
 378 variables that have access to information about the current object and the
 379 overall program state.
 380
 381 Plain text is any sequence of characters that doesn't contain a ``{``, ``}``, ``$``,
 382 or ``\`` character, which are the syntax control characters.
 383
 384 The special variables are found in between a "${" prefix, and end with a "}"
 385 suffix. Variables can be a simple name or they can refer to complex objects
 386 that have subitems themselves. In other words, a variable looks like
 387 ``${object}`` or ``${object.child.otherchild}``. A variable can also be
 388 prefixed or suffixed with other symbols meant to change the way its value is
 389 handled. An example is ``${*var.int_pointer[0-3]}``.
 390
 391 Basically, the syntax is the same one described Frame and Thread Formatting
 392 plus additional symbols specific for summary strings. The main of them is
 393 ${var, which is used refer to the variable that a summary is being created for.
 394
 395 The simplest thing you can do is grab a member variable of a class or structure
 396 by typing its expression path. In the previous example, the expression path for
 397 the field float y is simply .y. Thus, to ask the summary string to display y
 398 you would type ${var.y}.
 399
 400 If you have code like the following:
 401
 402 ::
 403
 404    struct A {
 405       int x;
 406       int y;
 407    };
 408    struct B {
 409       A x;
 410       A y;
 411       int *z;
 412    };
 413
 414 the expression path for the y member of the x member of an object of type B
 415 would be .x.y and you would type ``${var.x.y}`` to display it in a summary
 416 string for type B.
 417
 418 By default, a summary defined for type T, also works for types T* and T& (you
 419 can disable this behavior if desired). For this reason, expression paths do not
 420 differentiate between . and ->, and the above expression path .x.y would be
 421 just as good if you were displaying a B*, or even if the actual definition of B
 422 were:
 423
 424 ::
 425
 426    struct B {
 427       A *x;
 428       A y;
 429       int *z;
 430    };
 431
 432 This is unlike the behavior of frame variable which, on the contrary, will
 433 enforce the distinction. As hinted above, the rationale for this choice is that
 434 waiving this distinction enables you to write a summary string once for type T
 435 and use it for both T and T* instances. As a summary string is mostly about
 436 extracting nested members' information, a pointer to an object is just as good
 437 as the object itself for the purpose.
 438
 439 If you need to access the value of the integer pointed to by B::z, you cannot
 440 simply say ${var.z} because that symbol refers to the pointer z. In order to
 441 dereference it and get the pointed value, you should say ``${*var.z}``. The
 442 ``${*var`` tells LLDB to get the object that the expression paths leads to, and
 443 then dereference it. In this example is it equivalent to ``*(bObject.z)`` in
 444 C/C++ syntax. Because ``.`` and ``->`` operators can both be used, there is no
 445 need to have dereferences in the middle of an expression path (e.g. you do not
 446 need to type ``${*(var.x).x}``) to read A::x as contained in ``*(B::x)``. To
 447 achieve that effect you can simply write ``${var.x->x}``, or even
 448 ``${var.x.x}``. The ``*`` operator only binds to the result of the whole
 449 expression path, rather than piecewise, and there is no way to use parentheses
 450 to change that behavior.
 451
 452 Of course, a summary string can contain more than one ${var specifier, and can
 453 use ``${var`` and ``${*var`` specifiers together.
 454
 455 Formatting Summary Elements
 456 ---------------------------
 457
 458 An expression path can include formatting codes. Much like the type formats
 459 discussed previously, you can also customize the way variables are displayed in
 460 summary strings, regardless of the format they have applied to their types. To
 461 do that, you can use %format inside an expression path, as in ${var.x->x%u},
 462 which would display the value of x as an unsigned integer.
 463
 464 You can also use some other special format markers, not available for formats
 465 themselves, but which carry a special meaning when used in this context:
 466
 467 +------------+--------------------------------------------------------------------------+
 468 | **Symbol** | **Description**                                                          |
 469 +------------+--------------------------------------------------------------------------+
 470 | ``Symbol`` | ``Description``                                                          |
 471 +------------+--------------------------------------------------------------------------+
 472 | ``%S``     | Use this object's summary (the default for aggregate types)              |
 473 +------------+--------------------------------------------------------------------------+
 474 | ``%V``     | Use this object's value (the default for non-aggregate types)            |
 475 +------------+--------------------------------------------------------------------------+
 476 | ``%@``     | Use a language-runtime specific description (for C++ this does nothing,  |
 477 |            |                     for Objective-C it calls the NSPrintForDebugger API) |
 478 +------------+--------------------------------------------------------------------------+
 479 | ``%L``     | Use this object's location (memory address, register name, ...)          |
 480 +------------+--------------------------------------------------------------------------+
 481 | ``%#``     | Use the count of the children of this object                             |
 482 +------------+--------------------------------------------------------------------------+
 483 | ``%T``     | Use this object's datatype name                                          |
 484 +------------+--------------------------------------------------------------------------+
 485 | ``%N``     | Print the variable's basename                                            |
 486 +------------+--------------------------------------------------------------------------+
 487 | ``%>``     | Print the expression path for this item                                  |
 488 +------------+--------------------------------------------------------------------------+
 489
 490 Since lldb 3.7.0, you can also specify ``${script.var:pythonFuncName}``.
 491
 492 It is expected that the function name you use specifies a function whose
 493 signature is the same as a Python summary function. The return string from the
 494 function will be placed verbatim in the output.
 495
 496 You cannot use element access, or formatting symbols, in combination with this
 497 syntax. For example the following:
 498
 499 ::
 500
 501    ${script.var.element[0]:myFunctionName%@}
 502
 503 is not valid and will cause the summary to fail to evaluate.
 504
 505
 506 Element Inlining
 507 ----------------
 508
 509 Option --inline-children (-c) to type summary add tells LLDB not to look for a summary string, but instead to just print a listing of all the object's children on one line.
 510
 511 As an example, given a type pair:
 512
 513 ::
 514
 515    (lldb) frame variable --show-types a_pair
 516    (pair) a_pair = {
 517       (int) first = 1;
 518       (int) second = 2;
 519    }
 520
 521 If one types the following commands:
 522
 523 ::
 524
 525    (lldb) type summary add --inline-children pair
 526
 527 the output becomes:
 528
 529 ::
 530
 531    (lldb) frame variable a_pair
 532    (pair) a_pair = (first=1, second=2)
 533
 534
 535 Of course, one can obtain the same effect by typing
 536
 537 ::
 538
 539    (lldb) type summary add pair --summary-string "(first=${var.first}, second=${var.second})"
 540
 541 While the final result is the same, using --inline-children can often save
 542 time. If one does not need to see the names of the variables, but just their
 543 values, the option --omit-names (-O, uppercase letter o), can be combined with
 544 --inline-children to obtain:
 545
 546 ::
 547
 548    (lldb) frame variable a_pair
 549    (pair) a_pair = (1, 2)
 550
 551 which is of course the same as typing
 552
 553 ::
 554
 555    (lldb) type summary add pair --summary-string "(${var.first}, ${var.second})"
 556
 557 Bitfields And Array Syntax
 558 --------------------------
 559
 560 Sometimes, a basic type's value actually represents several different values
 561 packed together in a bitfield.
 562
 563 With the classical view, there is no way to look at them. Hexadecimal display
 564 can help, but if the bits actually span nibble boundaries, the help is limited.
 565
 566 Binary view would show it all without ambiguity, but is often too detailed and
 567 hard to read for real-life scenarios.
 568
 569 To cope with the issue, LLDB supports native bitfield formatting in summary
 570 strings. If your expression paths leads to a so-called scalar type (the usual
 571 int, float, char, double, short, long, long long, double, long double and
 572 unsigned variants), you can ask LLDB to only grab some bits out of the value
 573 and display them in any format you like. If you only need one bit you can use
 574 the [n], just like indexing an array. To extract multiple bits, you can use a
 575 slice-like syntax: [n-m], e.g.
 576
 577 ::
 578
 579    (lldb) frame variable float_point
 580    (float) float_point = -3.14159
 581
 582 ::
 583
 584    (lldb) type summary add --summary-string "Sign: ${var[31]%B} Exponent: ${var[30-23]%x} Mantissa: ${var[0-22]%u}" float
 585    (lldb) frame variable float_point
 586    (float) float_point = -3.14159 Sign: true Exponent: 0x00000080 Mantissa: 4788184
 587
 588 In this example, LLDB shows the internal representation of a float variable by
 589 extracting bitfields out of a float object.
 590
 591 When typing a range, the extremes n and m are always included, and the order of
 592 the indices is irrelevant.
 593
 594 LLDB also allows to use a similar syntax to display array members inside a summary string. For instance, you may want to display all arrays of a given type using a more compact notation than the default, and then just delve into individual array members that prove interesting to your debugging task. You can tell LLDB to format arrays in special ways, possibly independent of the way the array members' datatype is formatted.
 595 e.g.
 596
 597 ::
 598
 599    (lldb) frame variable sarray
 600    (Simple [3]) sarray = {
 601       [0] = {
 602          x = 1
 603          y = 2
 604          z = '\x03'
 605       }
 606       [1] = {
 607          x = 4
 608          y = 5
 609          z = '\x06'
 610       }
 611       [2] = {
 612          x = 7
 613          y = 8
 614          z = '\t'
 615       }
 616    }
 617
 618    (lldb) type summary add --summary-string "${var[].x}" "Simple [3]"
 619
 620    (lldb) frame variable sarray
 621    (Simple [3]) sarray = [1,4,7]
 622
 623 The [] symbol amounts to: if var is an array and I know its size, apply this summary string to every element of the array. Here, we are asking LLDB to display .x for every element of the array, and in fact this is what happens. If you find some of those integers anomalous, you can then inspect that one item in greater detail, without the array format getting in the way:
 624
 625 ::
 626
 627    (lldb) frame variable sarray[1]
 628    (Simple) sarray[1] = {
 629       x = 4
 630       y = 5
 631       z = '\x06'
 632    }
 633
 634 You can also ask LLDB to only print a subset of the array range by using the
 635 same syntax used to extract bit for bitfields:
 636
 637 ::
 638
 639    (lldb) type summary add --summary-string "${var[1-2].x}" "Simple [3]"
 640
 641    (lldb) frame variable sarray
 642    (Simple [3]) sarray = [4,7]
 643
 644 If you are dealing with a pointer that you know is an array, you can use this
 645 syntax to display the elements contained in the pointed array instead of just
 646 the pointer value. However, because pointers have no notion of their size, the
 647 empty brackets [] operator does not work, and you must explicitly provide
 648 higher and lower bounds.
 649
 650 In general, LLDB needs the square brackets ``operator []`` in order to handle
 651 arrays and pointers correctly, and for pointers it also needs a range. However,
 652 a few special cases are defined to make your life easier:
 653
 654 you can print a 0-terminated string (C-string) using the %s format, omitting
 655 square brackets, as in:
 656
 657 ::
 658
 659    (lldb) type summary add --summary-string "${var%s}" "char *"
 660
 661 This syntax works for char* as well as for char[] because LLDB can rely on the
 662 final \0 terminator to know when the string has ended.
 663
 664 LLDB has default summary strings for char* and char[] that use this special
 665 case. On debugger startup, the following are defined automatically:
 666
 667 ::
 668
 669    (lldb) type summary add --summary-string "${var%s}" "char *"
 670    (lldb) type summary add --summary-string "${var%s}" -x "char \[[0-9]+]"
 671
 672 any of the array formats (int8_t[], float32{}, ...), and the y, Y and a formats
 673 work to print an array of a non-aggregate type, even if square brackets are
 674 omitted.
 675
 676 ::
 677
 678    (lldb) type summary add --summary-string "${var%int32_t[]}" "int [10]"
 679
 680 This feature, however, is not enabled for pointers because there is no way for
 681 LLDB to detect the end of the pointed data.
 682
 683 This also does not work for other formats (e.g. boolean), and you must specify
 684 the square brackets operator to get the expected output.
 685
 686 Python Scripting
 687 ----------------
 688
 689 Most of the times, summary strings prove good enough for the job of summarizing
 690 the contents of a variable. However, as soon as you need to do more than
 691 picking some values and rearranging them for display, summary strings stop
 692 being an effective tool. This is because summary strings lack the power to
 693 actually perform any kind of computation on the value of variables.
 694
 695 To solve this issue, you can bind some Python scripting code as a summary for
 696 your datatype, and that script has the ability to both extract children
 697 variables as the summary strings do and to perform active computation on the
 698 extracted values. As a small example, let's say we have a Rectangle class:
 699
 700 ::
 701
 702
 703    class Rectangle
 704    {
 705    private:
 706       int height;
 707       int width;
 708    public:
 709       Rectangle() : height(3), width(5) {}
 710       Rectangle(int H) : height(H), width(H*2-1) {}
 711       Rectangle(int H, int W) : height(H), width(W) {}
 712       int GetHeight() { return height; }
 713       int GetWidth() { return width; }
 714    };
 715
 716 Summary strings are effective to reduce the screen real estate used by the
 717 default viewing mode, but are not effective if we want to display the area and
 718 perimeter of Rectangle objects
 719
 720 To obtain this, we can simply attach a small Python script to the Rectangle
 721 class, as shown in this example:
 722
 723 ::
 724
 725    (lldb) type summary add -P Rectangle
 726    Enter your Python command(s). Type 'DONE' to end.
 727    def function (valobj,internal_dict,options):
 728       height_val = valobj.GetChildMemberWithName('height')
 729       width_val = valobj.GetChildMemberWithName('width')
 730       height = height_val.GetValueAsUnsigned(0)
 731       width = width_val.GetValueAsUnsigned(0)
 732       area = height*width
 733       perimeter = 2*(height + width)
 734       return 'Area: ' + str(area) + ', Perimeter: ' + str(perimeter)
 735       DONE
 736    (lldb) frame variable
 737    (Rectangle) r1 = Area: 20, Perimeter: 18
 738    (Rectangle) r2 = Area: 72, Perimeter: 36
 739    (Rectangle) r3 = Area: 16, Perimeter: 16
 740
 741 In order to write effective summary scripts, you need to know the LLDB public
 742 API, which is the way Python code can access the LLDB object model. For further
 743 details on the API you should look at the LLDB API reference documentation.
 744
 745
 746 As a brief introduction, your script is encapsulated into a function that is
 747 passed two parameters: ``valobj`` and ``internal_dict``.
 748
 749 ``internal_dict`` is an internal support parameter used by LLDB and you should
 750 not touch it.
 751
 752 ``valobj`` is the object encapsulating the actual variable being displayed, and
 753 its type is `SBValue`. Out of the many possible operations on an `SBValue`, the
 754 basic one is retrieve the children objects it contains (essentially, the fields
 755 of the object wrapped by it), by calling ``GetChildMemberWithName()``, passing
 756 it the child's name as a string.
 757
 758 If the variable has a value, you can ask for it, and return it as a string
 759 using ``GetValue()``, or as a signed/unsigned number using
 760 ``GetValueAsSigned()``, ``GetValueAsUnsigned()``. It is also possible to
 761 retrieve an `SBData` object by calling ``GetData()`` and then read the object's
 762 contents out of the `SBData`.
 763
 764 If you need to delve into several levels of hierarchy, as you can do with
 765 summary strings, you can use the method ``GetValueForExpressionPath()``,
 766 passing it an expression path just like those you could use for summary strings
 767 (one of the differences is that dereferencing a pointer does not occur by
 768 prefixing the path with a ``*```, but by calling the ``Dereference()`` method
 769 on the returned `SBValue`). If you need to access array slices, you cannot do
 770 that (yet) via this method call, and you must use ``GetChildAtIndex()``
 771 querying it for the array items one by one. Also, handling custom formats is
 772 something you have to deal with on your own.
 773
 774 ``options`` Python summary formatters can optionally define this
 775 third argument, which is an object of type ``lldb.SBTypeSummaryOptions``,
 776 allowing for a few customizations of the result. The decision to
 777 adopt or not this third argument - and the meaning of options
 778 thereof - is up to the individual formatter's writer.
 779
 780 Other than interactively typing a Python script there are two other ways for
 781 you to input a Python script as a summary:
 782
 783 - using the --python-script option to type summary add and typing the script
 784   code as an option argument; as in:
 785
 786 ::
 787
 788    (lldb) type summary add --python-script "height = valobj.GetChildMemberWithName('height').GetValueAsUnsigned(0);width = valobj.GetChildMemberWithName('width').GetValueAsUnsigned(0); return 'Area: %d' % (height*width)" Rectangle
 789
 790
 791 - using the --python-function (-F) option to type summary add and giving the
 792   name of a Python function with the correct prototype. Most probably, you will
 793   define (or have already defined) the function in the interactive interpreter,
 794   or somehow loaded it from a file, using the command script import command.
 795   LLDB will emit a warning if it is unable to find the function you passed, but
 796   will still register the binding.
 797
 798 Regular Expression Typenames
 799 ----------------------------
 800
 801 As you noticed, in order to associate the custom summary string to the array
 802 types, one must give the array size as part of the typename. This can long
 803 become tiresome when using arrays of different sizes, Simple [3], Simple [9],
 804 Simple [12], ...
 805
 806 If you use the -x option, type names are treated as regular expressions instead
 807 of type names. This would let you rephrase the above example for arrays of type
 808 Simple [3] as:
 809
 810 ::
 811
 812    (lldb) type summary add --summary-string "${var[].x}" -x "Simple \[[0-9]+\]"
 813    (lldb) frame variable
 814    (Simple [3]) sarray = [1,4,7]
 815    (Simple [2]) sother = [3,6]
 816
 817 The above scenario works for Simple [3] as well as for any other array of
 818 Simple objects.
 819
 820 While this feature is mostly useful for arrays, you could also use regular
 821 expressions to catch other type sets grouped by name. However, as regular
 822 expression matching is slower than normal name matching, LLDB will first try to
 823 match by name in any way it can, and only when this fails, will it resort to
 824 regular expression matching.
 825
 826 One of the ways LLDB uses this feature internally, is to match the names of STL
 827 container classes, regardless of the template arguments provided. The details
 828 for this are found at FormatManager.cpp
 829
 830 The regular expression language used by LLDB is the POSIX extended language, as
 831 defined by the Single UNIX Specification, of which macOS is a compliant
 832 implementation.
 833
 834 Names Summaries
 835 ---------------
 836
 837 For a given type, there may be different meaningful summary representations.
 838 However, currently, only one summary can be associated to a type at each
 839 moment. If you need to temporarily override the association for a variable,
 840 without changing the summary string for to its type, you can use named
 841 summaries.
 842
 843 Named summaries work by attaching a name to a summary when creating it. Then,
 844 when there is a need to attach the summary to a variable, the frame variable
 845 command, supports a --summary option that tells LLDB to use the named summary
 846 given instead of the default one.
 847
 848 ::
 849
 850    (lldb) type summary add --summary-string "x=${var.integer}" --name NamedSummary
 851    (lldb) frame variable one
 852    (i_am_cool) one = int = 3, float = 3.14159, char = 69
 853    (lldb) frame variable one --summary NamedSummary
 854    (i_am_cool) one = x=3
 855
 856 When defining a named summary, binding it to one or more types becomes
 857 optional. Even if you bind the named summary to a type, and later change the
 858 summary string for that type, the named summary will not be changed by that.
 859 You can delete named summaries by using the type summary delete command, as if
 860 the summary name was the datatype that the summary is applied to
 861
 862 A summary attached to a variable using the --summary option, has the same
 863 semantics that a custom format attached using the -f option has: it stays
 864 attached till you attach a new one, or till you let your program run again.
 865
 866 Synthetic Children
 867 ------------------
 868
 869 Summaries work well when one is able to navigate through an expression path. In
 870 order for LLDB to do so, appropriate debugging information must be available.
 871
 872 Some types are opaque, i.e. no knowledge of their internals is provided. When
 873 that's the case, expression paths do not work correctly.
 874
 875 In other cases, the internals are available to use in expression paths, but
 876 they do not provide a user-friendly representation of the object's value.
 877
 878 For instance, consider an STL vector, as implemented by the GNU C++ Library:
 879
 880 ::
 881
 882    (lldb) frame variable numbers -T
 883    (std::vector<int>) numbers = {
 884       (std::_Vector_base<int, std::allocator<int> >) std::_Vector_base<int, std::allocator<int> > = {
 885          (std::_Vector_base<int, std::allocator&tl;int> >::_Vector_impl) _M_impl = {
 886                (int *) _M_start = 0x00000001001008a0
 887                (int *) _M_finish = 0x00000001001008a8
 888                (int *) _M_end_of_storage = 0x00000001001008a8
 889          }
 890       }
 891    }
 892
 893 Here, you can see how the type is implemented, and you can write a summary for
 894 that implementation but that is not going to help you infer what items are
 895 actually stored in the vector.
 896
 897 What you would like to see is probably something like:
 898
 899 ::
 900
 901    (lldb) frame variable numbers -T
 902    (std::vector<int>) numbers = {
 903       (int) [0] = 1
 904       (int) [1] = 12
 905       (int) [2] = 123
 906       (int) [3] = 1234
 907    }
 908
 909 Synthetic children are a way to get that result.
 910
 911 The feature is based upon the idea of providing a new set of children for a
 912 variable that replaces the ones available by default through the debug
 913 information. In the example, we can use synthetic children to provide the
 914 vector items as children for the std::vector object.
 915
 916 In order to create synthetic children, you need to provide a Python class that
 917 adheres to a given interface (the word is italicized because Python has no
 918 explicit notion of interface, by that word we mean a given set of methods must
 919 be implemented by the Python class):
 920
 921 .. code-block:: python
 922
 923    class SyntheticChildrenProvider:
 924       def __init__(self, valobj, internal_dict):
 925          this call should initialize the Python object using valobj as the variable to provide synthetic children for
 926       def num_children(self):
 927          this call should return the number of children that you want your object to have
 928       def get_child_index(self,name):
 929          this call should return the index of the synthetic child whose name is given as argument
 930       def get_child_at_index(self,index):
 931          this call should return a new LLDB SBValue object representing the child at the index given as argument
 932       def update(self):
 933          this call should be used to update the internal state of this Python object whenever the state of the variables in LLDB changes.[1]
 934          Also, this method is invoked before any other method in the interface.
 935       def has_children(self):
 936          this call should return True if this object might have children, and False if this object can be guaranteed not to have children.[2]
 937       def get_value(self):
 938          this call can return an SBValue to be presented as the value of the synthetic value under consideration.[3]
 939
 940 As a warning, exceptions that are thrown by python formatters are caught
 941 silently by LLDB and should be handled appropriately by the formatter itself.
 942 Being more specific, in case of exceptions, LLDB might assume that the given
 943 object has no children or it might skip printing some children, as they are
 944 printed one by one.
 945
 946 [1] This method is optional. Also, a boolean value must be returned (since lldb
 947 3.1.0). If ``False`` is returned, then whenever the process reaches a new stop,
 948 this method will be invoked again to generate an updated list of the children
 949 for a given variable. Otherwise, if ``True`` is returned, then the value is
 950 cached and this method won't be called again, effectively freezing the state of
 951 the value in subsequent stops. Beware that returning ``True`` incorrectly could
 952 show misleading information to the user.
 953
 954 [2] This method is optional (since lldb 3.2.0). While implementing it in terms
 955 of num_children is acceptable, implementors are encouraged to look for
 956 optimized coding alternatives whenever reasonable.
 957
 958 [3] This method is optional (since lldb 3.5.2). The `SBValue` you return here
 959 will most likely be a numeric type (int, float, ...) as its value bytes will be
 960 used as-if they were the value of the root `SBValue` proper.  As a shortcut for
 961 this, you can inherit from lldb.SBSyntheticValueProvider, and just define
 962 get_value as other methods are defaulted in the superclass as returning default
 963 no-children responses.
 964
 965 If a synthetic child provider supplies a special child named
 966 ``$$dereference$$`` then it will be used when evaluating ``operator *`` and
 967 ``operator ->`` in the frame variable command and related SB API
 968 functions. It is possible to declare this synthetic child without
 969 including it in the range of children displayed by LLDB. For example,
 970 this subset of a synthetic children provider class would allow the
 971 synthetic value to be dereferenced without actually showing any
 972 synthtic children in the UI:
 973
 974 .. code-block:: python
 975
 976       class SyntheticChildrenProvider:
 977           [...]
 978           def num_children(self):
 979               return 0
 980           def get_child_index(self, name):
 981               if name == '$$dereference$$':
 982                   return 0
 983               return -1
 984           def get_child_at_index(self, index):
 985               if index == 0:
 986                   return <valobj resulting from dereference>
 987               return None
 988
 989
 990 For examples of how synthetic children are created, you are encouraged to look
 991 at examples/synthetic in the LLDB trunk. Please, be aware that the code in
 992 those files (except bitfield/) is legacy code and is not maintained. You may
 993 especially want to begin looking at this example to get a feel for this
 994 feature, as it is a very easy and well commented example.
 995
 996 The design pattern consistently used in synthetic providers shipping with LLDB
 997 is to use the __init__ to store the `SBValue` instance as a part of self. The
 998 update function is then used to perform the actual initialization. Once a
 999 synthetic children provider is written, one must load it into LLDB before it
1000 can be used. Currently, one can use the LLDB script command to type Python code
1001 interactively, or use the command script import fileName command to load Python
1002 code from a Python module (ordinary rules apply to importing modules this way).
1003 A third option is to type the code for the provider class interactively while
1004 adding it.
1005
1006 For example, let's pretend we have a class Foo for which a synthetic children
1007 provider class Foo_Provider is available, in a Python module contained in file
1008 ~/Foo_Tools.py. The following interaction sets Foo_Provider as a synthetic
1009 children provider in LLDB:
1010
1011 ::
1012
1013    (lldb) command script import ~/Foo_Tools.py
1014    (lldb) type synthetic add Foo --python-class Foo_Tools.Foo_Provider
1015    (lldb) frame variable a_foo
1016    (Foo) a_foo = {
1017       x = 1
1018       y = "Hello world"
1019    }
1020
1021 LLDB has synthetic children providers for a core subset of STL classes, both in
1022 the version provided by libstdcpp and by libcxx, as well as for several
1023 Foundation classes.
1024
1025 Synthetic children extend summary strings by enabling a new special variable:
1026 ``${svar``.
1027
1028 This symbol tells LLDB to refer expression paths to the synthetic children
1029 instead of the real ones. For instance,
1030
1031 ::
1032
1033    (lldb) type summary add --expand -x "std::vector<" --summary-string "${svar%#} items"
1034    (lldb) frame variable numbers
1035    (std::vector<int>) numbers = 4 items {
1036       (int) [0] = 1
1037       (int) [1] = 12
1038       (int) [2] = 123
1039       (int) [3] = 1234
1040    }
1041
1042 It's important to mention that LLDB invokes the synthetic child provider before
1043 invoking the summary string provider, which allows the latter to have access to
1044 the actual displayable children. This applies to both inlined summary strings
1045 and python-based summary providers.
1046
1047
1048 As a warning, when programmatically accessing the children or children count of
1049 a variable that has a synthetic child provider, notice that LLDB hides the
1050 actual raw children. For example, suppose we have a ``std::vector``, which has
1051 an actual in-memory property ``__begin`` marking the beginning of its data.
1052 After the synthetic child provider is executed, the ``std::vector`` variable
1053 won't show ``__begin`` as child anymore, even through the SB API. It will have
1054 instead the children calculated by the provider. In case the actual raw
1055 children are needed, a call to ``value.GetNonSyntheticValue()`` is enough to
1056 get a raw version of the value. It is import to remember this when implementing
1057 summary string providers, as they run after the synthetic child provider.
1058
1059
1060 In some cases, if LLDB is unable to use the real object to get a child
1061 specified in an expression path, it will automatically refer to the synthetic
1062 children. While in summaries it is best to always use ${svar to make your
1063 intentions clearer, interactive debugging can benefit from this behavior, as
1064 in:
1065
1066 ::
1067
1068    (lldb) frame variable numbers[0] numbers[1]
1069    (int) numbers[0] = 1
1070    (int) numbers[1] = 12
1071
1072 Unlike many other visualization features, however, the access to synthetic
1073 children only works when using frame variable, and is not supported in
1074 expression:
1075
1076 ::
1077
1078    (lldb) expression numbers[0]
1079    Error [IRForTarget]: Call to a function '_ZNSt33vector<int, std::allocator<int> >ixEm' that is not present in the target
1080    error: Couldn't convert the expression to DWARF
1081
1082 The reason for this is that classes might have an overloaded ``operator []``,
1083 or other special provisions and the expression command chooses to ignore
1084 synthetic children in the interest of equivalency with code you asked to have
1085 compiled from source.
1086
1087 Filters
1088 -------
1089
1090 Filters are a solution to the display of complex classes. At times, classes
1091 have many member variables but not all of these are actually necessary for the
1092 user to see.
1093
1094 A filter will solve this issue by only letting the user see those member
1095 variables they care about. Of course, the equivalent of a filter can be
1096 implemented easily using synthetic children, but a filter lets you get the job
1097 done without having to write Python code.
1098
1099 For instance, if your class Foobar has member variables named A thru Z, but you
1100 only need to see the ones named B, H and Q, you can define a filter:
1101
1102 ::
1103
1104    (lldb) type filter add Foobar --child B --child H --child Q
1105    (lldb) frame variable a_foobar
1106    (Foobar) a_foobar = {
1107       (int) B = 1
1108       (char) H = 'H'
1109       (std::string) Q = "Hello world"
1110    }
1111
1112 Callback-based type matching
1113 ----------------------------
1114
1115 Even though regular expression matching works well for the vast majority of data
1116 formatters (you normally know the name of the type you're writing a formatter
1117 for), there are some cases where it's useful to look at the type before deciding
1118 what formatter to apply.
1119
1120 As an example scenario, imagine we have a code generator that produces some
1121 classes that inherit from a common ``GeneratedObject`` class, and we have a
1122 summary function and a synthetic child provider that work for all
1123 ``GeneratedObject`` instances (they all follow the same pattern). However, there
1124 is no common pattern in the name of these classes, so we can't register the
1125 formatter neither by name nor by regular expression.
1126
1127 In that case, you can write a recognizer function like this:
1128
1129 ::
1130
1131    def is_generated_object(sbtype, internal_dict):
1132      for base in sbtype.get_bases_array():
1133        if base.GetName() == "GeneratedObject"
1134          return True
1135      return False
1136
1137 And pass this function to ``type summary add`` and ``type synthetic add`` using
1138 the flag ``--recognizer-function``.
1139
1140 ::
1141
1142    (lldb) type summary add --expand --python-function my_summary_function --recognizer-function is_generated_object
1143    (lldb) type synthetic add --python-class my_child_provider --recognizer-function is_generated_object
1144
1145 Objective-C Dynamic Type Discovery
1146 ----------------------------------
1147
1148 When doing Objective-C development, you may notice that some of your variables
1149 come out as of type id (for instance, items extracted from NSArray). By
1150 default, LLDB will not show you the real type of the object. it can actually
1151 dynamically discover the type of an Objective-C variable, much like the runtime
1152 itself does when invoking a selector. In order to be shown the result of that
1153 discovery that, however, a special option to frame variable or expression is
1154 required: ``--dynamic-type``.
1155
1156
1157 ``--dynamic-type`` can have one of three values:
1158
1159 - ``no-dynamic-values``: the default, prevents dynamic type discovery
1160 - ``no-run-target``: enables dynamic type discovery as long as running code on
1161   the target is not required
1162 - ``run-target``: enables code execution on the target in order to perform
1163   dynamic type discovery
1164
1165 If you specify a value of either no-run-target or run-target, LLDB will detect
1166 the dynamic type of your variables and show the appropriate formatters for
1167 them. As an example:
1168
1169 ::
1170
1171    (lldb) expr @"Hello"
1172    (NSString *) $0 = 0x00000001048000b0 @"Hello"
1173    (lldb) expr -d no-run @"Hello"
1174    (__NSCFString *) $1 = 0x00000001048000b0 @"Hello"
1175
1176 Because LLDB uses a detection algorithm that does not need to invoke any
1177 functions on the target process, no-run-target is enough for this to work.
1178
1179 As a side note, the summary for NSString shown in the example is built right
1180 into LLDB. It was initially implemented through Python (the code is still
1181 available for reference at CFString.py). However, this is out of sync with the
1182 current implementation of the NSString formatter (which is a C++ function
1183 compiled into the LLDB core).
1184
1185 Categories
1186 ----------
1187
1188 Categories are a way to group related formatters. For instance, LLDB itself
1189 groups the formatters for the libstdc++ types in a category named
1190 gnu-libstdc++. Basically, categories act like containers in which to store
1191 formatters for a same library or OS release.
1192
1193 By default, several categories are created in LLDB:
1194
1195 - default: this is the category where every formatter ends up, unless another category is specified
1196 - objc: formatters for basic and common Objective-C types that do not specifically depend on macOS
1197 - gnu-libstdc++: formatters for std::string, std::vector, std::list and std::map as implemented by libstdcpp
1198 - libcxx: formatters for std::string, std::vector, std::list and std::map as implemented by libcxx
1199 - system: truly basic types for which a formatter is required
1200 - AppKit: Cocoa classes
1201 - CoreFoundation: CF classes
1202 - CoreGraphics: CG classes
1203 - CoreServices: CS classes
1204 - VectorTypes: compact display for several vector types
1205
1206 If you want to use a custom category for your formatters, all the type ... add
1207 provide a --category (-w) option, that names the category to add the formatter
1208 to. To delete the formatter, you then have to specify the correct category.
1209
1210 Categories can be in one of two states: enabled and disabled. A category is
1211 initially disabled, and can be enabled using the type category enable command.
1212 To disable an enabled category, the command to use is type category disable.
1213
1214 The order in which categories are enabled or disabled is significant, in that
1215 LLDB uses that order when looking for formatters. Therefore, when you enable a
1216 category, it becomes the second one to be searched (after default, which always
1217 stays on top of the list). The default categories are enabled in such a way
1218 that the search order is:
1219
1220 - default
1221 - objc
1222 - CoreFoundation
1223 - AppKit
1224 - CoreServices
1225 - CoreGraphics
1226 - gnu-libstdc++
1227 - libcxx
1228 - VectorTypes
1229 - system
1230
1231 As said, gnu-libstdc++ and libcxx contain formatters for C++ STL data types.
1232 system contains formatters for char* and char[], which reflect the behavior of
1233 older versions of LLDB which had built-in formatters for these types. Because
1234 now these are formatters, you can even replace them with your own if so you
1235 wish.
1236
1237 There is no special command to create a category. When you place a formatter in
1238 a category, if that category does not exist, it is automatically created. For
1239 instance,
1240
1241 ::
1242
1243    (lldb) type summary add Foobar --summary-string "a foobar" --category newcategory
1244
1245 automatically creates a (disabled) category named newcategory.
1246
1247 Another way to create a new (empty) category, is to enable it, as in:
1248
1249 ::
1250
1251    (lldb) type category enable newcategory
1252
1253 However, in this case LLDB warns you that enabling an empty category has no
1254 effect. If you add formatters to the category after enabling it, they will be
1255 honored. But an empty category per se does not change the way any type is
1256 displayed. The reason the debugger warns you is that enabling an empty category
1257 might be a typo, and you effectively wanted to enable a similarly-named but
1258 not-empty category.
1259
1260 Finding Formatters 101
1261 ----------------------
1262
1263 Searching for a formatter (including formats, since lldb 3.4.0) given a
1264 variable goes through a rather intricate set of rules. Namely, what happens is
1265 that LLDB starts looking in each enabled category, according to the order in
1266 which they were enabled (latest enabled first). In each category, LLDB does the
1267 following:
1268
1269 - If there is a formatter for the type of the variable, use it
1270 - If this object is a pointer, and there is a formatter for the pointee type
1271   that does not skip pointers, use it
1272 - If this object is a reference, and there is a formatter for the referred type
1273   that does not skip references, use it
1274 - If this object is an Objective-C class and dynamic types are enabled, look
1275   for a formatter for the dynamic type of the object. If dynamic types are
1276   disabled, or the search failed, look for a formatter for the declared type of
1277   the object
1278 - If this object's type is a typedef, go through typedef hierarchy (LLDB might
1279   not be able to do this if the compiler has not emitted enough information. If
1280   the required information to traverse typedef hierarchies is missing, type
1281   cascading will not work. The clang compiler, part of the LLVM project, emits
1282   the correct debugging information for LLDB to cascade). If at any level of
1283   the hierarchy there is a valid formatter that can cascade, use it.
1284 - If everything has failed, repeat the above search, looking for regular
1285   expressions instead of exact matches
1286
1287 If any of those attempts returned a valid formatter to be used, that one is
1288 used, and the search is terminated (without going to look in other categories).
1289 If nothing was found in the current category, the next enabled category is
1290 scanned according to the same algorithm. If there are no more enabled
1291 categories, the search has failed.
1292
1293 **Warning**: previous versions of LLDB defined cascading to mean not only going
1294 through typedef chains, but also through inheritance chains. This feature has
1295 been removed since it significantly degrades performance. You need to set up
1296 your formatters for every type in inheritance chains to which you want the
1297 formatter to apply.