doc/il.txt

   1                 ===========================
   2                  QBE Intermediate Language
   3                 ===========================
   4
   5
   6
   7 - Table of Contents
   8 -------------------
   9
  10   1. <@ Basic Concepts >
  11       * <@ Input Files >
  12       * <@ BNF Notation >
  13       * <@ Sigils >
  14   2. <@ Types >
  15       * <@ Simple Types >
  16       * <@ Subtyping >
  17   3. <@ Constants >
  18   4. <@ Definitions >
  19       * <@ Aggregate Types >
  20       * <@ Data >
  21       * <@ Functions >
  22   5. <@ Control >
  23       * <@ Blocks >
  24       * <@ Jumps >
  25   6. <@ Instructions >
  26       * <@ Arithmetic and Bits >
  27       * <@ Memory >
  28       * <@ Comparisons >
  29       * <@ Conversions >
  30       * <@ Cast and Copy >
  31       * <@ Call >
  32       * <@ Variadic >
  33       * <@ Phi >
  34   7. <@ Instructions Index >
  35
  36 - 1. Basic Concepts
  37 -------------------
  38
  39 The intermediate language (IL) is a higher-level language
  40 than the machine's assembly language.  It smoothes most
  41 of the irregularities of the underlying hardware and
  42 allows an infinite number of temporaries to be used.
  43 This higher abstraction level allows frontend programmers
  44 to focus on language design issues.
  45
  46 ~ Input Files
  47 ~~~~~~~~~~~~~
  48
  49 The intermediate language is provided to QBE as text.
  50 Usually, one file is generated per each compilation unit from
  51 the frontend input language.  An IL file is a sequence of
  52 <@ Definitions > for data, functions, and types.  Once
  53 processed by QBE, the resulting file can be assembled and
  54 linked using a standard toolchain (e.g., GNU binutils).
  55
  56 Here is a complete "Hello World" IL file which defines a
  57 function that prints to the screen.  Since the string is
  58 not a first class object (only the pointer is) it is
  59 defined outside the function's body.  Comments start with
  60 a # character and finish with the end of the line.
  61
  62     # Define the string constant.
  63     data $str = { b "hello world", b 0 }
  64
  65     function w $main() {
  66     @start
  67             # Call the puts function with $str as argument.
  68             %r =w call $puts(l $str)
  69             ret 0
  70     }
  71
  72 If you have read the LLVM language reference, you might
  73 recognize the example above.  In comparison, QBE makes a
  74 much lighter use of types and the syntax is terser.
  75
  76 ~ BNF Notation
  77 ~~~~~~~~~~~~~~
  78
  79 The language syntax is vaporously described in the sections
  80 below using BNF syntax.  The different BNF constructs used
  81 are listed below.
  82
  83   * Keywords are enclosed between quotes;
  84   * `... | ...` expresses disjunctions;
  85   * `[ ... ]` marks some syntax as optional;
  86   * `( ... ),` designates a comma-separated list of the
  87     enclosed syntax;
  88   * `...*` and `...+` are used for arbitrary and
  89     at-least-once repetition respectively.
  90
  91 ~ Sigils
  92 ~~~~~~~~
  93
  94 The intermediate language makes heavy use of sigils, all
  95 user-defined names are prefixed with a sigil.  This is
  96 to avoid keyword conflicts, and also to quickly spot the
  97 scope and nature of identifiers.
  98
  99  * `:` is for user-defined <@ Aggregate Types>
 100  * `$` is for globals (represented by a pointer)
 101  * `%` is for function-scope temporaries
 102  * `@` is for block labels
 103
 104 In this BNF syntax, we use `?IDENT` to designate an identifier
 105 starting with the sigil `?`.
 106
 107 - 2. Types
 108 ----------
 109
 110 ~ Simple Types
 111 ~~~~~~~~~~~~~~
 112
 113     `bnf
 114     BASETY := 'w' | 'l' | 's' | 'd'  # Base types
 115     EXTTY  := BASETY    | 'b' | 'h'  # Extended types
 116
 117 The IL makes minimal use of types.  By design, the types
 118 used are restricted to what is necessary for unambiguous
 119 compilation to machine code and C interfacing.  Unlike LLVM,
 120 QBE is not using types as a means to safety; they are only
 121 here for semantic purposes.
 122
 123 The four base types are `w` (word), `l` (long), `s` (single),
 124 and `d` (double), they stand respectively for 32-bit and
 125 64-bit integers, and 32-bit and 64-bit floating-point numbers.
 126 There are no pointer types available; pointers are typed
 127 by an integer type sufficiently wide to represent all memory
 128 addresses (e.g., `l` on 64-bit architectures).  Temporaries
 129 in the IL can only have a basic type.
 130
 131 Extended types contain base types plus `b` (byte) and `h`
 132 (half word), respectively for 8-bit and 16-bit integers.
 133 They are used in <@ Aggregate Types> and <@ Data> definitions.
 134
 135 For C interfacing, the IL also provides user-defined aggregate
 136 types.  The syntax used to designate them is `:foo`.  Details
 137 about their definition are given in the <@ Aggregate Types >
 138 section.
 139
 140 ~ Subtyping
 141 ~~~~~~~~~~~
 142
 143 The IL has a minimal subtyping feature, for integer types only.
 144 Any value of type `l` can be used in a `w` context.  In that
 145 case, only the 32 least significant bits of the word value
 146 are used.
 147
 148 Make note that it is the opposite of the usual subtyping on
 149 integers (in C, we can safely use an `int` where a `long`
 150 is expected).  A long value cannot be used in word context.
 151 The rationale is that a word can be signed or unsigned, so
 152 extending it to a long could be done in two ways, either
 153 by zero-extension, or by sign-extension.
 154
 155 - 3. Constants
 156 --------------
 157
 158     `bnf
 159     CONST :=
 160         ['-'] NUMBER  # Decimal integer
 161       | 's_' FP       # Single-precision float
 162       | 'd_' FP       # Double-precision float
 163       | $IDENT        # Global symbol
 164
 165 Throughout the IL, constants are specified with a unified
 166 syntax and semantics.  Constants are immediates, meaning
 167 that they can be used directly in instructions; there is
 168 no need for a "load constant" instruction.
 169
 170 The representation of integers is two's complement.
 171 Floating-point numbers are represented using the
 172 single-precision and double-precision formats of the
 173 IEEE 754 standard.
 174
 175 Constants specify a sequence of bits and are untyped.
 176 They are always parsed as 64-bit blobs.  Depending on
 177 the context surrounding a constant, only some of its
 178 bits are used.  For example, in the program below, the
 179 two variables defined have the same value since the first
 180 operand of the subtraction is a word (32-bit) context.
 181
 182     %x =w sub -1, 0
 183     %y =w sub 4294967295, 0
 184
 185 Because specifying floating-point constants by their bits
 186 makes the code less readable, syntactic sugar is provided
 187 to express them.  Standard scientific notation is prefixed
 188 with `s_` and `d_` for single and double precision numbers
 189 respectively. Once again, the following example defines twice
 190 the same double-precision constant.
 191
 192     %x =d add d_0, d_-1
 193     %y =d add d_0, -4616189618054758400
 194
 195 Global symbols can also be used directly as constants;
 196 they will be resolved and turned into actual numeric
 197 constants by the linker.
 198
 199 - 4. Definitions
 200 ----------------
 201
 202 Definitions are the essential components of an IL file.
 203 They can define three types of objects: aggregate types,
 204 data, and functions.  Aggregate types are never exported
 205 and do not compile to any code.  Data and function
 206 definitions have file scope and are mutually recursive
 207 (even across IL files).  Their visibility can be controlled
 208 using the `export` keyword.
 209
 210 ~ Aggregate Types
 211 ~~~~~~~~~~~~~~~~~
 212
 213     `bnf
 214     TYPEDEF :=
 215         # Regular type
 216         'type' :IDENT '=' ['align' NUMBER]
 217         '{'
 218             ( SUBTY [NUMBER] ),
 219         '}'
 220       | # Opaque type
 221         'type' :IDENT '=' 'align' NUMBER '{' NUMBER '}'
 222
 223     SUBTY := EXTTY | :IDENT
 224
 225 Aggregate type definitions start with the `type` keyword.
 226 They have file scope, but types must be defined before being
 227 referenced.  The inner structure of a type is expressed by a
 228 comma-separated list of types enclosed in curly braces.
 229
 230     type :fourfloats = { s, s, d, d }
 231
 232 For ease of IL generation, a trailing comma is tolerated by
 233 the parser.  In case many items of the same type are
 234 sequenced (like in a C array), the shorter array syntax
 235 can be used.
 236
 237     type :abyteandmanywords = { b, w 100 }
 238
 239 By default, the alignment of an aggregate type is the
 240 maximum alignment of its members.  The alignment can be
 241 explicitly specified by the programmer.
 242
 243     type :cryptovector = align 16 { w 4 }
 244
 245 Opaque types are used when the inner structure of an
 246 aggregate cannot be specified; the alignment for opaque
 247 types is mandatory.  They are defined simply by enclosing
 248 their size between curly braces.
 249
 250     type :opaque = align 16 { 32 }
 251
 252 ~ Data
 253 ~~~~~~
 254
 255     `bnf
 256     DATADEF :=
 257         ['export'] 'data' $IDENT '=' ['align' NUMBER]
 258         '{'
 259             ( EXTTY DATAITEM+
 260             | 'z'   NUMBER ),
 261         '}'
 262
 263     DATAITEM :=
 264         $IDENT ['+' NUMBER]  # Symbol and offset
 265       |  '"' ... '"'         # String
 266       |  CONST               # Constant
 267
 268 Data definitions express objects that will be emitted in the
 269 compiled file.  They can be local to the file or exported
 270 with global visibility to the whole program.
 271
 272 They define a global identifier (starting with the sigil
 273 `$`), that will contain a pointer to the object specified
 274 by the definition.
 275
 276 Objects are described by a sequence of fields that start with
 277 a type letter.  This letter can either be an extended type,
 278 or the `z` letter.  If the letter used is an extended type,
 279 the data item following specifies the bits to be stored in
 280 the field.  When several data items follow a letter, they
 281 initialize multiple fields of the same size.
 282
 283 The members of a struct will be packed.  This means that
 284 padding has to be emitted by the frontend when necessary.
 285 Alignment of the whole data objects can be manually specified,
 286 and when no alignment is provided, the maximum alignment from
 287 the platform is used.
 288
 289 When the `z` letter is used the number following indicates
 290 the size of the field; the contents of the field are zero
 291 initialized.  It can be used to add padding between fields
 292 or zero-initialize big arrays.
 293
 294 Here are various examples of data definitions.
 295
 296     # Three 32-bit values 1, 2, and 3
 297     # followed by a 0 byte.
 298     data $a = { w 1 2 3, b 0 }
 299
 300     # A thousand bytes 0 initialized.
 301     data $b = { z 1000 }
 302
 303     # An object containing two 64-bit
 304     # fields, one with all bits sets and the
 305     # other containing a pointer to the
 306     # object itself.
 307     data $c = { l -1, l $c }
 308
 309 ~ Functions
 310 ~~~~~~~~~~~
 311
 312     `bnf
 313     FUNCDEF :=
 314         ['export'] 'function' [ABITY] $IDENT '(' (PARAM), ')'
 315         '{'
 316            BLOCK+
 317         '}'
 318
 319     PARAM :=
 320         ABITY %IDENT  # Regular parameter
 321       | 'env' %IDENT  # Environment parameter (first)
 322       | '...'         # Variadic marker (last)
 323
 324     ABITY := BASETY | :IDENT
 325
 326 Function definitions contain the actual code to emit in
 327 the compiled file.  They define a global symbol that
 328 contains a pointer to the function code.  This pointer
 329 can be used in `call` instructions or stored in memory.
 330
 331 The type given right before the function name is the
 332 return type of the function.  All return values of this
 333 function must have this return type.  If the return
 334 type is missing, the function cannot return any value.
 335
 336 The parameter list is a comma separated list of
 337 temporary names prefixed by types.  The types are used
 338 to correctly implement C compatibility.  When an argument
 339 has an aggregate type, a pointer to the aggregate is passed
 340 by the caller.  In the example below, we have to use a load
 341 instruction to get the value of the first (and only)
 342 member of the struct.
 343
 344     type :one = { w }
 345
 346     function w $getone(:one %p) {
 347     @start
 348             %val =w loadw %p
 349             ret %val
 350     }
 351
 352 If the parameter list ends with `...`, the function is
 353 a variadic function: it can accept a variable number of
 354 arguments.  To access the extra arguments provided by
 355 the caller, use the `vastart` and `vaarg` instructions
 356 described in the <@ Variadic > section.
 357
 358 Optionally, the parameter list can start with an
 359 environment parameter `env %e`.  This special parameter is
 360 a 64-bit integer temporary (i.e., of type `l`).  If the
 361 function does not use its environment parameter, callers
 362 can safely omit it.  This parameter is invisible to a C
 363 caller: for example, the function
 364
 365     export function w $add(env %e, w %a, w %b) {
 366     @start
 367             %c =w add %a, %b
 368             ret %c
 369     }
 370
 371 must be given the C prototype `int add(int, int)`.
 372 The intended use of this feature is to pass the
 373 environment pointer of closures while retaining a
 374 very good compatibility with C.  The <@ Call > section
 375 explains how to pass an environment parameter.
 376
 377 Since global symbols are defined mutually recursive,
 378 there is no need for function declarations: a function
 379 can be referenced before its definition.
 380 Similarly, functions from other modules can be used
 381 without previous declaration.  All the type information
 382 is provided in the call instructions.
 383
 384 The syntax and semantics for the body of functions
 385 are described in the <@ Control > section.
 386
 387 - 5. Control
 388 ------------
 389
 390 The IL represents programs as textual transcriptions of
 391 control flow graphs.  The control flow is serialized as
 392 a sequence of blocks of straight-line code which are
 393 connected using jump instructions.
 394
 395 ~ Blocks
 396 ~~~~~~~~
 397
 398     `bnf
 399     BLOCK :=
 400         @IDENT    # Block label
 401         PHI*      # Phi instructions
 402         INST*     # Regular instructions
 403         JUMP      # Jump or return
 404
 405 All blocks have a name that is specified by a label at
 406 their beginning.  Then follows a sequence of instructions
 407 that have "fall-through" flow.  Finally one jump terminates
 408 the block.  The jump can either transfer control to another
 409 block of the same function or return; they are described
 410 further below.
 411
 412 The first block in a function must not be the target of
 413 any jump in the program.  If this is really needed,
 414 the frontend could insert an empty prelude block
 415 at the beginning of the function.
 416
 417 When one block jumps to the next block in the IL file,
 418 it is not necessary to give the jump instruction, it
 419 will be automatically added by the parser.  For example
 420 the start block in the example below jumps directly
 421 to the loop block.
 422
 423     function $loop() {
 424     @start
 425     @loop
 426             %x =w phi @start 100, @loop %x1
 427             %x1 =w sub %x, 1
 428             jnz %x1, @loop, @end
 429     @end
 430             ret
 431     }
 432
 433 ~ Jumps
 434 ~~~~~~~
 435
 436     `bnf
 437     JUMP :=
 438         'jmp' @IDENT               # Unconditional
 439       | 'jnz' VAL, @IDENT, @IDENT  # Conditional
 440       | 'ret' [VAL]                # Return
 441
 442 A jump instruction ends every block and transfers the
 443 control to another program location.  The target of
 444 a jump must never be the first block in a function.
 445 The three kinds of jumps available are described in
 446 the following list.
 447
 448  1. Unconditional jump.
 449
 450     Simply jumps to another block of the same function.
 451
 452  2. Conditional jump.
 453
 454     When its word argument is non-zero, it jumps to its
 455     first label argument; otherwise it jumps to the other
 456     label.  The argument must be of word type; because of
 457     subtyping a long argument can be passed, but only its
 458     least significant 32 bits will be compared to 0.
 459
 460  3. Function return.
 461
 462     Terminates the execution of the current function,
 463     optionally returning a value to the caller.  The value
 464     returned must be of the type given in the function
 465     prototype.  If the function prototype does not specify
 466     a return type, no return value can be used.
 467
 468 - 6. Instructions
 469 -----------------
 470
 471 Instructions are the smallest piece of code in the IL, they
 472 form the body of <@ Blocks >.  The IL uses a three-address
 473 code, which means that one instruction computes an operation
 474 between two operands and assigns the result to a third one.
 475
 476 An instruction has both a name and a return type, this
 477 return type is a base type that defines the size of the
 478 instruction's result.  The type of the arguments can be
 479 unambiguously inferred using the instruction name and the
 480 return type.  For example, for all arithmetic instructions,
 481 the type of the arguments is the same as the return type.
 482 The two additions below are valid if `%y` is a word or a long
 483 (because of <@ Subtyping >).
 484
 485     %x =w add 0, %y
 486     %z =w add %x, %x
 487
 488 Some instructions, like comparisons and memory loads
 489 have operand types that differ from their return types.
 490 For instance, two floating points can be compared to give a
 491 word result (0 if the comparison succeeds, 1 if it fails).
 492
 493     %c =w cgts %a, %b
 494
 495 In the example above, both operands have to have single type.
 496 This is made explicit by the instruction suffix.
 497
 498 The types of instructions are described below using a short
 499 type string.  A type string specifies all the valid return
 500 types an instruction can have, its arity, and the type of
 501 its arguments depending on its return type.
 502
 503 Type strings begin with acceptable return types, then
 504 follows, in parentheses, the possible types for the arguments.
 505 If the N-th return type of the type string is used for an
 506 instruction, the arguments must use the N-th type listed for
 507 them in the type string.  When an instruction does not have a
 508 return type, the type string only contains the types of the
 509 arguments.
 510
 511 The following abbreviations are used.
 512
 513   * `T` stands for `wlsd`
 514   * `I` stands for `wl`
 515   * `F` stands for `sd`
 516   * `m` stands for the type of pointers on the target; on
 517     64-bit architectures it is the same as `l`
 518
 519 For example, consider the type string `wl(F)`, it mentions
 520 that the instruction has only one argument and that if the
 521 return type used is long, the argument must be of type double.
 522
 523 ~ Arithmetic and Bits
 524 ~~~~~~~~~~~~~~~~~~~~~
 525
 526   * `add`, `sub`, `div`, `mul` -- `T(T,T)`
 527   * `neg` -- `T(T)`
 528   * `udiv`, `rem`, `urem` -- `I(I,I)`
 529   * `or`, `xor`, `and` -- `I(I,I)`
 530   * `sar`, `shr`, `shl` -- `I(I,ww)`
 531
 532 The base arithmetic instructions in the first bullet are
 533 available for all types, integers and floating points.
 534
 535 When `div` is used with word or long return type, the
 536 arguments are treated as signed.  The unsigned integral
 537 division is available as `udiv` instruction.  When the
 538 result of a division is not an integer, it is truncated
 539 towards zero.
 540
 541 The signed and unsigned remainder operations are available
 542 as `rem` and `urem`.  The sign of the remainder is the same
 543 as the one of the dividend.  Its magnitude is smaller than
 544 the divisor one.  These two instructions and `udiv` are only
 545 available with integer arguments and result.
 546
 547 Bitwise OR, AND, and XOR operations are available for both
 548 integer types.  Logical operations of typical programming
 549 languages can be implemented using <@ Comparisons > and
 550 <@ Jumps >.
 551
 552 Shift instructions `sar`, `shr`, and `shl`, shift right or
 553 left their first operand by the amount from the second
 554 operand.  The shifting amount is taken modulo the size of
 555 the result type.  Shifting right can either preserve the
 556 sign of the value (using `sar`), or fill the newly freed
 557 bits with zeroes (using `shr`).  Shifting left always
 558 fills the freed bits with zeroes.
 559
 560 Remark that an arithmetic shift right (`sar`) is only
 561 equivalent to a division by a power of two for non-negative
 562 numbers.  This is because the shift right "truncates"
 563 towards minus infinity, while the division truncates
 564 towards zero.
 565
 566 ~ Memory
 567 ~~~~~~~~
 568
 569   * Store instructions.
 570
 571       * `stored` -- `(d,m)`
 572       * `stores` -- `(s,m)`
 573       * `storel` -- `(l,m)`
 574       * `storew` -- `(w,m)`
 575       * `storeh` -- `(w,m)`
 576       * `storeb` -- `(w,m)`
 577
 578     Store instructions exist to store a value of any base type
 579     and any extended type.  Since halfwords and bytes are not
 580     first class in the IL, `storeh` and `storeb` take a word
 581     as argument.  Only the first 16 or 8 bits of this word will
 582     be stored in memory at the address specified in the second
 583     argument.
 584
 585   * Load instructions.
 586
 587       * `loadd` -- `d(m)`
 588       * `loads` -- `s(m)`
 589       * `loadl` -- `l(m)`
 590       * `loadsw`, `loaduw` -- `I(mm)`
 591       * `loadsh`, `loaduh` -- `I(mm)`
 592       * `loadsb`, `loadub` -- `I(mm)`
 593
 594     For types smaller than long, two variants of the load
 595     instruction are available: one will sign extend the loaded
 596     value, while the other will zero extend it.  Note that
 597     all loads smaller than long can load to either a long or
 598     a word.
 599
 600     The two instructions `loadsw` and `loaduw` have the same
 601     effect when they are used to define a word temporary.
 602     A `loadw` instruction is provided as syntactic sugar for
 603     `loadsw` to make explicit that the extension mechanism
 604     used is irrelevant.
 605
 606   * Stack allocation.
 607
 608       * `alloc4` -- `m(l)`
 609       * `alloc8` -- `m(l)`
 610       * `alloc16` -- `m(l)`
 611
 612     These instructions allocate a chunk of memory on the
 613     stack.  The number ending the instruction name is the
 614     alignment required for the allocated slot.  QBE will
 615     make sure that the returned address is a multiple of
 616     that alignment value.
 617
 618     Stack allocation instructions are used, for example,
 619     when compiling the C local variables, because their
 620     address can be taken.  When compiling Fortran,
 621     temporaries can be used directly instead, because
 622     it is illegal to take the address of a variable.
 623
 624 The following example makes use some of the memory
 625 instructions.  Pointers are stored in long temporaries.
 626
 627     %A0 =l alloc4 8      # stack allocate an array A of 2 words
 628     %A1 =l add %A0, 4
 629     storew 43,  %A0      # A[0] <- 43
 630     storew 255, %A1      # A[1] <- 255
 631     %v1 =w loadw  %A0    # %v1 <- A[0] as word
 632     %v2 =w loadsb %A1    # %v2 <- A[1] as signed byte
 633     %v3 =w add %v1, %v2  # %v3 is 42 here
 634
 635 ~ Comparisons
 636 ~~~~~~~~~~~~~
 637
 638 Comparison instructions return an integer value (either a word
 639 or a long), and compare values of arbitrary types.  The returned
 640 value is 1 if the two operands satisfy the comparison
 641 relation, or 0 otherwise.  The names of comparisons respect
 642 a standard naming scheme in three parts.
 643
 644  1. All comparisons start with the letter `c`.
 645
 646  2. Then comes a comparison type.  The following
 647     types are available for integer comparisons:
 648
 649       * `eq` for equality
 650       * `ne` for inequality
 651       * `sle` for signed lower or equal
 652       * `slt` for signed lower
 653       * `sge` for signed greater or equal
 654       * `sgt` for signed greater
 655       * `ule` for unsigned lower or equal
 656       * `ult` for unsigned lower
 657       * `uge` for unsigned greater or equal
 658       * `ugt` for unsigned greater
 659
 660     Floating point comparisons use one of these types:
 661
 662       * `eq` for equality
 663       * `ne` for inequality
 664       * `le` for lower or equal
 665       * `lt` for lower
 666       * `ge` for greater or equal
 667       * `gt` for greater
 668       * `o` for ordered (no operand is a NaN)
 669       * `uo` for unordered (at least one operand is a NaN)
 670
 671     Because floating point types always have a sign bit,
 672     all the comparisons available are signed.
 673
 674  3. Finally, the instruction name is terminated with a
 675     basic type suffix precising the type of the operands
 676     to be compared.
 677
 678 For example, `cod` (`I(dd,dd)`) compares two double-precision
 679 floating point numbers and returns 1 if the two floating points
 680 are not NaNs, or 0 otherwise.  The `csltw` (`I(ww,ww)`)
 681 instruction compares two words representing signed numbers and
 682 returns 1 when the first argument is smaller than the second one.
 683
 684 ~ Conversions
 685 ~~~~~~~~~~~~~
 686
 687 Conversion operations allow to change the representation of
 688 a value, possibly modifying it if the target type cannot hold
 689 the value of the source type.  Conversions can extend the
 690 precision of a temporary (e.g., from signed 8-bit to 32-bit),
 691 or convert a floating point into an integer and vice versa.
 692
 693   * `extsw`, `extuw` -- `l(w)`
 694   * `extsh`, `extuh` -- `I(ww)`
 695   * `extsb`, `extub` -- `I(ww)`
 696   * `exts` -- `d(s)`
 697   * `truncd` -- `s(d)`
 698   * `stosi` -- `I(ss)`
 699   * `dtosi` -- `I(dd)`
 700   * `swtof` -- `F(ww)`
 701   * `sltof` -- `F(ll)`
 702
 703 Extending the precision of a temporary is done using the
 704 `ext` family of instructions.  Because QBE types do not
 705 precise the signedness (like in LLVM), extension instructions
 706 exist to sign-extend and zero-extend a value.  For example,
 707 `extsb` takes a word argument and sign-extend the 8
 708 least-significant bits to a full word or long, depending on
 709 the return type.
 710
 711 The instructions `exts` and `truncd` are provided to change
 712 the precision of a floating point value.  When the double
 713 argument of `truncd` cannot be represented as a
 714 single-precision floating point, it is truncated towards
 715 zero.
 716
 717 Converting between signed integers and floating points is
 718 done using `stosi` (single to signed integer), `dtosi`
 719 (double to signed integer), `swtof` (signed word to float),
 720 and `sltof` (signed long to float).  These instructions
 721 only handle signed integers, conversion to and from
 722 unsigned types are not yet supported.
 723
 724 Because of <@ Subtyping >, there is no need to have an
 725 instruction to lower the precision of an integer temporary.
 726
 727 ~ Cast and Copy
 728 ~~~~~~~~~~~~~~~
 729
 730 The `cast` and `copy` instructions return the bits of their
 731 argument verbatim.  However a `cast` will change an integer
 732 into a floating point of the same width and vice versa.
 733
 734   * `cast` -- `wlsd(sdwl)`
 735   * `copy` -- `T(T)`
 736
 737 Casts can be used to make bitwise operations on the
 738 representation of floating point numbers.  For example
 739 the following program will compute the opposite of the
 740 single-precision floating point number `%f` into `%rs`.
 741
 742     %b0 =w cast %f
 743     %b1 =w xor 2147483648, %b0  # flip the msb
 744     %rs =s cast %b1
 745
 746 ~ Call
 747 ~~~~~~
 748
 749     `bnf
 750     CALL := [%IDENT '=' ABITY] 'call' VAL '(' (ARG), ')'
 751
 752     ARG :=
 753         ABITY VAL  # Regular argument
 754       | 'env' VAL  # Environment argument (first)
 755       | '...'      # Variadic marker
 756
 757     ABITY := BASETY | :IDENT
 758
 759 The call instruction is special in several ways.  It is not
 760 a three-address instruction and requires the type of all
 761 its arguments to be given.  Also, the return type can be
 762 either a base type or an aggregate type.  These specifics
 763 are required to compile calls with C compatibility (i.e.,
 764 to respect the ABI).
 765
 766 When an aggregate type is used as argument type or return
 767 type, the value respectively passed or returned needs to be
 768 a pointer to a memory location holding the value.  This is
 769 because aggregate types are not first-class citizens of
 770 the IL.
 771
 772 Unless the called function does not return a value, a
 773 return temporary must be specified, even if it is never
 774 used afterwards.
 775
 776 An environment parameter can be passed as first argument
 777 using the `env` keyword.  The passed value must be a 64-bit
 778 integer.  If the called function does not expect an environment
 779 parameter, it will be safely discarded.  See the <@ Functions >
 780 section for more information about environment parameters.
 781
 782 When the called function is variadic, there must be a `...`
 783 marker separating the named and variadic arguments.
 784
 785 ~ Variadic
 786 ~~~~~~~~~~
 787
 788 The `vastart` and `vaarg` instructions provide a portable
 789 way to access the extra parameters of a variadic function.
 790
 791   * `vastart` -- `(m)`
 792   * `vaarg` -- `T(mmmm)`
 793
 794 The `vastart` instruction initializes a *variable argument
 795 list* used to access the extra parameters of the enclosing
 796 variadic function.  It is safe to call it multiple times.
 797
 798 The `vaarg` instruction fetches the next argument from
 799 a variable argument list.  It is currently limited to
 800 fetching arguments that have a base type.  This instruction
 801 is essentially effectful: calling it twice in a row will
 802 return two consecutive arguments from the argument list.
 803
 804 Both instructions take a pointer to a variable argument
 805 list as sole argument.  The size and alignment of variable
 806 argument lists depend on the target used.  However, it
 807 is possible to conservatively use the maximum size and
 808 alignment required by all the targets.
 809
 810     type :valist = align 8 { 24 }  # For amd64_sysv
 811     type :valist = align 8 { 32 }  # For arm64
 812
 813 The following example defines a variadic function adding
 814 its first three arguments.
 815
 816     function s $add3(s %a, ...) {
 817     @start
 818             %ap =l alloc8 32
 819             vastart %ap
 820             %r =s call $vadd(s %a, l %ap)
 821             ret %r
 822     }
 823
 824     function s $vadd(s %a, l %ap) {
 825     @start
 826             %b =s vaarg %ap
 827             %c =s vaarg %ap
 828             %d =s add %a, %b
 829             %e =s add %d, %c
 830             ret %e
 831     }
 832
 833 ~ Phi
 834 ~~~~~
 835
 836     `bnf
 837     PHI := %IDENT '=' BASETY 'phi' ( @IDENT VAL ),
 838
 839 First and foremost, phi instructions are NOT necessary when
 840 writing a frontend to QBE.  One solution to avoid having to
 841 deal with SSA form is to use stack allocated variables for
 842 all source program variables and perform assignments and
 843 lookups using <@ Memory > operations.  This is what LLVM
 844 users typically do.
 845
 846 Another solution is to simply emit code that is not in SSA
 847 form!  Contrary to LLVM, QBE is able to fixup programs not
 848 in SSA form without requiring the boilerplate of loading
 849 and storing in memory.  For example, the following program
 850 will be correctly compiled by QBE.
 851
 852     @start
 853             %x =w copy 100
 854             %s =w copy 0
 855     @loop
 856             %s =w add %s, %x
 857             %x =w sub %x, 1
 858             jnz %x, @loop, @end
 859     @end
 860             ret %s
 861
 862 Now, if you want to know what phi instructions are and how
 863 to use them in QBE, you can read the following.
 864
 865 Phi instructions are specific to SSA form.  In SSA form
 866 values can only be assigned once, without phi instructions,
 867 this requirement is too strong to represent many programs.
 868 For example consider the following C program.
 869
 870     int f(int x) {
 871             int y;
 872             if (x)
 873                     y = 1;
 874             else
 875                     y = 2;
 876             return y;
 877     }
 878
 879 The variable `y` is assigned twice, the solution to
 880 translate it in SSA form is to insert a phi instruction.
 881
 882     @ifstmt
 883             jnz %x, @ift, @iff
 884     @ift
 885             jmp @retstmt
 886     @iff
 887             jmp @retstmt
 888     @retstmt
 889             %y =w phi @ift 1, @iff 2
 890             ret %y
 891
 892 Phi instructions return one of their arguments depending
 893 on where the control came from.  In the example, `%y` is
 894 set to 1 if the `@ift` branch is taken, or it is set to
 895 2 otherwise.
 896
 897 An important remark about phi instructions is that QBE
 898 assumes that if a variable is defined by a phi it respects
 899 all the SSA invariants.  So it is critical to not use phi
 900 instructions unless you know exactly what you are doing.
 901
 902 - 7. Instructions Index
 903 -----------------------
 904
 905   * <@ Arithmetic and Bits >:
 906
 907       * `add`
 908       * `and`
 909       * `div`
 910       * `mul`
 911       * `or`
 912       * `rem`
 913       * `sar`
 914       * `shl`
 915       * `shr`
 916       * `sub`
 917       * `udiv`
 918       * `urem`
 919       * `xor`
 920
 921   * <@ Memory >:
 922
 923       * `alloc16`
 924       * `alloc4`
 925       * `alloc8`
 926       * `loadd`
 927       * `loadl`
 928       * `loads`
 929       * `loadsb`
 930       * `loadsh`
 931       * `loadsw`
 932       * `loadub`
 933       * `loaduh`
 934       * `loaduw`
 935       * `loadw`
 936       * `storeb`
 937       * `stored`
 938       * `storeh`
 939       * `storel`
 940       * `stores`
 941       * `storew`
 942
 943   * <@ Comparisons >:
 944
 945       * `ceqd`
 946       * `ceql`
 947       * `ceqs`
 948       * `ceqw`
 949       * `cged`
 950       * `cges`
 951       * `cgtd`
 952       * `cgts`
 953       * `cled`
 954       * `cles`
 955       * `cltd`
 956       * `clts`
 957       * `cned`
 958       * `cnel`
 959       * `cnes`
 960       * `cnew`
 961       * `cod`
 962       * `cos`
 963       * `csgel`
 964       * `csgew`
 965       * `csgtl`
 966       * `csgtw`
 967       * `cslel`
 968       * `cslew`
 969       * `csltl`
 970       * `csltw`
 971       * `cugel`
 972       * `cugew`
 973       * `cugtl`
 974       * `cugtw`
 975       * `culel`
 976       * `culew`
 977       * `cultl`
 978       * `cultw`
 979       * `cuod`
 980       * `cuos`
 981
 982   * <@ Conversions >:
 983
 984       * `dtosi`
 985       * `exts`
 986       * `extsb`
 987       * `extsh`
 988       * `extsw`
 989       * `extub`
 990       * `extuh`
 991       * `extuw`
 992       * `sltof`
 993       * `stosi`
 994       * `swtof`
 995       * `truncd`
 996
 997   * <@ Cast and Copy > :
 998
 999       * `cast`
1000       * `copy`
1001
1002   * <@ Call >:
1003
1004       * `call`
1005
1006   * <@ Variadic >:
1007
1008       * `vastart`
1009       * `vaarg`
1010
1011   * <@ Phi >:
1012
1013       * `phi`
1014
1015   * <@ Jumps >:
1016
1017       * `jmp`
1018       * `jnz`
1019       * `ret`