mu.md

   1 # Mu Reference
   2
   3 Mu programs are sequences of `fn` and `type` definitions.
   4
   5 ## Functions
   6
   7 Define functions with the `fn` keyword. For example:
   8
   9 ```
  10   fn foo arg1: int, arg2: int -> result/eax: boolean
  11 ```
  12
  13 Functions contain `{}` blocks, `var` declarations, primitive statements and
  14 calls to other functions. Only `{}` blocks can nest. Primitive statements and
  15 function calls look similar:
  16
  17 ```
  18   out1, out2, out3, ... <- operation inout1, inout2, inout3, ...
  19 ```
  20
  21 They can take any number of inouts and outputs, including 0. Statements
  22 with 0 outputs also drop the `<-`.
  23
  24 Inouts can be either variables in memory, variables in registers, or
  25 constants. Outputs are always variables in registers.
  26
  27 Inouts in memory can be either inputs or outputs (if they're addresses being
  28 written to). Hence the name.
  29
  30 Primitives usually require their inouts to be in specific combinations of
  31 memory and register. User-defined functions are flexible.
  32
  33 Primitives can often write to arbitrary output registers. User-defined
  34 functions, however, require rigidly specified output registers.
  35
  36 ## Variables, registers and memory
  37
  38 Declare local variables in a function using the `var` keyword.
  39
  40 You can declare local variables in either registers or memory (the stack). So
  41 a `var` statement has two forms:
  42   - Living in a register, e.g. `var x/eax: int <- copy 0` defines `x` which
  43     lives in `eax`.
  44   - Living in memory, e.g. `var x: int` defines `x` on the stack.
  45
  46 Variables in registers must be initialized. Variables on the stack are
  47 implicitly zeroed out.
  48
  49 Variables exist only within the `{}` block they're defined in. Space allocated
  50 to them on the stack is reclaimed after execution leaves the block. Registers
  51 restore whatever variable was using them in the outer block.
  52
  53 It is perfectly ok to reuse a register for a new variable. Even in a single
  54 block (though you permanently lose the old variable then).
  55
  56 Variables can be in six 32-bit _general-purpose_ registers of the x86 processor.
  57   - eax
  58   - ebx
  59   - ecx
  60   - edx
  61   - esi ('s' often a mnemonic for 'source')
  62   - edi ('d' often a mnemonic for 'destination')
  63
  64 Most functions return results in `eax` by convention. In practice, it ends up
  65 churning through variables pretty quickly.
  66
  67 You can store several types in these registers:
  68   - int
  69   - boolean
  70   - (addr T) (address into memory)
  71   - byte (uses only 8 bits)
  72   - code-point (Unicode)
  73   - grapheme (code-point encoded in UTF-8)
  74
  75 There's one 32-bit type you _cannot_ store in these registers:
  76   - float
  77
  78 It instead uses eight separate 32-bit registers: xmm0, xmm1, ..., xmm7
  79
  80 Types that require more than 32 bits (4 bytes) cannot be stored in registers:
  81   - (array T)
  82   - (handle T)
  83   - (stream T)
  84   - slice
  85   - any compound types you define using the `type` keyword
  86
  87 `T` here can be any type, including combinations of types. For example:
  88   - (array int) -- an array of ints
  89   - (addr int) -- an address to an int
  90   - (addr array byte) -- an address to an array of bytes, useful for Unicode
  91     text strings
  92   - (handle int) -- a handle to an int
  93   - (addr handle int) -- an address to a handle to int
  94   - (addr array handle int) -- an address to an array of handles to ints
  95   - ...and so on.
  96
  97 Other miscellaneous restrictions:
  98   - `byte` variables must be either in registers or on the heap, never local
  99     variables on the stack.
 100   - `addr` variables can never "escape" a function either by being returned or
 101     by being written to a memory location. When you need that sort of thing,
 102     use a `handle` instead.
 103
 104 ## Operations on simple types
 105
 106 We'll now survey a long list of statement forms that operate on 32-bit types.
 107 Most of these are primitives, but some are also implemented as functions
 108 (which have slightly different rules as mentioned up top). Most instructions
 109 with multiple args require types to match. Various operations have other
 110 restrictions which we'll note below, using the following notation:
 111   - `var/reg` indicates a variable in some register. Where we require a
 112     variable in a specific register, we'll mention it explicitly. E.g.
 113     `var/eax`.
 114   - `var/xreg` indicates a variable in some floating-point register `xmm_`.
 115   - `var` without a `reg` indicates either a variable on the stack, or
 116     dereferencing a variable in a (non-floating-point) register: `*var/reg`.
 117   - `var: type` indicates a variable that must satisfy some type constraint.
 118   - `n` indicates a literal integer. There are no floating-point literals.
 119
 120 ### Moving values around
 121
 122 These instructions work with variables of any 32-bit type except `byte` and
 123 `float`.
 124
 125 ```
 126   var/reg <- copy var2/reg2
 127   copy-to var1, var2/reg
 128   var/reg <- copy var2
 129   var/reg <- copy n
 130   copy-to var, n
 131 ```
 132
 133 Byte variables have their own instructions:
 134
 135 ```
 136   var/reg <- copy-byte var2/reg2
 137   var/reg <- copy-byte *var2/reg2     # var2 must have type (addr byte)
 138   copy-byte-to *var1/reg1, var2/reg2  # var1 must have type (addr byte)
 139 ```
 140
 141 Floating point variables can be copied as well, but only to or from
 142 floating-point registers `xmm_`.
 143
 144 ```
 145   var/xreg <- copy var2/xreg2
 146   copy-to var1, var2/xreg
 147   var/xreg <- copy var2
 148   var/xreg <- copy *var2/reg2         # var2 must have type (addr byte) and live in a general-purpose register
 149 ```
 150
 151 There's no way to copy a literal to a floating-point register. However,
 152 there's a few ways to convert non-float values in general-purpose registers.
 153
 154 ```
 155   var/xreg <- convert var2/reg2
 156   var/xreg <- convert var2
 157   var/xreg <- convert *var2/reg2
 158 ```
 159
 160 Correspondingly, there are ways to convert floats into integers, with and
 161 without rounding.
 162
 163 ```
 164   var/reg <- convert var2/xreg2
 165   var/reg <- convert var2
 166   var/reg <- convert *var2/reg2
 167
 168   var/reg <- truncate var2/xreg2
 169   var/reg <- truncate var2
 170   var/reg <- truncate *var2/reg2
 171 ```
 172
 173 Still, the absence of fractional literals is an annoyance. Mu provides some
 174 helpers to mitigate it somewhat:
 175
 176 ```
 177   result/xmm0 <- rational numerator: int, denominator: int
 178   fill-in-rational out: (addr float), numerator: int, denominator: int
 179 ```
 180
 181 These are functions, so the inouts have fewer restrictions while the outputs
 182 have more. The inouts can be registers, or memory, or even literals. The
 183 output for `rational` _must_ be in register `xmm0`.
 184
 185 ### Comparing values
 186
 187 Work with variables of any 32-bit type. `addr` variables can only be compared
 188 to 0.
 189
 190 ```
 191   compare var1, var2/reg
 192   compare var1/reg, var2
 193   compare var/eax, n
 194   compare var, n
 195 ```
 196
 197 Floating-point numbers cannot be compared to literals, and the register must
 198 come first.
 199
 200 ```
 201   compare var1/xreg1, var2/xreg2
 202   compare var1/xreg1, var2
 203 ```
 204
 205 ### Branches
 206
 207 Immediately after a `compare` instruction you can branch on its result. For
 208 example:
 209
 210 ```
 211   break-if-=
 212 ```
 213
 214 This instruction will jump to after the enclosing `{}` block if the previous
 215 `compare` detected equality. Here's the list of conditional and unconditional
 216 `break` instructions:
 217
 218 ```
 219   break
 220   break-if-=
 221   break-if-!=
 222   break-if-<
 223   break-if->
 224   break-if-<=
 225   break-if->=
 226 ```
 227
 228 Similarly, you can jump back to the start of the enclosing `{}` block with
 229 `loop`. Here's the list of `loop` instructions.
 230
 231 ```
 232   loop
 233   loop-if-=
 234   loop-if-!=
 235   loop-if-<
 236   loop-if->
 237   loop-if-<=
 238   loop-if->=
 239 ```
 240
 241 Additionally, there are special variants for comparing `addr` and `float`
 242 values, which results in the following comprehensive list of jumps:
 243
 244 ```
 245   break
 246   break-if-=
 247   break-if-!=
 248   break-if-<    break-if-addr<    break-if-float<
 249   break-if->    break-if-addr>    break-if-float>
 250   break-if-<=   break-if-addr<=   break-if-float<=
 251   break-if->=   break-if-addr>=   break-if-float>=
 252
 253   loop
 254   loop-if-=
 255   loop-if-!=
 256   loop-if-<     loop-if-addr<     loop-if-float<
 257   loop-if->     loop-if-addr>     loop-if-float>
 258   loop-if-<=    loop-if-addr<=    loop-if-float<=
 259   loop-if->=    loop-if-addr>=    loop-if-float>=
 260 ```
 261
 262 One final property all these jump instructions share: they can take an
 263 optional block name to jump to. For example:
 264
 265 ```
 266   a: {
 267     ...
 268     break a     #----------|
 269     ...               #    |
 270   }                   # <--|
 271
 272
 273   a: {                # <--|
 274     ...               #    |
 275     b: {              #    |
 276       ...             #    |
 277       loop a    #----------|
 278       ...
 279     }
 280     ...
 281   }
 282 ```
 283
 284 However, there's no way to jump to a block that doesn't contain the `loop` or
 285 `break` instruction.
 286
 287 ### Integer arithmetic
 288
 289 These instructions require variables of non-`addr`, non-`float` types.
 290
 291 Add:
 292 ```
 293   var1/reg1 <- add var2/reg2
 294   var/reg <- add var2
 295   add-to var1, var2/reg                 # var1 += var2
 296   var/reg <- add n
 297   add-to var, n
 298 ```
 299
 300 Subtract:
 301 ```
 302   var1/reg1 <- subtract var2/reg2
 303   var/reg <- subtract var2
 304   subtract-from var1, var2/reg          # var1 -= var2
 305   var/reg <- subtract n
 306   subtract-from var, n
 307 ```
 308
 309 Add one:
 310 ```
 311   var/reg <- increment
 312   increment var
 313 ```
 314
 315 Subtract one:
 316 ```
 317   var/reg <- decrement
 318   decrement var
 319 ```
 320
 321 Multiply:
 322 ```
 323   var/reg <- multiply var2
 324 ```
 325
 326 The result of a multiply must be a register.
 327
 328 Negate:
 329 ```
 330   var1/reg1 <- negate
 331   negate var
 332 ```
 333
 334 ### Fractional arithmetic
 335
 336 Operations on `float` variables include a few we've seen before and some new
 337 ones. Notice here that we mostly use floating-point registers `xmm_`, but
 338 still use the general-purpose registers when dereferencing variables of type
 339 `(addr float)`.
 340
 341 ```
 342   var/xreg <- add var2/xreg2
 343   var/xreg <- add var2
 344   var/xreg <- add *var2/reg2
 345
 346   var/xreg <- subtract var2/xreg2
 347   var/xreg <- subtract var2
 348   var/xreg <- subtract *var2/reg2
 349
 350   var/xreg <- multiply var2/xreg2
 351   var/xreg <- multiply var2
 352   var/xreg <- multiply *var2/reg2
 353
 354   var/xreg <- divide var2/xreg2
 355   var/xreg <- divide var2
 356   var/xreg <- divide *var2/reg2
 357
 358   var/xreg <- reciprocal var2/xreg2
 359   var/xreg <- reciprocal var2
 360   var/xreg <- reciprocal *var2/reg2
 361
 362   var/xreg <- square-root var2/xreg2
 363   var/xreg <- square-root var2
 364   var/xreg <- square-root *var2/reg2
 365
 366   var/xreg <- inverse-square-root var2/xreg2
 367   var/xreg <- inverse-square-root var2
 368   var/xreg <- inverse-square-root *var2/reg2
 369
 370   var/xreg <- min var2/xreg2
 371   var/xreg <- min var2
 372   var/xreg <- min *var2/reg2
 373
 374   var/xreg <- max var2/xreg2
 375   var/xreg <- max var2
 376   var/xreg <- max *var2/reg2
 377 ```
 378
 379 Two instructions in the above list are approximate. According to the Intel
 380 manual, `reciprocal` and `inverse-square-root` [go off the rails around the
 381 fourth decimal place](linux/x86_approx.md). If you need more precision, use
 382 `divide` separately.
 383
 384 ### Bitwise boolean operations
 385
 386 These require variables of non-`addr`, non-`float` types.
 387
 388 And:
 389 ```
 390   var1/reg1 <- and var2/reg2
 391   var/reg <- and var2
 392   and-with var1, var2/reg
 393   var/reg <- and n
 394   and-with var, n
 395 ```
 396
 397 Or:
 398 ```
 399   var1/reg1 <- or var2/reg2
 400   var/reg <- or var2
 401   or-with var1, var2/reg
 402   var/reg <- or n
 403   or-with var, n
 404 ```
 405
 406 Not:
 407 ```
 408   var1/reg1 <- not
 409   not var
 410 ```
 411
 412 Xor:
 413 ```
 414   var1/reg1 <- xor var2/reg2
 415   var/reg <- xor var2
 416   xor-with var1, var2/reg
 417   var/reg <- xor n
 418   xor-with var, n
 419 ```
 420
 421 ### Bitwise shifts
 422
 423 Shifts require variables of non-`addr`, non-`float` types.
 424
 425 ```
 426   var/reg <- shift-left n
 427   var/reg <- shift-right n
 428   var/reg <- shift-right-signed n
 429   shift-left var, n
 430   shift-right var, n
 431   shift-right-signed var, n
 432 ```
 433
 434 Shifting bits left always inserts zeros on the right.
 435 Shifting bits right inserts zeros on the left by default.
 436 A _signed_ shift right duplicates the leftmost bit, thereby preserving the
 437 sign of an integer.
 438
 439 ## Operations on more complex types
 440
 441 These instructions work with any type `T`. As before we use `/reg` here to
 442 indicate when a variable must live in a register. We also include type
 443 constraints after a `:`.
 444
 445 ### Addresses and handles
 446
 447 You can compute the address of any variable in memory (never in registers):
 448
 449 ```
 450   var/reg: (addr T) <- address var2: T
 451 ```
 452
 453 As mentioned up top, `addr` variables can never escape the function where
 454 they're computed. You can't store them on the heap, or in compound types.
 455 Think of them as short-lived things.
 456
 457 To manage long-lived addresses, _allocate_ them on the heap.
 458
 459 ```
 460   allocate var: (addr handle T)       # var can be in either register or memory
 461 ```
 462
 463 Handles can be copied and stored without restriction. However, they're too
 464 large to fit in a register. You also can't access their payload directly, you
 465 have to first convert them into a short-lived `addr` using _lookup_.
 466
 467 ```
 468   var y/eax: (addr T) <- lookup x: (handle T)
 469 ```
 470
 471 Since handles are large compound types, there's a special helper for comparing
 472 them:
 473
 474 ```
 475   var/eax: boolean <- handle-equal? var1: (handle T), var2: (handle T)
 476 ```
 477
 478 ### Arrays
 479
 480 Arrays are declared in two ways:
 481   1. On the stack with a literal size:
 482 ```
 483   var x: (array int 3)
 484 ```
 485   2. On the heap with a potentially variable size. For example:
 486 ```
 487   var x: (handle array int)
 488   var x-ah/eax: (addr handle array int) <- address x
 489   populate x-ah, 8
 490 ```
 491
 492 The `8` here can also be an int in a register or memory. (The `-ah` is a
 493 common variable naming convention and stands for "address of a handle".
 494 Essential for allocating long-lived data on the heap.)
 495
 496 You can compute the length of an array, though you'll need an `addr` to do so:
 497
 498 ```
 499   var/reg: int <- length arr/reg: (addr array T)
 500 ```
 501
 502 To read from or write to an array, use `index` to first obtain an address to
 503 read from or modify:
 504
 505 ```
 506   var/reg: (addr T) <- index arr/reg: (addr array T), n
 507   var/reg: (addr T) <- index arr: (array T len), n
 508 ```
 509
 510 Like our notation of `n`, `len` here is required to be a literal.
 511
 512 The index requested can also be a variable in a register, with one caveat:
 513
 514 ```
 515   var/reg: (addr T) <- index arr/reg: (addr array T), idx/reg: int
 516   var/reg: (addr T) <- index arr: (array T len), idx/reg: int
 517 ```
 518
 519 The caveat: the size of T must be 1, 2, 4 or 8 bytes. For other sizes of T
 520 you'll need to split up the work, performing a `compute-offset` before the
 521 `index`.
 522
 523 ```
 524   var/reg: (offset T) <- compute-offset arr: (addr array T), idx/reg: int     # arr can be in reg or mem
 525   var/reg: (offset T) <- compute-offset arr: (addr array T), idx: int         # arr can be in reg or mem
 526 ```
 527
 528 The result of a `compute-offset` statement can be passed to `index`:
 529
 530 ```
 531   var/reg: (addr T) <- index arr/reg: (addr array T), idx/reg: (offset T)
 532 ```
 533
 534 ### Stream operations
 535
 536 A common use for arrays is as buffers. Save a few items to a scratch space and
 537 then process them. This pattern is so common (we use it in files) that there's
 538 special support for it with a built-in type: `stream`.
 539
 540 Streams are like arrays in many ways. You can initialize them with a length on
 541 the stack:
 542
 543 ```
 544   var x: (stream int 3)
 545 ```
 546
 547 You can also populate them on the heap:
 548 ```
 549   var x: (handle stream int)
 550   var x-ah/eax: (addr handle stream int) <- address x
 551   populate-stream x-ah, 8
 552 ```
 553
 554 However, streams don't provide random access with an `index` instruction.
 555 Instead, you write to them sequentially, and read back what you wrote.
 556
 557 ```
 558   read-from-stream s: (addr stream T), out: (addr T)
 559   write-to-stream s: (addr stream T), in: (addr T)
 560 ```
 561
 562 Streams of bytes are particularly common for managing Unicode text, and there
 563 are a few functions to help with them:
 564
 565 ```
 566   write s: (addr stream byte), u: (addr array byte)  # write u to s, abort if full
 567   overflow?/eax: boolean <- try-write s: (addr stream byte), u: (addr array byte)
 568   write-stream dest: (addr stream byte), src: (addr stream byte)
 569   # bytes
 570   append-byte s: (addr stream byte), var: int  # write lower byte of var
 571   var/eax: byte <- read-byte s: (addr stream byte)
 572   # 32-bit graphemes encoded in UTF-8
 573   write-grapheme out: (addr stream byte), g: grapheme
 574   g/eax: grapheme <- read-grapheme in: (addr stream byte)
 575 ```
 576
 577 You can check if a stream is empty or full:
 578
 579 ```
 580   var/eax: boolean <- stream-empty? s: (addr stream)
 581   var/eax: boolean <- stream-full? s: (addr stream)
 582 ```
 583
 584 You can clear streams:
 585
 586 ```
 587   clear-stream f: (addr stream T)
 588 ```
 589
 590 You can also rewind them to reread their contents:
 591
 592 ```
 593   rewind-stream f: (addr stream T)
 594 ```
 595
 596 ## Compound types
 597
 598 Primitive types can be combined together using the `type` keyword. For
 599 example:
 600
 601 ```
 602 type point {
 603   x: int
 604   y: int
 605 }
 606 ```
 607
 608 Mu programs are sequences of just `fn` and `type` definitions.
 609
 610 Compound types can't include `addr` types for safety reasons (use `handle` instead,
 611 which is described below). They also can't currently include `array`, `stream`
 612 or `byte` types. Since arrays and streams carry their size with them, supporting
 613 them in compound types complicates variable initialization. Instead of
 614 defining them inline in a type definition, define a `handle` to them. Bytes
 615 shouldn't be used for anything but UTF-8 strings.
 616
 617 To access within a compound type, use the `get` instruction. There are two
 618 forms. You need either a variable of the type itself (say `T`) in memory, or a
 619 variable of type `(addr T)` in a register.
 620
 621 ```
 622 var/reg: (addr T_f) <- get var/reg: (addr T), f
 623 var/reg: (addr T_f) <- get var: T, f
 624 ```
 625
 626 The `f` here is the field name from the `type` definition, and its type `T_f`
 627 must match the type of `f` in the `type` definition. For example, some legal
 628 instructions for the definition of `point` above:
 629
 630 ```
 631 var a/eax: (addr int) <- get p, x
 632 var a/eax: (addr int) <- get p, y
 633 ```
 634
 635 You can clear compound types using the `clear-object` function:
 636
 637 ```
 638 clear-object var: (addr T)
 639 ```
 640
 641 You can shallow-copy compound types using the `copy-object` function:
 642
 643 ```
 644 copy-object src: (addr T), dest: (addr T)
 645 ```