mu.md

   1 # Mu Reference
   2
   3 Mu programs are sequences of `fn` and `type` definitions.
   4
   5 ## Functions
   6
   7 Define functions with the `fn` keyword. For example:
   8
   9 ```
  10   fn foo arg1: int, arg2: int -> _/eax: boolean
  11 ```
  12
  13 Functions contain `{}` blocks, `var` declarations, primitive statements and
  14 calls to other functions. Only `{}` blocks can nest. Primitive statements and
  15 function calls look similar:
  16
  17 ```
  18   out1, out2, out3, ... <- operation inout1, inout2, inout3, ...
  19 ```
  20
  21 They can take any number of inouts and outputs, including 0. Statements
  22 with 0 outputs also drop the `<-`.
  23
  24 Inouts can be either variables in memory, variables in registers, or
  25 literal constants (either integers or string literals that get replaced with
  26 constant addresses). Outputs are always variables in registers.
  27
  28 Inouts in memory can be either inputs or outputs (if they're addresses being
  29 written to). Hence the name.
  30
  31 Primitives usually require their inouts to be in specific combinations of
  32 memory and register. They can't handle more than one inout in memory.
  33 User-defined functions are flexible.
  34
  35 Primitives can often write to arbitrary output registers. User-defined
  36 functions, however, require rigidly specified output registers. Notice how the
  37 definition of `foo` above writes to `eax`.
  38
  39 A function's header names its inouts, but not its outputs (hence the `_`
  40 above).
  41
  42 All Mu programs must define at least one function: `main`. That's where they
  43 begin executing instructions.
  44
  45 ## Variables, registers and memory
  46
  47 Declare local variables in a function using the `var` keyword. As shown above,
  48 all variable declarations should specify a type after a `:`.
  49
  50 You can declare local variables in either registers or memory (the _stack_
  51 segment). So a `var` statement has two forms:
  52   - Living in a register, e.g. `var x/eax: int <- copy 0` defines `x` which
  53     lives in `eax`.
  54   - Living in memory, e.g. `var x: int` defines `x` on the stack.
  55
  56 Variables in registers must be initialized. Variables on the stack are
  57 implicitly zeroed out.
  58
  59 Variables exist only within the `{}` block they're defined in. Space allocated
  60 to them on the stack is reclaimed after execution leaves the block. Registers
  61 restore whatever variable was using them in the outer block.
  62
  63 It is perfectly ok to reuse a register for a new variable. Even in a single
  64 block (though you permanently lose the old variable then).
  65
  66 Variables can be in six 32-bit _general-purpose_ registers of the x86 processor.
  67   - eax
  68   - ebx
  69   - ecx
  70   - edx
  71   - esi ('s' often a mnemonic for 'source')
  72   - edi ('d' often a mnemonic for 'destination')
  73
  74 Most functions return results in `eax` by convention. In practice, it ends up
  75 churning through variables pretty quickly.
  76
  77 You can store several types in these registers:
  78   - int
  79   - boolean
  80   - (addr T) (address into memory)
  81   - byte (uses only 8 bits)
  82   - code-point (Unicode)
  83   - code-point-utf8 (code-point encoded in UTF-8)
  84
  85 There's one 32-bit type you _cannot_ store in these registers:
  86   - float
  87
  88 It instead uses eight separate 32-bit registers: xmm0, xmm1, ..., xmm7
  89
  90 Types that require more than 32 bits (4 bytes) cannot be stored in registers:
  91   - (array T)
  92   - (handle T)
  93   - (stream T)
  94   - slice
  95   - any compound types you define using the `type` keyword
  96
  97 `T` here can be any type, including combinations of types. For example:
  98   - (array int) -- an array of ints
  99   - (addr int) -- an address to an int
 100   - (addr array byte) -- an address to an array of bytes, useful for Unicode
 101     text strings
 102   - (handle int) -- a handle to an int
 103   - (addr handle int) -- an address to a handle to int
 104   - (addr array handle int) -- an address to an array of handles to ints
 105   - ...and so on.
 106
 107 Other miscellaneous restrictions:
 108   - `byte` variables must be either in registers or on the heap, never local
 109     variables on the stack.
 110   - `addr` variables can never "escape" a function either by being returned or
 111     by being written to a memory location. When you need that sort of thing,
 112     use a `handle` instead.
 113
 114 ## Operations on simple types
 115
 116 We'll now survey a long list of statement forms that operate on 32-bit types.
 117 Most of these are primitives, but some are also implemented as functions
 118 (which have slightly different rules as mentioned up top). Most instructions
 119 with multiple args require types to match. Various operations have other
 120 restrictions which we'll note below, using the following notation:
 121   - `var/reg` indicates a variable in some register. Where we require a
 122     variable in a specific register, we'll mention it explicitly. E.g.
 123     `var/eax`.
 124   - `var/xreg` indicates a variable in some floating-point register `xmm_`.
 125   - `var` without a `reg` indicates either a variable on the stack, or
 126     dereferencing a variable in a (non-floating-point) register: `*var/reg`.
 127   - `var: type` indicates a variable that must satisfy some type constraint.
 128   - `n` indicates a literal integer. There are no floating-point literals.
 129
 130 ### Moving values around
 131
 132 These instructions work with variables of any 32-bit type except `byte` and
 133 `float`.
 134
 135 ```
 136   var/reg <- copy var2/reg2
 137   copy-to var1, var2/reg
 138   var/reg <- copy var2
 139   var/reg <- copy n
 140   copy-to var, n
 141 ```
 142
 143 Byte variables have their own instructions:
 144
 145 ```
 146   var/reg <- copy-byte var2/reg2
 147   var/reg <- copy-byte *var2/reg2     # var2 must have type (addr byte)
 148   copy-byte-to *var1/reg1, var2/reg2  # var1 must have type (addr byte)
 149 ```
 150
 151 Floating point variables can be copied as well, but only to or from
 152 floating-point registers `xmm_`.
 153
 154 ```
 155   var/xreg <- copy var2/xreg2
 156   copy-to var1, var2/xreg
 157   var/xreg <- copy var2
 158   var/xreg <- copy *var2/reg2         # var2 must have type (addr byte) and live in a general-purpose register
 159 ```
 160
 161 There's no way to copy a literal to a floating-point register. However,
 162 there's a few ways to convert non-float values in general-purpose registers.
 163
 164 ```
 165   var/xreg <- convert var2/reg2
 166   var/xreg <- convert var2
 167   var/xreg <- convert *var2/reg2
 168 ```
 169
 170 Correspondingly, there are ways to convert floats into integers, with and
 171 without rounding.
 172
 173 ```
 174   var/reg <- convert var2/xreg2
 175   var/reg <- convert var2
 176   var/reg <- convert *var2/reg2
 177
 178   var/reg <- truncate var2/xreg2
 179   var/reg <- truncate var2
 180   var/reg <- truncate *var2/reg2
 181 ```
 182
 183 Still, the absence of fractional literals is an annoyance. Mu provides some
 184 helpers to mitigate it somewhat:
 185
 186 ```
 187   result/xmm0 <- rational numerator: int, denominator: int
 188   fill-in-rational out: (addr float), numerator: int, denominator: int
 189 ```
 190
 191 These are functions, so the inouts have fewer restrictions while the outputs
 192 have more. The inouts can be registers, or memory, or even literals. The
 193 output for `rational` _must_ be in register `xmm0`.
 194
 195 ### Comparing values
 196
 197 Work with variables of any 32-bit type. `addr` variables can only be compared
 198 to 0.
 199
 200 ```
 201   compare var1, var2/reg
 202   compare var1/reg, var2
 203   compare var/eax, n
 204   compare var, n
 205 ```
 206
 207 Floating-point numbers cannot be compared to literals, and the register must
 208 come first.
 209
 210 ```
 211   compare var1/xreg1, var2/xreg2
 212   compare var1/xreg1, var2
 213 ```
 214
 215 ### Branches
 216
 217 Immediately after a `compare` instruction you can branch on its result. For
 218 example:
 219
 220 ```
 221   break-if-=
 222 ```
 223
 224 This instruction will jump to after the enclosing `{}` block if the previous
 225 `compare` detected equality. Here's the list of conditional and unconditional
 226 `break` instructions:
 227
 228 ```
 229   break
 230   break-if-=
 231   break-if-!=
 232   break-if-<
 233   break-if->
 234   break-if-<=
 235   break-if->=
 236 ```
 237
 238 Similarly, you can jump back to the start of the enclosing `{}` block with
 239 `loop`. Here's the list of `loop` instructions.
 240
 241 ```
 242   loop
 243   loop-if-=
 244   loop-if-!=
 245   loop-if-<
 246   loop-if->
 247   loop-if-<=
 248   loop-if->=
 249 ```
 250
 251 Additionally, there are special variants for comparing `addr` and `float`
 252 values, which results in the following comprehensive list of jumps:
 253
 254 ```
 255   break
 256   break-if-=
 257   break-if-!=
 258   break-if-<    break-if-addr<    break-if-float<
 259   break-if->    break-if-addr>    break-if-float>
 260   break-if-<=   break-if-addr<=   break-if-float<=
 261   break-if->=   break-if-addr>=   break-if-float>=
 262
 263   loop
 264   loop-if-=
 265   loop-if-!=
 266   loop-if-<     loop-if-addr<     loop-if-float<
 267   loop-if->     loop-if-addr>     loop-if-float>
 268   loop-if-<=    loop-if-addr<=    loop-if-float<=
 269   loop-if->=    loop-if-addr>=    loop-if-float>=
 270 ```
 271
 272 One final property all these jump instructions share: they can take an
 273 optional block name to jump to. For example:
 274
 275 ```
 276   a: {
 277     ...
 278     break a     #----------|
 279     ...               #    |
 280   }                   # <--|
 281
 282
 283   a: {                # <--|
 284     ...               #    |
 285     b: {              #    |
 286       ...             #    |
 287       loop a    #----------|
 288       ...
 289     }
 290     ...
 291   }
 292 ```
 293
 294 However, there's no way to jump to a block that doesn't contain the `loop` or
 295 `break` instruction.
 296
 297 ### Integer arithmetic
 298
 299 These instructions require variables of non-`addr`, non-`float` types.
 300
 301 Add:
 302 ```
 303   var1/reg1 <- add var2/reg2
 304   var1/reg <- add var2
 305   add-to var1, var2/reg                 # var1 += var2
 306   var/reg <- add n
 307   add-to var, n
 308 ```
 309
 310 Subtract:
 311 ```
 312   var1/reg1 <- subtract var2/reg2
 313   var1/reg <- subtract var2
 314   subtract-from var1, var2/reg          # var1 -= var2
 315   var/reg <- subtract n
 316   subtract-from var, n
 317 ```
 318
 319 Add one:
 320 ```
 321   var/reg <- increment
 322   increment var
 323 ```
 324
 325 Subtract one:
 326 ```
 327   var/reg <- decrement
 328   decrement var
 329 ```
 330
 331 Multiply:
 332 ```
 333   var1/reg <- multiply var2
 334 ```
 335
 336 The result of a multiply must be a register.
 337
 338 Negate:
 339 ```
 340   var/reg1 <- negate
 341   negate var
 342 ```
 343
 344 ### Fractional arithmetic
 345
 346 Operations on `float` variables include a few we've seen before and some new
 347 ones. Notice here that we mostly use floating-point registers `xmm_`, but
 348 still use the general-purpose registers when dereferencing variables of type
 349 `(addr float)`.
 350
 351 ```
 352   var1/xreg <- add var2/xreg2
 353   var1/xreg <- add var2
 354   var1/xreg <- add *var2/reg2
 355
 356   var1/xreg <- subtract var2/xreg2
 357   var1/xreg <- subtract var2
 358   var1/xreg <- subtract *var2/reg2
 359
 360   var1/xreg <- multiply var2/xreg2
 361   var1/xreg <- multiply var2
 362   var1/xreg <- multiply *var2/reg2
 363
 364   var1/xreg <- divide var2/xreg2
 365   var1/xreg <- divide var2
 366   var1/xreg <- divide *var2/reg2
 367
 368   var1/xreg <- reciprocal var2/xreg2
 369   var1/xreg <- reciprocal var2
 370   var1/xreg <- reciprocal *var2/reg2
 371
 372   var1/xreg <- square-root var2/xreg2
 373   var1/xreg <- square-root var2
 374   var1/xreg <- square-root *var2/reg2
 375
 376   var1/xreg <- inverse-square-root var2/xreg2
 377   var1/xreg <- inverse-square-root var2
 378   var1/xreg <- inverse-square-root *var2/reg2
 379
 380   var1/xreg <- min var2/xreg2
 381   var1/xreg <- min var2
 382   var1/xreg <- min *var2/reg2
 383
 384   var1/xreg <- max var2/xreg2
 385   var1/xreg <- max var2
 386   var1/xreg <- max *var2/reg2
 387 ```
 388
 389 Two instructions in the above list are approximate. According to the Intel
 390 manual, `reciprocal` and `inverse-square-root` [go off the rails around the
 391 fourth decimal place](linux/x86_approx.md). If you need more precision, use
 392 `divide` separately.
 393
 394 ### Bitwise boolean operations
 395
 396 These require variables of non-`addr`, non-`float` types.
 397
 398 And:
 399 ```
 400   var1/reg1 <- and var2/reg2
 401   var1/reg <- and var2
 402   and-with var1, var2/reg
 403   var/reg <- and n
 404   and-with var, n
 405 ```
 406
 407 Or:
 408 ```
 409   var1/reg1 <- or var2/reg2
 410   var1/reg <- or var2
 411   or-with var1, var2/reg
 412   var/reg <- or n
 413   or-with var, n
 414 ```
 415
 416 Not:
 417 ```
 418   var1/reg1 <- not
 419   not var
 420 ```
 421
 422 Xor:
 423 ```
 424   var1/reg1 <- xor var2/reg2
 425   var1/reg <- xor var2
 426   xor-with var1, var2/reg
 427   var/reg <- xor n
 428   xor-with var, n
 429 ```
 430
 431 ### Bitwise shifts
 432
 433 Shifts require variables of non-`addr`, non-`float` types.
 434
 435 ```
 436   var/reg <- shift-left n
 437   var/reg <- shift-right n
 438   var/reg <- shift-right-signed n
 439   shift-left var, n
 440   shift-right var, n
 441   shift-right-signed var, n
 442 ```
 443
 444 Shifting bits left always inserts zeros on the right.
 445 Shifting bits right inserts zeros on the left by default.
 446 A _signed_ shift right duplicates the leftmost bit, thereby preserving the
 447 sign of an integer.
 448
 449 ## Operations on more complex types
 450
 451 These instructions work with any type `T`. As before we use `/reg` here to
 452 indicate when a variable must live in a register. We also include type
 453 constraints after a `:`.
 454
 455 ### Addresses and handles
 456
 457 You can compute the address of any variable in memory (never in registers):
 458
 459 ```
 460   var1/reg: (addr T) <- address var2: T
 461 ```
 462
 463 As mentioned up top, `addr` variables can never escape the function where
 464 they're computed. You can't store them on the heap, or in compound types.
 465 Think of them as short-lived things.
 466
 467 To manage long-lived addresses, _allocate_ them on the heap.
 468
 469 ```
 470   allocate var: (addr handle T)       # var can be in either register or memory
 471 ```
 472
 473 Handles can be copied and stored without restriction. However, they're too
 474 large to fit in a register. You also can't access their payload directly, you
 475 have to first convert them into a short-lived `addr` using _lookup_.
 476
 477 ```
 478   var1/eax: (addr T) <- lookup var2: (handle T)  # var2 in either register or memory
 479 ```
 480
 481 Since handles are large compound types, there's a special helper for comparing
 482 them:
 483
 484 ```
 485   var/eax: boolean <- handle-equal? var1: (handle T), var2: (handle T)
 486 ```
 487
 488 ### Arrays
 489
 490 Arrays are declared in two ways:
 491   1. On the stack with a literal size:
 492 ```
 493   var x: (array int 3)
 494 ```
 495   2. On the heap with a potentially variable size. For example:
 496 ```
 497   var x: (handle array int)
 498   var x-ah/eax: (addr handle array int) <- address x
 499   populate x-ah, 8
 500 ```
 501
 502 The `8` here can also be an int in a register or memory. (The `-ah` is a
 503 common variable naming convention and stands for "address of a handle".
 504 Essential for allocating long-lived data on the heap.)
 505
 506 You can compute the length of an array, though you'll need an `addr` to do so:
 507
 508 ```
 509   var/reg: int <- length arr/reg: (addr array T)
 510 ```
 511
 512 To read from or write to an array, use `index` to first obtain an address to
 513 read from or modify:
 514
 515 ```
 516   var/reg: (addr T) <- index arr/reg: (addr array T), n
 517   var/reg: (addr T) <- index arr: (array T len), n
 518 ```
 519
 520 Like our notation of `n`, `len` here is required to be a literal.
 521
 522 The index requested can also be a variable in a register, with one caveat:
 523
 524 ```
 525   var/reg: (addr T) <- index arr/reg: (addr array T), idx/reg: int
 526   var/reg: (addr T) <- index arr: (array T len), idx/reg: int
 527 ```
 528
 529 The caveat: the size of T must be 1, 2, 4 or 8 bytes. For other sizes of T
 530 you'll need to split up the work, performing a `compute-offset` before the
 531 `index`.
 532
 533 ```
 534   var/reg: (offset T) <- compute-offset arr: (addr array T), idx/reg: int     # arr can be in reg or mem
 535   var/reg: (offset T) <- compute-offset arr: (addr array T), idx: int         # arr can be in reg or mem
 536 ```
 537
 538 The result of a `compute-offset` statement can be passed to `index`:
 539
 540 ```
 541   var/reg: (addr T) <- index arr/reg: (addr array T), idx/reg: (offset T)
 542 ```
 543
 544 ### Streams
 545
 546 A common use for arrays is as buffers. Save a few items to a scratch space and
 547 then process them. This pattern is so common (we use it in files) that there's
 548 special support for it with a built-in type: `stream`.
 549
 550 Streams are like arrays in many ways. You can initialize them with a length on
 551 the stack:
 552
 553 ```
 554   var x: (stream int 3)
 555 ```
 556
 557 You can also populate them on the heap:
 558 ```
 559   var x: (handle stream int)
 560   var x-ah/eax: (addr handle stream int) <- address x
 561   populate-stream x-ah, 8
 562 ```
 563
 564 However, streams don't provide random access with an `index` instruction.
 565 Instead, you write to them sequentially, and read back what you wrote.
 566
 567 ```
 568   read-from-stream s: (addr stream T), out: (addr T)
 569   write-to-stream s: (addr stream T), in: (addr T)
 570 ```
 571
 572 Streams of bytes are particularly common for managing Unicode text, and there
 573 are a few functions to help with them:
 574
 575 ```
 576   write s: (addr stream byte), u: (addr array byte)  # write u to s, abort if full
 577   overflow?/eax: boolean <- try-write s: (addr stream byte), u: (addr array byte)
 578   write-stream dest: (addr stream byte), src: (addr stream byte)
 579   # bytes
 580   append-byte s: (addr stream byte), var: int  # write lower byte of var
 581   var/eax: byte <- read-byte s: (addr stream byte)
 582   # 32-bit code-point-utf8s encoded in UTF-8
 583   write-code-point-utf8 out: (addr stream byte), g: code-point-utf8
 584   g/eax: code-point-utf8 <- read-code-point-utf8 in: (addr stream byte)
 585 ```
 586
 587 You can check if a stream is empty or full:
 588
 589 ```
 590   var/eax: boolean <- stream-empty? s: (addr stream)
 591   var/eax: boolean <- stream-full? s: (addr stream)
 592 ```
 593
 594 You can clear streams:
 595
 596 ```
 597   clear-stream f: (addr stream T)
 598 ```
 599
 600 You can also rewind them to reread their contents:
 601
 602 ```
 603   rewind-stream f: (addr stream T)
 604 ```
 605
 606 ## Compound types
 607
 608 Primitive types can be combined together using the `type` keyword. For
 609 example:
 610
 611 ```
 612 type point {
 613   x: int
 614   y: int
 615 }
 616 ```
 617
 618 Mu programs are sequences of just `fn` and `type` definitions.
 619
 620 Compound types can't include `addr` types for safety reasons (use `handle` instead,
 621 which is described below). They also can't currently include `array`, `stream`
 622 or `byte` types. Since arrays and streams carry their size with them, supporting
 623 them in compound types complicates variable initialization. Instead of
 624 defining them inline in a type definition, define a `handle` to them. Bytes
 625 shouldn't be used for anything but UTF-8 strings.
 626
 627 To access within a compound type, use the `get` instruction. There are two
 628 forms. You need either a variable of the type itself (say `T`) in memory, or a
 629 variable of type `(addr T)` in a register.
 630
 631 ```
 632 var/reg: (addr T_f) <- get var/reg: (addr T), f
 633 var/reg: (addr T_f) <- get var: T, f
 634 ```
 635
 636 The `f` here is the field name from the `type` definition, and its type `T_f`
 637 must match the type of `f` in the `type` definition. For example, some legal
 638 instructions for the definition of `point` above:
 639
 640 ```
 641 var a/eax: (addr int) <- get p, x
 642 var a/eax: (addr int) <- get p, y
 643 ```
 644
 645 You can clear compound types using the `clear-object` function:
 646
 647 ```
 648 clear-object var: (addr T)
 649 ```
 650
 651 You can shallow-copy compound types using the `copy-object` function:
 652
 653 ```
 654 copy-object src: (addr T), dest: (addr T)
 655 ```