Source

The language is strongly typed, lexically scoped, and provides direct access to memory through load/stores. This is a low-level language without real data structures, lambdas/closures, nor memory management, but those can be written in this language.

The language is designed to expose the low-level flexibility of Wasm directly but in a friendly manner, which hides most of the complexity of the Wasm format.

It is intended to be used as for low-level Wasm programs, such as language runtimes (incl. memory allocators and gcs) for higher-level languages, or as a target for languages that want to compile directly to Wasm. It also helps explain how Wasm works by hiding some of the low-level details so the structure and semantics are clarified.

Introduction

Make WebAssembly Easy. WASE is a language, which tries to make WASM easy to write
directly. The language maps closely to WebAssembly, and compiles directly to Wasm
bytecode. As such, it can be seen as an alternative to the WAT text format,
but it is easier to use because of these things:

The language is strongly typed, lexically scoped, and provides direct access
to memory through load/stores. This is a low-level language without real data
structures, lambdas/closures, nor memory management, but those can be written
in this language.

The language is designed to expose the low-level flexibility of Wasm directly
but in a friendly manner, which hides most of the complexity of the Wasm format.

It is intended to be used as for low-level Wasm programs, such as language
runtimes (incl. memory allocators and gcs) for higher-level languages, or as
a target for languages that want to compile directly to Wasm. It also helps
explain how Wasm works by hiding some of the low-level details so the structure
and semantics are clarified.

Usage

To compile a tests/euler1.wase program to tests/euler1.wasm, clone this repository
and then use

bin\wase tests/euler1.wase

This requires a suitable Java Runtime in your path.

You can also consider to add wase/bin to your path.

You can also run it directly using java:

java -jar bin\wase.jar tests/euler1.-wase

Example

Here is a solution to the Project Euler challenge 1 (https://projecteuler.net/problem=1)
written in Wase:

// This includes Wasi imports and a printi32 helper
include lib/runtime;

// For Wasi, we have to export the memory
export memory 1;

// Iterative version using a loop
euler1Loop(limit : i32) -> i32 {
	var start = 1;
	var acc = 0;
	loop {
		// Breaks out of the function when start >= limit with the value acc
		break_if<1>(acc, start >= limit);
		if (start % 3 == 0 | start % 5 == 0) {
			acc := acc + start;
		};
		start := start + 1;
		// This is really continue and loops
		break<>();
	};
	// This is never reached, but we have to add this to get the right type
	acc
}

// A recursive implementation
foldRange(start : i32, end : i32, acc : i32) -> i32 {
	if (start <= end) {
		foldRange(start + 1, end, if (start % 3 == 0 | start % 5 == 0) {
			acc + start;
		} else acc)
	} else {
		acc;
	}
}

euler1(limit : i32) -> i32 {
	foldRange(1, limit - 1, 0);
}

// Wasi expects us to have a "_start" function exported
export _start() -> () {
	printi32(euler1Loop(1000)); // Correct: 233168
	printByte(10); // New line
	printi32(euler1(1000)); // Correct: 233168
	printByte(10); // New line
	{}
}

Compile with

bin\wase tests/euler1.wase

and run with wasmer (install)

wasmer tests/euler1.wasm
233168
233168

You can also run with wasm3, but it does not have as deep a stack, so the
recursion above causes a stack overflow unless you provide a bigger stack to
wasm3.

To compare Wase against Wat, check out the tests/ folder where the .wase
files have been compiled to .wasm and then decompiled to .wat.

Status

Beta. The compiler works, and the compiler can parse, type and compile
most instructions directly to WASM binaries that validate and run correctly.

One problem in daily use is that error messages are currently without positions.

This compiler is not validating. For example, you can use dynamic code in constant
contexts, which will be rejected by the Wasm runtime.

Join the flow9 discord, which also hosts discussions on Wase:

https://discord.gg/9gGJu6KU

The Wase Programming Language

Type system

Wase supports the types of Wasm:

i32, i64, f32, f64, v128, func, extern

At compile time, however, the compiler uses the real types for functions to ensure type
checking is complete. The syntax for functions is like this:

// A function that takes an i32 and a f64, and returns a i32
foo(i32, f64) -> i32 { 42 }

// Does not take any arguments, does not return anything
foo() -> () { }

// Multi-values is supported with this syntax: This function returns both an i32 and a f64
foo(i32) -> (i32, f64) { 
	[ 42, 3.141 ]
}

As a special type, there is also auto, where the type will be inferred by
the compiler:

// The return type of this function is inferred to be "i32"
bar() -> auto {
	1;
}

This also works for locals:

bar() -> auto {
	a : auto = 1;
	b = 2.0; // This is implicitly auto, and thus inferred
	a
}

The type inference is based on Hindley-Milner style unification, so it should be robust.

You can use explicit type annotations to verify types:

foo() -> auto {
	bar : i32
}

There are no implicit type conversions.

The compiler tracks whether variables and globals are mutable or not:

foo() {
	a = 1;
	a := 2; // Not allowed.

	var b = 1;
	b := 2; // This is allowed, since b is "var"
}

c : mutable i32 = 1;
bar() {
	c := 2; // This is allowed
}

Top-level Syntax

At the top-level, we have syntax for the different kinds of sections in
Wasm. This includes globals, functions, imports, tables and data.

The compiler will reorder the top-level, so imports, tables, memory and data
come first in the original order, and then after that, globals and functions
in their original order, in accordance with Wasm requirements.

Functions can call each other in mutual recursion, but globals can only refer
to globals that occur before themselves.

Global syntax

The grammar for globals is like this:

// Constant global
<id> : <type> = <expr>;

// Mutable global
<id> : mutable <type> = <expr>;

// Exported global
export <id> : <type> = <expr>;

// Exported mutable global
export <id> : mutable <type> = <expr>;

Some examples:

pi : f64 = 3.14159265359;
counter : mutable i32 = 0;
// Exports this global to the host with the name "secret"
export secret : i32 = 0xdeadbeaf;
// Exports this mutable global with the name "changes"
export changes : mutable f64 = 1.023;
// Exports this global to the host with the name "external"
export "external" internal : i32 = 1;

Function syntax

// Functions
foo(arg : i32) -> i32 {
	a : i32 = 1;
	a + arg
}

// This function is exported to the host using the name "foo"
export foo(arg : i32) -> i32 {
	arg
}

// Exports this function using the "_start" name to the host, in compliance with Wasi
export "_start" start() -> () {
	...
}

main() -> () {
	// This is the initial function automatically called otherwise
	...
}

If the program contains a function called main, it is marked as the
function that starts the program.

Include

Wase code can be split into multiple files, and included using this syntax at the
top-level:

include lib/runtime;

The compiler will read the file wase/lib/runtime.wase, parse it, and splice the
result into that place in the code. If a file is included more than once (also
transitively), it is only included the first occurrence.

The program is only type checked after includes are resolved.

Expressions

The body of globals and functions are expressions. There are no
statements, but only expressions. The syntax is close to the
typescript family:

h32 : i32 = 1;	// Int constants are i32
h32 : i32 = 0xDEADBEEF; // Hex notation

h64 : i64 = 1w; // w suffix for i64
h64 : i64 = 0x12345678DEADBEEFw; // Hex notation with suffix w for i64

fl32 : f32 = 0xDEADBEEFn; // Binary representation

fl64 : f64 = 10.0; // Floats are f64 by default
fl64 : f64 = 0x2345678DEADBEEFh; // Binary representation

// Let-binding have scope
a : i32 = 1;
<scope> // "a" is defined in this scope

var v : i32 = 1;
v := 2; // set local or global, when mutable

// Arithmetic, all signed.
1 + 2 / 3 * 4 % 5

// f64 arithmetic
23.4 + 2.3 * 3.141

// Bitwise operations
1 & 3 | 5 ^ 7

// Comparisons are signed. Use instructions for unsigned comparisons
1 < 5 | 5 >= 2 & a == 2.0 & lt_u<>(b, c) 

// Tuples, aka multi-values
[1, 2.0, 45]

// Tuple-let: This expands into multiple let-bindings with names like "t.0" and "t.1"
t : (i32, f64) = [1, 3.141];

Control flow

// Function call
foo(1, 2) 

if (a) b else c;
if (a) b;

// Sequence
{
	a();	// An implicit drop is added here
	b();	// An implicit drop is added here
	[1, 2];  // Two implicit drops are added here
	c
}

// Return from the function.
foo() -> i32 {
	if (early) {
		return 1;	 // Has to match function return type
	}
	code;
}
bar() -> () {
	if (early) {
		// Matches no result from function return type
		return;
	}
	code
}

Blocks

Wasm uses blocks for control flow. Each function introduces a block.
The block and loop constructs also do that, as well as the then
and else branches of if. The break and break_if instructions
refer to this stack of blocks. A break with level 0 breaks out of the
inner most block, while break<1>() breaks out of the next level. The number defines what parent block to break to.

block {
	code;
	// This breaks to the end of the block
	break_if<>(earlyStop);
	code;
}

loop {
	code;
	// In loops, the break goes to the top of the loop,
	// so this is an infinite loop
	break<>();
}

foo() -> () {
	loop {
		code;
		// This breaks out of the function
		break_if<1>(stopCondition);
		// This is really continue
		break<>();
	}
}

foo () -> f64 {
	loop {
		// Here, the break returns 3.141 from the function
		// since block one level up is the function, which
		// returns f64
		break_if<1>(3.141, earlyStop)
	}
}

// Here is a simple do-while loop, which prints from 1 to 10 and then F,
// but not E.
block {
	var i = 1;
	loop {
		printi32(i);
		printByte(10);
		i := i + 1;
		// This breaks out of the upper loop
		break_if<1>(i > 10);
		// This is really continue inside this loop
		break<>();
	};
	// Never reached "E"
	printByte(69);
	{}
};
// This is "F"
printByte(70);

TODO: illustrations (blocks with colors? with colorful brackets / font)

Indirect calls

Wase helps make indirect calls easier to work with. Consider a set of functions like this:

foo() -> i32 { 42 }
bar() -> i32 { 666	}
add1(v : i32) -> i32 { v + 1 }
sub1(v : i32) -> i32 { v - 1 }

Now it is possible to treat these functions as pointers, and call them indirectly using
call_indirect using the fn<id> construct:

call_indirect<>(fn<bar>());  // 666
call_indirect<>(fn<foo>());  // 42

call_indirect<>(41, fn<add1>()); // 42
call_indirect<>(43, fn<sub1>()); // 43

Behind the scenes, Wase will collect all fn<id> in the program, construct a element table for
them, and resolve them into i32 as required by call_indirect. If you treat these functions
as first order, the type of fn<bar> is i32. Also notice that the type of the indirect function is
inferred, so you might have to add a type annotation to resolve what the resulting type is:

myfn : i32 = if (cond) fn<bar>() else fn<foo>();
// We add a type annotation to "explain" that this call is for functions of type () -> i32 
call_indirect(myfn) : i32;

Low level instructions

The low-level instructions in Wase have the form

instruction<pars>(args)

where pars are parameters for the operation decided at compile time,
while args are arguments to the instruction put on the stack.

The instruction has the same name as the corresponding Wasm WAT format.

Load/store

Loads from memory are written like this:

load<>(index);

The width of the load is inferred from the use of the value.

// This is i32 load, since a is i32
a : i23 = load<>(0);

// This is f32 load, since f is f32
f : f32 = load<>(32):

Stores are written like this:

store<>(index, value);

The width of the store is inferred from the type of the value.

// This is f64.store, because 2.0 is f64
store<>(32, 2.0);

Loads and stores also exist in versions that work with smaller bit-widths:

// Loads a byte from the given memory address, sign extending it into a i32 or i64
signedByteValue : i32 = load8_s<>(0);
signedByteValue64 : i64 = load8_s<>(0);

// Loads an unsigned byte from the given memory address into a i32 or i64
byteValue : i32 = load8_u<>(0);
byteValue64 : i64 = load8_u<>(0);

sword : i32 = load16_s<>(0);
sword64 : i64 = load16_s<>(0);
uword : i32 = load16_u<>(0);
uword64 : i64 = load16_u<>(0);

sint32 : i64 = load32_s<>(0);
uint32 : i64 = load32_u<>(0);

// Stores the byte "32" at address 0
store8<>(0, 32)

// The same, except it comes from an i64
int64 : i64 = big;
store8<>(0, big); // Only picks the lower 8 bits of "big"

// Stores the lower 16 bits of the value at the given address
store16<>(address, value);

// Stores the lower 32 bits of the value at the given address
store32<>(address, value);

You can also define the offset and alignment explicitly:

load<offset>(index)
load<offset, align>(index)
store<offset>(index, value)
store<offset, alignment>(index, value)

The alignment is expressed as what power of 2 to use:

v : i32 = load8_u<0, 0>(i) // alignment at 1 byte
v : i32 = load16_u<0, 1>(i) // alignment at 2 bytes
v : i32 = load32_u<0, 2>(i) // alignment at 4 bytes
v : i64 = load_u<0, 3>(i) // alignment at 8 bytes

TODO: illustrations (offset, alignment)

Instructions

4 instructions plus all v128 SIMD instructions are not implemented yet:

table.init and elem.drop
memory.init and data.drop

Control instructions

| Wasm | Wase | Comments |
|-|-|-|
| block | block { exp }| Type is inferred
| loop | loop { exp } | Type is inferred
| if | if (cond) exp | Type is inferred
| ifelse | if (cond) exp else exp | Type is inferred
| unreachable | unreachable<>() |
| nop | nop<>() | No operation
| br | break<>() or break<int>() or break<>(val)or break<int>(val) | Default break level is 0. If there is a val, that is what we return with this break
| br_if | break_if<int>(cond) or break_if<>(cond) or break_if<int>(val, cond) or break_if<>(val, cond) | Default break level is 0. If there is a val, that is what the break returns
| br_table | br_table<l0, l1, l2, ..., ln, default>(index) or br_table<l0, l1, l2, ..., ln, default>(val, index) | Breaks the number of levels as indexed by the index. If a val is given, that is what the break returns. See tests/break_table.wase for details.
| return | return or return exp |
| call | fn(args) |
| call_indirect | call_indirect<>(args, fn<id>()) or call_indirect<>(fn<id>()) | The fn<id>() construct will convert the function id to an index into an automatically constructed element table of function index pointers.

Reference Instructions

| Wasm | Wase | Comments |
|-|-|-|
| ref.null | ref.null<func>() or ref_null<extern>() | Construct a null function or extern reference
| ref.is_null | exp is null | Is this function or extern reference null?
| ref.func | ref.func<id> | Constructs an opaque reference to a named function. Can be used in tables. Automatically constructs a element table for the referenced functions

Parametric Instructions

| Wasm | Wase | Comments |
|-|-|-|
| drop | drop<> or implicit in sequence {1;2} | The explicit drop<>() variant causes stack errors, since Wase is designed to be stack-safe.
| select | select<>(then, else, cond) | This is an eager if, where both then and else are always evaluated, but only one chosen based on the condition. This is branch-less so can be more efficient than normal if. (Automatically chooses the ref instruction version based on the type.)

Variable Instructions

| Wasm | Wase | Comments |
|-|-|-|
| local.get | id
| local.set | id := exp
| local.tee | id ::= exp | Sets the value like local.set, but returns the result as well, like in C
| global.get | id
| global.set | id := exp

Table Instructions

table.init and elem.drop not implemented yet. Requires elements first.

| Wasm | Wase| Comments |
|-|-|-|
| table.get | table.get<id>(index : i32) | Retrieves a value from a table slot. The id the name of a table given by import.table
| table.set | table.set<id>(index : i32, value : func/extern) | Sets a value in a table slot.
| table.size | table.size<id>() | Returns the size of a table.
| table.grow | table.grow<id>(init, size) | Changes the size of a table, initializing with the init value in empty slots. Returns previous size.
| table.copy | table.copy<id1, id2>(elems : i32, source : i32, dest : i32) | Copies elems slots from one area of a table id1 to another table id2
| table.fill | table.fill<id>(elements, value, dest) | Sets elements slots start at destto the value
| table.init | table.init<tableid, elemid>(elements : i32, source : i32, dest : i32) | Initializes a table with elements slots from an element section
| elem.drop | elem.drop<id>() | Discards the memory in an element segment.

Memory Instructions

memory.init and data.drop not implemented yet. Requires data indexing.

| Wasm | Wase | Comments |
-|-|-|
| *.load | load<>(address) or load<offset>(address) or load<offset, align>(address)| The type is inferred from the use.
| *.load(8,16,32)_(s,u) | load(8,16,32)_(s,u)<>(address) | Load the lower N bits from a memory address. _s implies sign-extension. The type is inferred from the use
| *.store | store<>(address, value) | The width is inferred from the value.
| *.store(8,16,32) | store(8,16,32)<>(address, value) | Store the lower N bits of a value. The width is inferred from the value
| memory.size | memory.size<>() | Returns the unsigned size of memory in terms of pages (64k)
| memory.grow | memory.grow<>(size) | Increases the memory by size pages. Returns the previous size of memory, or -1 if memory can not increase
| memory.copy | memory.copy<>(dest, source, bytes) | Copy bytes bytes from source to destination
| memory.fill | memory.fill<>(dest, bytevalue, bytes) | Fills bytes bytes with the given byte value at dest
| memory.init | memory.init<id>(dest, source, bytes) | Copies bytes from a data section <id> at address source into memory starting at address dest
| data.drop | data.drop<id>() | Frees the memory of data segment

Numeric Instructions

| Wasm | Wase | Comments |
|-|-|-|
| i32.const | 1 or 0x1234
| i64.const | 2w or 0x1234w
| f32.const | 0x1234n | We do not have natural, number syntax for f32
| f64.const | 3.1 or 0x123h
| *.clz | clz<>(exp) | Returns the number of leading zero bits. The width is inferred
| *.ctz | ctz<>(exp) | Returns the number of trailing zero bits. The width is inferred
| *.popcnt | popcnt<>(exp) | Returns the number of 1-bits. The width is inferred
| *.add | <exp> + <exp> | The width is inferred
| *.sub | <exp> - <exp> | The width is inferred
| *.mul | <exp> * <exp> | The width is inferred
| *.div_s | <exp> / <exp> | Signed division. Rounds towards zero. The width is inferred
| *.div_u | div_u<>(<exp>, <exp>) | Unsigned division. The width is inferred
| *.div | <exp> / <exp> | Signed division. The width is inferred
| *.rem_s | <exp> % <exp> | Signed remainder. The width is inferred
| *.rem_u | rem_u<>(<exp>, <exp>) | Unsigned remainder. The width is inferred
| *.and | <exp> & <exp> | Bitwise and. The width is inferred
| *.or | <exp> \| <exp> | Bitwise or. The width is inferred
| *.xor | <exp> ^ <exp> | Bitwise xor. The width is inferred
| *.shl | shl<>(val, bits) | Shift left, i.e. multiplication of power of two. The width is inferred
| *.shr_s | shr_s<>(val, bits) | Signed right shift. Division by power of two, rounding down. The width is inferred
| *.shr_u | shr_u<>(val, bits) | Unsigned right shift. Division by power of two. The width is inferred
| *.rotl | rotl<>(val, bits) | Rotate left. Bits "loop" around. The width is inferred
| *.rotr | rotr<>(val, bits) | Rotate right. Bits "loop" around. The width is inferred
| *.abs | abs<>(val) | Absolute value of floats. The width is inferred
| *.neg | -2.0 | Negate floating point value. The width is inferred
| *.ceil | ceil<>(val) | Rounds up to the nearest integer. The width is inferred
| *.floor | floor<>(val) | Rounds down to the nearest integer. The width is inferred
| *.trunc | trunc<>(val) | Discard the fractional part, rounding to integer towards zero. The width is inferred
| *.nearest | nearest<>(val) | Round to the nearest integer, with ties rounded toward the value with an even least-significant digit. The width is inferred
| *.sqrt | sqrt<>(val) | Square root. The width is inferred
| *.min | min<>(val, val) | Minimum of two values. NaN wins. The width is inferred
| *.max | max<>(val, val) | Maximum of two values. NaN wins. The width is inferred
| *.copysign | copysign<>(val, sign) | Copies the sign from sign into the value of val. The width is inferred
| *.eqz | eqz<>(val) | Is the value equal to 0? The width is inferred
| *.eq | val == val | The width is inferred
| *.ne | val != val | The width is inferred
| *.lt_s | val < val | The width is inferred
| *.lt_u | lt_u<>(val, val) | Unsigned comparison. The width is inferred
| *.gt_s | val > val | The width is inferred
| *.gt_u | gt_u<>(val, val) | Unsigned comparison. The width is inferred
| *.le_s | val <= val | The width is inferred
| *.le_u | le_u<>(val, val) | Unsigned comparison. The width is inferred
| *.ge_s | val >= val | The width is inferred
| *.ge_u | ge_u<>(val, val) | Unsigned comparison. The width is inferred
| i32.wrap_i64 | wrap<>(val) | Takes lower 32 bits of an i64.
| *.trunc*_s | trunc_s<>(val) | Converts f32 or f64 to i32 or i64, considering the result signed.
| *.trunc*_u | trunc_u<>(val) | Converts f32 or f64 to i32 or i64, considering the result unsigned.
| *.trunc_sat*_s | trunc_sat_s<>(val) | Converts f32 or f64 to i32 or i64, considering the result signed, saturating.
| *.trunc_sat*_u | trunc_sat_u<>(val) | Converts f32 or f64 to i32 or i64, considering the result unsigned, and saturating.
| *.extend_i32_s | extend_s<>(val) | Lifts a i32 to a i64, as signed.
| *.extend_i32_u | extend_u<>(val) | Lifts a i32 to a i64, as unsigned.
| *.extend8_s | extend8_s<>(val) | Lifts a byte to a i32/i64, as signed. The val is the same type as the result
| *.extend16_s | extend16_s<>(val) | Lifts 16 bits to a i32/i64, as signed. The val is the same type as the result
| *.convert_*_s | convert_s<>(val) | Lifts signed i32/i64 to f32/f64.
| *.convert*_u | convert_u<>(val) | Lifts unsigned i32/i64 to f32/f64.
| *.demote_f64 | demote*<>(val) | Lowers a f64 to f32
| *.promote_f32 | promote*<>(val) | Lifts a f32 to a f64
| *.reinterpret* | reinterpret<>(val) | Taking bit pattern of i32/i64/f32/f64 and interpret as f32/f64/i32/i64 correspondingly.

Vector Instructions

| Wasm | Wase | Comments |
|-|-|-|
| v128.const | v128.const | Create a constant 128-bit vector. This instructions has a 16 i8 byte, or 8 i16, or 4 i32, or 0 parameters and 0 arguments.
| i8x16.shuffle | i8x16.shuffle | Shuffle 16 of 8-bit lanes. This instruction has exactly 16 parameters of values in the interval 0-31 and 0 arguments.

Memory instructions

| Wasm | Wase | Comments |
|-|-|-|
| v128.load | v128.load | v128.load
| v128.load8x8_s | v128.load8x8_s | v128.load8x8_s
| v128.load8x8_u | v128.load8x8_u | v128.load8x8_u
| v128.load16x4_u | v128.load16x4_u | v128.load16x4_u
| v128.load16x4_s | v128.load16x4_s | v128.load16x4_s
| v128.load32x2_u | v128.load32x2_u | v128.load32x2_u
| v128.load32x2_s | v128.load32x2_s | v128.load32x2_s
| v128.load8_splat | v128.load8_splat | v128.load8_splat
| v128.load16_splat | v128.load16_splat | v128.load16_splat
| v128.load32_splat | v128.load32_splat | v128.load32_splat
| v128.load64_splat | v128.load64_splat | v128.load64_splat
| v128.load32_zero | v128.load32_zero | v128.load32_zero
| v128.load64_splat | v128.load64_splat | v128.load64_splat
| v128.store | v128.store | v128.store
| v128.load8_lane | v128.load8_lane | v128.load8_lane
| v128.load16_lane | v128.load16_lane | v128.load16_lane
| v128.load32_lane | v128.load32_lane | v128.load32_lane
| v128.load64_lane | v128.load64_lane | v128.load64_lane
| v128.store8_lane | v128.store8_lane | v128.store8_lane
| v128.store16_lane | v128.store16_lane | v128.store16_lane
| v128.store32_lane | v128.store32_lane | v128.store32_lane
| v128.store64_lane | v128.store64_lane | v128.store64_lane

Extract/replace lane instructions

| Wasm | Wase | Comments |
|-|-|-|
| i8x16.extract_lane_s | i8x16.extract_lane_s | i8x16.extract_lane_s
| i8x16.extract_lane_u | i8x16.extract_lane_u | i8x16.extract_lane_u
| i8x16.replace_lane | i8x16.replace_lane | i8x16.replace_lane
| i16x8.extract_lane_s | i16x8.extract_lane_s | i16x8.extract_lane_s
| i16x8.extract_lane_u | i16x8.extract_lane_u | i16x8.extract_lane_u
| i16x8.replace_lane | i16x8.replace_lane | i16x8.replace_lane
| i32x4.extract_lane | i32x4.extract_lane | i32x4.extract_lane
| i32x4.replace_lane | i32x4.replace_lane | i32x4.replace_lane
| i64x2.extract_lane | i64x2.extract_lane | i64x2.extract_lane
| i64x2.replace_lane | i64x2.replace_lane | i64x2.replace_lane
| f32x4.extract_lane | f32x4.extract_lane | f32x4.extract_lane
| f32x4.replace_lane | f32x4.replace_lane | f32x4.replace_lane
| f64x2.extract_lane | f64x2.extract_lane | f64x2.extract_lane
| f64x2.replace_lane | f64x2.replace_lane | f64x2.replace_lane

Swizzle/splat instrucions

| Wasm | Wase | Comments |
|-|-|-|
| i8x16.swizzle | i8x16.swizzle | i8x16.swizzle
| i8x16.splat | i8x16.splat | i8x16.splat
| i16x8.splat | i16x8.splat | i16x8.splat
| i32x4.splat | i32x4.splat | i32x4.splat
| i64x2.splat | i64x2.splat | i64x2.splat
| f32x4.splat | f32x4.splat | f32x4.splat
| f64x2.splat | f64x2.splat | f64x2.splat

Relational i8x16 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i8x16.eq | i8x16.eq | i8x16.eq
| i8x16.ne | i8x16.ne | i8x16.ne
| i8x16.lt_s | i8x16.lt_s | i8x16.lt_s
| i8x16.lt_u | i8x16.lt_u | i8x16.lt_u
| i8x16.gt_s | i8x16.gt_s | i8x16.gt_s
| i8x16.gt_u | i8x16.gt_u | i8x16.gt_u
| i8x16.le_s | i8x16.le_s | i8x16.le_s
| i8x16.le_u | i8x16.le_u | i8x16.le_u
| i8x16.ge_s | i8x16.ge_s | i8x16.ge_s
| i8x16.ge_u | i8x16.ge_u | i8x16.ge_u

Relational i16x8 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i16x8.eq | i16x8.eq | i16x8.eq
| i16x8.ne | i16x8.ne | i16x8.ne
| i16x8.lt_s | i16x8.lt_s | i16x8.lt_s
| i16x8.lt_u | i16x8.lt_u | i16x8.lt_u
| i16x8.gt_s | i16x8.gt_s | i16x8.gt_s
| i16x8.gt_u | i16x8.gt_u | i16x8.gt_u
| i16x8.le_s | i16x8.le_s | i16x8.le_s
| i16x8.le_u | i16x8.le_u | i16x8.le_u
| i16x8.ge_s | i16x8.ge_s | i16x8.ge_s
| i16x8.ge_u | i16x8.ge_u | i16x8.ge_u

Relational i32x4 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i32x4.eq | i32x4.eq | i32x4.eq
| i32x4.ne | i32x4.ne | i32x4.ne
| i32x4.lt_s | i32x4.lt_s | i32x4.lt_s
| i32x4.lt_u | i32x4.lt_u | i32x4.lt_u
| i32x4.gt_s | i32x4.gt_s | i32x4.gt_s
| i32x4.gt_u | i32x4.gt_u | i32x4.gt_u
| i32x4.le_s | i32x4.le_s | i32x4.le_s
| i32x4.le_u | i32x4.le_u | i32x4.le_u
| i32x4.ge_s | i32x4.ge_s | i32x4.ge_s
| i32x4.ge_u | i32x4.ge_u | i32x4.ge_u

Relational i64x2 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i64x2.eq | i64x2.eq | i64x2.eq
| i64x2.lt_s | i64x2.lt_s | i64x2.lt_s
| i64x2.gt_s | i64x2.gt_s | i64x2.gt_s
| i64x2.le_s | i64x2.le_s | i64x2.le_s
| i64x2.ge_s | i64x2.ge_s | i64x2.ge_s

Relational f32x4 instructions

| Wasm | Wase | Comments |
|-|-|-|
| f32x4.eq | f32x4.eq | f32x4.eq
| f32x4.lt_s | f32x4.lt_s | f32x4.lt_s
| f32x4.gt_s | f32x4.gt_s | f32x4.gt_s
| f32x4.le_s | f32x4.le_s | f32x4.le_s
| f32x4.ge_s | f32x4.ge_s | f32x4.ge_s

Relational f64x2 instructions

| Wasm | Wase | Comments |
|-|-|-|
| f64x2.eq | f64x2.eq | f64x2.eq
| f64x2.lt_s | f64x2.lt_s | f64x2.lt_s
| f64x2.gt_s | f64x2.gt_s | f64x2.gt_s
| f64x2.le_s | f64x2.le_s | f64x2.le_s
| f64x2.ge_s | f64x2.ge_s | f64x2.ge_s

v128 instructions

| Wasm | Wase | Comments |
|-|-|-|
| v128.not | v128.not | v128.not
| v128.and | v128.and | v128.and
| v128.andnot | v128.andnot | v128.andnot
| v128.or | v128.or | v128.or
| v128.xor | v128.xor | v128.xor
| v128.bitselect | v128.bitselect | v128.bitselect
| v128.any_true | v128.any_true | v128.any_true

i8x16 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i8x16.abs | i8x16.abs | i8x16.abs
| i8x16.neg | i8x16.neg | i8x16.neg
| i8x16.popcnt | i8x16.popcnt | i8x16.popcnt
| i8x16.all_true | i8x16.all_true | i8x16.all_true
| i8x16.bitmask | i8x16.bitmask | i8x16.bitmask
| i8x16.narrow_i8x16_s | i8x16.narrow_i8x16_s | i8x16.narrow_i8x16_s
| i8x16.narrow_i8x16_u | i8x16.narrow_i8x16_u | i8x16.narrow_i8x16_u
| i8x16.shl | i8x16.shl | i8x16.shl
| i8x16.shr_s | i8x16.shr_s | i8x16.shr_s
| i8x16.shr_u | i8x16.shr_u | i8x16.shr_u
| i8x16.add | i8x16.add | i8x16.add
| i8x16.add_sat_s | i8x16.add_sat_s | i8x16.add_sat_s
| i8x16.add_sat_u | i8x16.add_sat_u | i8x16.add_sat_u
| i8x16.sub | i8x16.sub | i8x16.sub
| i8x16.sub_sat_s | i8x16.sub_sat_s | i8x16.sub_sat_s
| i8x16.sub_sat_u | i8x16.sub_sat_u | i8x16.sub_sat_u
| i8x16.min_s | i8x16.min_s | i8x16.min_s
| i8x16.min_u | i8x16.min_u | i8x16.min_u
| i8x16.max_s | i8x16.max_s | i8x16.max_s
| i8x16.max_u | i8x16.max_u | i8x16.max_u
| i8x16.avgr_u | i8x16.avgr_u | i8x16.avgr_u

i16x8 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i16x8.extract_add_pairwise_i8x16_s | i16x8.extract_add_pairwise_i8x16_s | i16x8.extract_add_pairwise_i8x16_s
| i16x8.extract_add_pairwise_i8x16_u | i16x8.extract_add_pairwise_i8x16_u | i16x8.extract_add_pairwise_i8x16_u
| i16x8.abs | i16x8.abs | i16x8.abs
| i16x8.neg | i16x8.neg | i16x8.neg
| i16x8.q15mulr_sat_s | i16x8.q15mulr_sat_s | i16x8.q15mulr_sat_s
| i16x8.all_true | i16x8.all_true | i16x8.all_true
| i16x8.bitmask | i16x8.bitmask | i16x8.bitmask
| i16x8.narrow_i32x4_s | i16x8.narrow_i32x4_s | i16x8.narrow_i32x4_s
| i16x8.narrow_i32x4_u | i16x8.narrow_i32x4_u | i16x8.narrow_i32x4_u
| i16x8.extend_low_i8x16_s | i16x8.extend_low_i8x16_s | i16x8.extend_low_i8x16_s
| i16x8.extend_high_i8x16_s | i16x8.extend_high_i8x16_s | i16x8.extend_high_i8x16_s
| i16x8.extend_low_i8x16_u | i16x8.extend_low_i8x16_u | i16x8.extend_low_i8x16_u
| i16x8.extend_high_i8x16_u | i16x8.extend_high_i8x16_u | i16x8.extend_high_i8x16_u
| i16x8.shl | i16x8.shl | i16x8.shl
| i16x8.shr_s | i16x8.shr_s | i16x8.shr_s
| i16x8.shr_u | i16x8.shr_u | i16x8.shr_u
| i16x8.add | i16x8.add | i16x8.add
| i16x8.add_sat_s | i16x8.add_sat_s | i16x8.add_sat_s
| i16x8.add_sat_u | i16x8.add_sat_u | i16x8.add_sat_u
| i16x8.sub | i16x8.sub | i16x8.sub
| i16x8.sub_sat_s | i16x8.sub_sat_s | i16x8.sub_sat_s
| i16x8.sub_sat_u | i16x8.sub_sat_u | i16x8.sub_sat_u
| i16x8.mul | i16x8.mul | i16x8.mul
| i16x8.min_s | i16x8.min_s | i16x8.min_s
| i16x8.min_u | i16x8.min_u | i16x8.min_u
| i16x8.max_s | i16x8.max_s | i16x8.max_s
| i16x8.max_u | i16x8.max_u | i16x8.max_u
| i16x8.avgr_u | i16x8.avgr_u | i16x8.avgr_u
| i16x8.extmul_low_i8x16_s | i16x8.extmul_low_i8x16_s | i16x8.extmul_low_i8x16_s
| i16x8.extmul_high_i8x16_s | i16x8.extmul_high_i8x16_s | i16x8.extmul_high_i8x16_s
| i16x8.extmul_low_i8x16_u | i16x8.extmul_low_i8x16_u | i16x8.extmul_low_i8x16_u
| i16x8.extmul_high_i8x16_u | i16x8.extmul_high_i8x16_u | i16x8.extmul_high_i8x16_u

i32x4 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i32x4.extract_add_pairwise_i16x8_s | i32x4.extract_add_pairwise_i16x8_s | i32x4.extract_add_pairwise_i16x8_s
| i32x4.extract_add_pairwise_i16x8_u | i32x4.extract_add_pairwise_i16x8_u | i32x4.extract_add_pairwise_i16x8_u
| i32x4.abs | i32x4.abs | i32x4.abs
| i32x4.neg | i32x4.neg | i32x4.neg
| i32x4.all_true | i32x4.all_true | i32x4.all_true
| i32x4.bitmask | i32x4.bitmask | i32x4.bitmask
| i32x4.extend_low_i16x8_s | i32x4.extend_low_i16x8_s | i32x4.extend_low_i16x8_s
| i32x4.extend_high_i16x8_s | i32x4.extend_high_i16x8_s | i32x4.extend_high_i16x8_s
| i32x4.extend_low_i16x8_u | i32x4.extend_low_i16x8_u | i32x4.extend_low_i16x8_u
| i32x4.extend_high_i16x8_u | i32x4.extend_high_i16x8_u | i32x4.extend_high_i16x8_u
| i32x4.shl | i32x4.shl | i32x4.shl
| i32x4.shr_s | i32x4.shr_s | i32x4.shr_s
| i32x4.shr_u | i32x4.shr_u | i32x4.shr_u
| i32x4.add | i32x4.add | i32x4.add
| i32x4.sub | i32x4.sub | i32x4.sub
| i32x4.mul | i32x4.mul | i32x4.mul
| i32x4.min_s | i32x4.min_s | i32x4.min_s
| i32x4.min_u | i32x4.min_u | i32x4.min_u
| i32x4.max_s | i32x4.max_s | i32x4.max_s
| i32x4.max_u | i32x4.max_u | i32x4.max_u
| i32x4.dot_i16x8_s | i32x4.dot_i16x8_s | i32x4.dot_i16x8_s
| i32x4.extmul_low_i16x8_s | i32x4.extmul_low_i16x8_s | i32x4.extmul_low_i16x8_s
| i32x4.extmul_high_i16x8_s | i32x4.extmul_high_i16x8_s | i32x4.extmul_high_i16x8_s
| i32x4.extmul_low_i16x8_u | i32x4.extmul_low_i16x8_u | i32x4.extmul_low_i16x8_u
| i32x4.extmul_high_i16x8_u | i32x4.extmul_high_i16x8_u | i32x4.extmul_high_i16x8_u

i64x2 instructions

| Wasm | Wase | Comments |
|-|-|-|
| i64x2.abs | i64x2.abs | i64x2.abs
| i64x2.neg | i64x2.neg | i64x2.neg
| i64x2.all_true | i64x2.all_true | i64x2.all_true
| i64x2.bitmask | i64x2.bitmask | i64x2.bitmask
| i64x2.extend_low_i32x4_s | i64x2.extend_low_i32x4_s | i64x2.extend_low_i32x4_s
| i64x2.extend_high_i32x4_s | i64x2.extend_high_i32x4_s | i64x2.extend_high_i32x4_s
| i64x2.extend_low_i32x4_u | i64x2.extend_low_i32x4_u | i64x2.extend_low_i32x4_u
| i64x2.extend_high_i32x4_u | i64x2.extend_high_i32x4_u | i64x2.extend_high_i32x4_u
| i64x2.shl | i64x2.shl | i64x2.shl
| i64x2.shr_s | i64x2.shr_s | i64x2.shr_s
| i64x2.shr_u | i64x2.shr_u | i64x2.shr_u
| i64x2.add | i64x2.add | i64x2.add
| i64x2.sub | i64x2.sub | i64x2.sub
| i64x2.mul | i64x2.mul | i64x2.mul
| i64x2.extmul_low_i32x4_s | i64x2.extmul_low_i32x4_s | i64x2.extmul_low_i32x4_s
| i64x2.extmul_high_i32x4_s | i64x2.extmul_high_i32x4_s | i64x2.extmul_high_i32x4_s
| i64x2.extmul_low_i32x4_u | i64x2.extmul_low_i32x4_u | i64x2.extmul_low_i32x4_u
| i64x2.extmul_high_i32x4_u | i64x2.extmul_high_i32x4_u | i64x2.extmul_high_i32x4_u

f32x4 instructions

| Wasm | Wase | Comments |
|-|-|-|
| f32x4.ceil | f32x4.ceil | f32x4.ceil
| f32x4.floor | f32x4.floor | f32x4.floor
| f32x4.trunc | f32x4.trunc | f32x4.trunc
| f32x4.nearest | f32x4.nearest | f32x4.nearest
| f32x4.abs | f32x4.abs | f32x4.abs
| f32x4.neg | f32x4.neg | f32x4.neg
| f32x4.sqrt | f32x4.sqrt | f32x4.sqrt
| f32x4.add | f32x4.add | f32x4.add
| f32x4.sub | f32x4.sub | f32x4.sub
| f32x4.mul | f32x4.mul | f32x4.mul
| f32x4.div | f32x4.div | f32x4.div
| f32x4.min | f32x4.min | f32x4.min
| f32x4.max | f32x4.max | f32x4.max
| f32x4.pmin | f32x4.pmin | f32x4.pmin
| f32x4.pmax | f32x4.pmax | f32x4.pmax

f64x2 instructions

| Wasm | Wase | Comments |
|-|-|-|
| f64x2.ceil | f64x2.ceil | f64x2.ceil
| f64x2.floor | f64x2.floor | f64x2.floor
| f64x2.trunc | f64x2.trunc | f64x2.trunc
| f64x2.nearest | f64x2.nearest | f64x2.nearest
| f64x2.abs | f64x2.abs | f64x2.abs
| f64x2.neg | f64x2.neg | f64x2.neg
| f64x2.sqrt | f64x2.sqrt | f64x2.sqrt
| f64x2.add | f64x2.add | f64x2.add
| f64x2.sub | f64x2.sub | f64x2.sub
| f64x2.mul | f64x2.mul | f64x2.mul
| f64x2.div | f64x2.div | f64x2.div
| f64x2.min | f64x2.min | f64x2.min
| f64x2.max | f64x2.max | f64x2.max
| f64x2.pmin | f64x2.pmin | f64x2.pmin
| f64x2.pmax | f64x2.pmax | f64x2.pmax

Trunc/convert instructions

| Wasm | Wase | Comments |
|-|-|-|
| i32x4.trunc_sat_f32x4_s | i32x4.trunc_sat_f32x4_s | i32x4.trunc_sat_f32x4_s
| i32x4.trunc_sat_f32x4_u | i32x4.trunc_sat_f32x4_u | i32x4.trunc_sat_f32x4_u
| f32x4.convert_i32x4_s | f32x4.convert_i32x4_s | f32x4.convert_i32x4_s
| f32x4.convert_i32x4_u | f32x4.convert_i32x4_u | f32x4.convert_i32x4_u
| i32x4.trunc_sat_f64x2_s | i32x4.trunc_sat_f64x2_s | i32x4.trunc_sat_f64x2_s
| i32x4.trunc_sat_f64x2_u | i32x4.trunc_sat_f64x2_u | i32x4.trunc_sat_f64x2_u
| f64x2.convert_low_i32x4_s | f64x2.convert_low_i32x4_s | f64x2.convert_low_i32x4_s
| f64x2.convert_low_i32x4_u | f64x2.convert_low_i32x4_u | f64x2.convert_low_i32x4_u
| f32x4.demote_f64x2_zero | f32x4.demote_f64x2_zero | f32x4.demote_f64x2_zero
| f64x2.promote_low_f32x4 | f64x2.promote_low_f32x4 | f64x2.promote_low_f32x4

Wasm Sections

Imports of globals and functions

Use this syntax to import a function from the host:

// Function import from host
import println : (i32) -> void = console.log;

Notice imports have to be the first thing in the program.

The same works for globals:

// Global import
import id : i32 = module.name;
import id : mutable i32 = module.name;

Memory

You have to explicitly define how much memory is available for the runtime.
To reserve memory, use syntax like this:

export? memory <min> <max>?;

or in case it is imported from the host:

import memory <min> <max>? = module.name;

Examples:

// Reserves one page of 64k
memory 1;

// Reserves 64k at first, but maximum 1mb
memory 1 4;

// Reserves 128k and exports this memory under the name "memory" to the host
// This is the form that Wasi likes
export memory 2;

// Reserves and exports memory under the name "mymem"
export "mymem" memory 1 4;

// 64k Memory import from the host
import memory 1 = module.name;

Data

You can place constant data in the output file using syntax like this:

// Strings are placed as UTF8 but with the length first
// Hello will be bound to the address this data gets
data hello = "utf8 string is very comfortable";

// We can have a sequence of data. Each int is a byte.
data bytes = 1, 2, 3, "text", 3.0;

// Moving the data into offset 32 of the memory
data moved = "Hello, world!" offset 32;

The result is that this data is copied into memory on startup.

Tables

Importing of tables is done like this:

// Table of functions of min size 1, named module.mytable in the host
import mytable : table<func>(1) = module.mytable;

// Table of externs of min size 5, max size 10, named module.myexterns in the host
import myexterns : table<extern>(5 10) = module.myexterns;

Standard library

There is a minimal standard library you get with include lib/runtime.
It contains these helpers:

// Prints a byte to stdout
printByte(i : i32) -> ()

// Print unsigned i32 as decimal
printu32(i : i32) -> ()

// Print signed i32 as decimal
printi32(i : i32) -> ()

// Prints unsigned as hexadecimal
printHex32(i : i32) -> ()

These require Wasi. There is a stub for the wasi interface in lib/wasi.wase
but that is very incomplete at the moment.

There is a start of a memory allocator in lib/malloc.wase. It has
a double-linked list of blocks, and a separate area for small allocations
(<16 bytes>). It will be probably be refactored to expose the different
management disciplines more directly.

Development

Wase is written in flow9. To develop on Wase itself, first install flow9:

https://github.com/area9innovation/flow9

We recommend to use VS Code with the flow9 extension.

To run the compiler from the command line, use

bin\wased tests/euler1.wase

This will compile the compiler, and use this to compile the euler1.wase file
to a euler1.wasm file. It is thus exactly the same as bin\wase except that
it uses the latest source of the compiler.

When you make changes to the compiler, it is recommended to make sure there are
no regressions by running all test cases and produce .wat output as well
using wasm2wat:

bin\wased test=1 wat=1

This requires that you install WABT first

https://github.com/WebAssembly/wabt

If there are regressions, the .wat and/or .wasm files in the tests
folder will have changes when you do git status.

You can compile a new wase.jar binary release used by bin\wase using:

flowc1 wase/wase.flow jar=bin/wase.jar

This requires a Java JDK like OpenJDK.

Implementation notes

For each instruction, we need to define three basic different things:

TODOS

There are a number of things, that would make Wase better:

Right now, typing infer that type to (i32, i32), while it really is
i32. So the code above compiles correctly, and works at runtime, but
our type inference is not smart enough to know this.


Tags: language   wasm   assembly  

Last modified 16 December 2024