Guide for C/C++ developers

优质
小牛编辑
142浏览
2023-12-01

WebAssembly is being designed to support C and C++ code well, right from the start in the MVP. The following explains the outlook for C and C++ developers.

Porting C and C++ code to WebAssembly

Platform features

WebAssembly has a pretty conventional ISA: 8-bit bytes, two’s complement integers, little-endian, and a lot of other normal properties. Reasonably portable C/C++ code should port to WebAssembly without difficultly.

WebAssembly has 32-bit and 64-bit architecture variants, called wasm32 and wasm64. wasm32 has an ILP32 data model, meaning that int, long, and pointer types are all 32-bit, while the long long type is 64-bit. wasm64 has an LP64 data model, meaning that long and pointer types will be 64-bit, while int is 32-bit.

The MVP will support only wasm32; support for wasm64 will be added in the future to support 64-bit address spaces.

float and double are the IEEE 754-2008 single- and double-precision types, which are native in WebAssembly. long double is the IEEE 754-2008 quad-precision type, which is a software-emulated type. WebAssembly does not have a builtin quad-precision type or associated operators. The long double type here is software-emulated in library code linked into WebAssembly applications that need it.

For performance and compatibility with other platforms, float and double are recommended for most uses.

Language Support

C and C++ language conformance is largely determined by individual compiler support, but WebAssembly includes all the functionality that popular C and C++ compilers need to support high-quality implementations.

While the MVP will be fully functional, additional features enabling greater performance will be added soon after, including:

APIs

WebAssembly applications can use high-level C/C++ APIs such as the C and C++ standard libraries, OpenGL, SDL, pthreads, and others, just as in normal C/C++ development. Under the covers, these libraries implement their functionality by using low-level facilities provided by WebAssembly implementations. On the Web, they utilize Web APIs (for example, OpenGL is executed on WebGL, libc date and time methods use the browser’s Date functionality, etc.). In other contexts, other low-level mechanisms may be used.

ABIs

In the MVP, WebAssembly does not yet have a stable ABI for libraries. Developers will need to ensure that all code linked into an application are compiled with the same compiler and options.

In the future, when WebAssembly is extended to support dynamic linking, stable ABIs are expected to be defined in accompaniment.

Undefined and Implementation-defined Behavior

Undefined Behavior

WebAssembly doesn’t change the C or C++ languages. Things which cause undefined behavior in C or C++ are still bugs when compiling for WebAssembly even when the corresponding behavior in WebAssembly itself is defined. C and C++ optimizers still assume that undefined behavior won’t occur, so such bugs can still lead to surprising behavior.

For example, while unaligned memory access is fully defined in WebAssembly, C and C++ compilers make no guarantee that a (non-packed) unaligned memory access at the source level is harmlessly translated into an unaligned memory access in WebAssembly. And in practice, popular C and C++ compilers do optimize on the assumption that alignment rules are followed, meaning that they don’t always preserve program behavior otherwise.

On WebAssembly, the primary nondeterminism and security invariants are always maintained. Demons can’t actually fly out your nose, as that would constitute an escape from the sandbox. And, callstacks can’t become corrupted.

Other than that, programs which invoke undefined behavior at the source language level may be compiled into WebAssembly programs which do anything else, including corrupting the contents of the application’s linear memory, calling APIs with arbitrary parameters, hanging, trapping, or consuming arbitrary amounts of resources (within the limits).

Tools are being developed and ported to help developers find and fix such bugs in their code.

Implementation-Defined Behavior

Most implementation-defined behavior in C and C++ is dependent on the compiler rather than on the underlying platform. For those details that are dependent on the platform, on WebAssembly they follow naturally from having 8-bit bytes, 32-bit and 64-bit two’s complement integers, and 32-bit and 64-bit IEEE-754-2008-style floating point support.

Portability of compiled code

WebAssembly can be efficiently implemented on a wide variety of platforms, provided they can satisfy certain basic expectations.

WebAssembly has very limited nondeterminism, so it is expected that compiled WebAssembly programs will behave very consistently across different implementations, and across different versions of the same implementation.