asmjit

RepositoryFree

Low-latency machine code generation

Open Source

/ 100

13 capabilities

Capabilities13 decomposed

multi-level code generation abstraction with direct instruction emission

Medium confidence

Provides three distinct emitter abstraction levels (BaseAssembler, BaseBuilder, BaseCompiler) that allow developers to choose between low-level direct instruction encoding to a CodeBuffer, intermediate node-based IR with reordering capabilities, or high-level virtual register allocation with automatic spilling. Each level inherits from the previous, enabling progressive complexity and automation while maintaining control over generated machine code at any abstraction tier.

Solves for

I need to emit raw x86-64 instructions directly to memory with minimal overhead for performance-critical codeI want to generate code with instruction reordering and optimization passes before finalizationI need automatic register allocation and spilling to avoid manual register management in complex code generation

Best for

JIT compiler developers building scripting engines

performance optimization framework authors

dynamic instrumentation tool builders requiring sub-microsecond code generation latency

Requires

C++11 or later compiler

Target architecture support (x86/x64 or AArch64)

CodeHolder instance to store generated code and metadata

Limitations

BaseAssembler provides no instruction reordering or optimization — instructions emit in order

BaseBuilder's node-based IR adds memory overhead for intermediate representation storage

BaseCompiler's register allocation uses greedy algorithms, not graph-coloring, limiting optimization for highly register-constrained scenarios

What makes it unique

Three-tier emitter hierarchy with inheritance-based composition allows seamless progression from raw instruction encoding (BaseAssembler) through IR-based optimization (BaseBuilder) to automated register management (BaseCompiler), all sharing unified operand and instruction APIs across x86/x64 and AArch64 backends without code duplication.

vs alternatives

Offers more granular control than LLVM's IR-only approach while maintaining higher-level abstractions than raw assemblers, enabling latency-sensitive JIT compilers to choose their abstraction level per code path.

architecture-agnostic instruction encoding with backend-specific opcode tables

Medium confidence

Implements unified instruction encoding through architecture-specific backends (X86/X64 and AArch64) that use pre-generated opcode lookup tables and instruction signature matching. The X86 backend uses a table generation system that encodes instruction signatures, operand constraints, and opcode patterns into compact lookup structures; AArch64 uses similar table-driven encoding. A single instruction API call (e.g., `mov(dst, src)`) resolves to the correct machine code encoding based on operand types and target architecture.

Solves for

I want to write architecture-agnostic code generation logic that works on both x86-64 and ARM64 without conditional branchesI need to validate instruction operand combinations against ISA constraints before encodingI want to understand which instruction variants are available for a given mnemonic and operand set

Best for

cross-platform JIT compiler developers

portable dynamic code generation frameworks

ISA researchers building multi-target code generators

Requires

C++11 or later

Target architecture backend compiled in (x86/x64 or AArch64)

Knowledge of target ISA operand constraints and calling conventions

Limitations

Instruction database is static and pre-generated at build time — runtime instruction definition is not supported

X86 instruction encoding handles ~1500+ instructions but excludes some esoteric or vendor-specific extensions

AArch64 backend is less mature than x86/x64 with fewer optimization passes

What makes it unique

Uses pre-generated instruction signature tables that encode operand constraints, size variants, and opcode patterns into compact lookup structures, enabling O(1) instruction resolution without runtime parsing or regex matching; X86 table generation system automatically derives signatures from ISA specifications.

vs alternatives

Faster instruction encoding than LLVM's table-driven approach due to simpler operand model; more maintainable than hand-coded switch statements because table generation is automated from ISA specs.

aarch64 instruction database with table-driven encoding

Medium confidence

Implements AArch64 instruction support through a table-driven encoding system similar to x86/x64, with pre-generated instruction signatures and opcode patterns for AArch64 ISA. The AArch64 Instruction Database encodes instruction variants, operand constraints, and encoding rules into lookup tables. At runtime, instruction encoding resolves operand types to the correct AArch64 opcode and encoding format through signature matching.

Solves for

I want to emit AArch64 instructions with automatic operand validation and encodingI need to generate ARM64 code with correct instruction variants based on operand typesI want to support code generation on Apple Silicon, AWS Graviton, or other ARM64 platforms

Best for

ARM64 JIT compiler developers

cross-platform dynamic code generators targeting ARM64

developers building code generators for Apple Silicon or AWS Graviton

Requires

AArch64 backend compiled in

Knowledge of AArch64 instruction set and operand constraints

Target architecture must be AArch64

Limitations

AArch64 backend is less mature than x86/x64 with fewer optimization passes

Some AArch64 extensions (SVE, SME) may have limited support

Instruction database is static and pre-generated at build time

What makes it unique

Provides AArch64 instruction encoding through table-driven lookup matching x86/x64 architecture, enabling unified cross-architecture code generation APIs while maintaining architecture-specific instruction databases.

vs alternatives

Enables ARM64 code generation with the same API as x86-64, simplifying cross-platform JIT compiler development; more complete than minimal ARM64 assemblers due to comprehensive instruction coverage.

cross-platform virtual memory abstraction with platform-specific backends

Medium confidence

Abstracts platform-specific virtual memory operations (mmap/mprotect on POSIX, VirtualAlloc/VirtualProtect on Windows) through a unified VirtMem interface. The abstraction handles page allocation, protection transitions, and memory deallocation across operating systems. Platform-specific implementations are selected at compile time based on detected OS, enabling single-source code to work on Linux, Windows, macOS, and other platforms.

Solves for

I want to allocate executable memory without writing platform-specific codeI need to transition memory from writable to executable with W^X enforcement across platformsI want to handle platform differences in page size and protection semantics transparently

Best for

cross-platform JIT compiler developers

portable dynamic code generation frameworks

developers targeting multiple operating systems

Requires

Operating system with virtual memory support (all modern OSes)

Platform detection at compile time (CMake handles this)

C++11 or later

Limitations

VirtMem abstraction is limited to common operations (allocate, protect, deallocate); advanced features (huge pages, NUMA) not supported

Page size is platform-dependent and not configurable

Protection semantics differ slightly across platforms (e.g., Windows requires explicit decommit)

What makes it unique

Provides unified VirtMem interface that abstracts POSIX mmap/mprotect and Windows VirtualAlloc/VirtualProtect with compile-time platform selection, enabling W^X enforcement without platform-specific code in user code.

vs alternatives

More portable than OS-specific memory APIs while maintaining lower overhead than full abstraction layers; handles W^X enforcement transparently across platforms.

cmake-based modular build system with feature flags

Medium confidence

Implements a CMake-based build system that enables fine-grained control over compiled features through feature flags (ASMJIT_BUILD_X86, ASMJIT_BUILD_ARM, etc.). Developers can selectively enable/disable architecture backends, instruction databases, and optional features at build time, reducing binary size and compilation time. The build system automatically detects platform capabilities and generates appropriate compiler flags.

Solves for

I want to build asmjit with only the architecture backends I need to reduce binary sizeI need to customize the build for embedded systems with limited resourcesI want to enable/disable optional features (e.g., instruction validation) at build time

Best for

embedded systems developers with size constraints

developers building minimal JIT runtimes

teams customizing asmjit for specific use cases

Requires

CMake 3.10 or later

C++11 or later compiler

Platform-specific build tools (make, Visual Studio, etc.)

Limitations

Feature flags are compile-time only — runtime feature detection not supported

Disabling features may break code that depends on them, requiring careful testing

CMake configuration is complex with many options, potentially confusing for new users

What makes it unique

Uses CMake feature flags to enable selective compilation of architecture backends and optional features, allowing developers to build minimal asmjit instances for embedded systems or specific use cases without modifying source code.

vs alternatives

More flexible than monolithic builds while maintaining simpler configuration than autotools; enables binary size optimization for embedded systems.

automatic register allocation with virtual register abstraction

Medium confidence

The BaseCompiler emitter provides virtual register allocation by allowing developers to request unlimited virtual registers (VReg) that are automatically mapped to physical registers and spilled to stack as needed. The allocator tracks register liveness, performs greedy allocation, and inserts spill/reload instructions transparently. This abstraction hides the complexity of manual register management while maintaining control over register-level optimizations through explicit virtual register declarations.

Solves for

I want to generate complex code without manually tracking which physical registers are availableI need automatic spilling to stack when code uses more registers than the architecture providesI want to declare function arguments and return values with automatic calling convention handling

Best for

JIT compiler developers building expression evaluators or bytecode interpreters

dynamic code generation frameworks targeting multiple architectures

developers prioritizing code generation simplicity over micro-optimized register allocation

Requires

BaseCompiler emitter instance

Virtual register declarations via `newReg()` or `newStack()` calls

Target architecture with defined calling conventions (x86-64 or AArch64)

Limitations

Greedy allocation strategy does not perform graph-coloring or interference analysis, potentially generating suboptimal spill code

Virtual register allocation adds ~5-10% overhead compared to hand-optimized assembly

Spill/reload insertion happens after instruction emission, making it difficult to optimize across spill boundaries

What makes it unique

Provides virtual register abstraction at the emitter level (not IR level), allowing direct instruction emission with automatic physical register mapping and transparent spilling, eliminating the need for separate IR-to-assembly lowering passes while maintaining single-pass code generation.

vs alternatives

Simpler API than LLVM's register allocator (no need to understand interference graphs) while still supporting complex register pressure scenarios; faster compilation than graph-coloring allocators due to greedy strategy.

executable memory management with w^x security enforcement

Medium confidence

Manages allocation and lifecycle of executable memory through JitRuntime and JitAllocator, enforcing Write-XOR-Execute (W^X) security semantics where memory is either writable or executable, never both simultaneously. The VirtMem layer abstracts platform-specific virtual memory APIs (mmap on POSIX, VirtualAlloc on Windows) and handles page protection transitions. Code is written to writable memory, then protected as executable before execution, preventing code injection attacks.

Solves for

I need to allocate memory for generated code that is protected against code injection attacksI want to reuse allocated memory for multiple code generation passes without leaking memoryI need to handle platform differences (Linux, Windows, macOS) in executable memory allocation transparently

Best for

security-conscious JIT compiler developers

production runtime environments requiring W^X enforcement

embedded systems with strict memory constraints needing efficient code memory reuse

Requires

Operating system support for virtual memory protection (all modern OSes)

JitRuntime instance to manage allocator lifecycle

CodeHolder with finalized code ready for execution

Limitations

W^X enforcement adds page protection overhead (~1-5ms per code finalization) due to mprotect/VirtualProtect syscalls

Memory fragmentation can occur if many small code blocks are allocated, reducing allocator efficiency

JitAllocator uses simple bump-pointer allocation within pages, not best-fit, potentially wasting space

What makes it unique

Implements W^X enforcement at the allocator level with platform abstraction (VirtMem) that unifies POSIX mmap/mprotect and Windows VirtualAlloc/VirtualProtect, ensuring security guarantees across operating systems without exposing platform-specific APIs to users.

vs alternatives

Provides stronger security guarantees than manual mprotect calls (prevents TOCTOU attacks) while maintaining lower overhead than full sandboxing; more portable than OS-specific memory APIs.

node-based intermediate representation with instruction reordering and optimization

Medium confidence

BaseBuilder emits instructions as nodes in a linked list (Node system) rather than directly to a buffer, enabling instruction reordering, dead code elimination, and optimization passes before final encoding. Each instruction becomes a Node with metadata about operands, dependencies, and side effects. Nodes can be inserted, removed, or reordered before the builder finalizes code, converting the node graph to machine code through the emitter hierarchy.

Solves for

I want to optimize generated code by reordering instructions to reduce dependencies and improve ILPI need to eliminate dead code or redundant instructions after code generationI want to apply peephole optimizations or instruction fusion before final encoding

Best for

JIT compilers targeting performance-critical code paths

dynamic code generation frameworks with optimization budgets

developers building custom optimization passes for generated code

Requires

BaseBuilder emitter instance (not BaseAssembler)

Understanding of instruction dependencies and side effects

Optional: custom optimization pass implementations

Limitations

Node-based IR adds memory overhead (~40-80 bytes per instruction for node metadata)

Instruction reordering requires dependency analysis, adding compilation latency (~5-15% overhead)

No built-in optimization passes — developers must implement custom passes or use basic reordering

What makes it unique

Uses a linked-list node representation that preserves instruction order while enabling arbitrary reordering and optimization before finalization, avoiding the complexity of full IR graphs (like LLVM) while maintaining single-pass code generation semantics.

vs alternatives

Lighter-weight than LLVM's SSA IR (lower memory overhead, faster compilation) while still enabling instruction reordering; more flexible than BaseAssembler's direct emission for optimization-focused use cases.

unified operand system with type-safe register and memory references

Medium confidence

Provides a unified operand abstraction (Operand class hierarchy) that represents registers, immediates, labels, and memory references with type safety and architecture awareness. Operands encode register class (GP, XMM, etc.), size, and constraints into a compact representation. Memory operands support complex addressing modes (base + index*scale + displacement) with automatic validation. The operand system enables generic instruction APIs that work across different operand combinations without overloading.

Solves for

I want to specify instruction operands (registers, immediates, memory) with compile-time type safetyI need to use complex memory addressing modes (e.g., [rax + rbx*4 + 8]) without manual encodingI want to validate operand compatibility with instructions before encoding

Best for

C++ developers building type-safe code generators

JIT compilers requiring operand validation before instruction emission

developers working with multiple architectures needing unified operand APIs

Requires

C++11 or later for type-safe operand construction

Knowledge of target architecture register classes and addressing modes

Instruction API that accepts operand types

Limitations

Operand validation is architecture-specific and happens at encoding time, not compile time

Memory operand addressing modes are limited to base + index*scale + displacement (no complex expressions)

Operand size inference relies on context (instruction mnemonic), not explicit type annotations

What makes it unique

Encodes operand information (register class, size, addressing mode) into a compact representation with type-safe C++ API, enabling generic instruction methods that accept multiple operand types without overloading while maintaining architecture-specific validation.

vs alternatives

More type-safe than string-based operand specifications (like inline assembly) while maintaining simpler API than LLVM's operand hierarchy; compact representation enables efficient operand storage in node metadata.

function prologue/epilogue generation with calling convention support

Medium confidence

Provides automatic generation of function prologue and epilogue code based on declared calling conventions (x86-64 System V, x86-64 Windows, AArch64 AAPCS). Developers declare function arguments, return values, and clobbered registers; the compiler automatically generates stack frame setup, register saves, and cleanup. This abstraction handles platform-specific calling convention details (argument passing, return value location, stack alignment) transparently.

Solves for

I want to generate functions that follow the target platform's calling convention without manual stack frame managementI need to declare which registers my generated function clobbers so the caller can save themI want to access function arguments and return values with automatic location mapping

Best for

JIT compilers generating callable functions from bytecode or IR

dynamic code generation frameworks targeting multiple platforms

developers building language runtimes with native code generation

Requires

BaseCompiler emitter instance

Target architecture with defined calling convention (x86-64 or AArch64)

FuncSignature declaration with argument and return value types

Limitations

Calling convention support is limited to standard conventions (System V, Windows x64, AArch64 AAPCS); custom conventions not supported

Stack frame layout is determined by the compiler, not user-configurable

Prologue/epilogue generation assumes standard stack alignment; non-standard alignment requires manual adjustment

What makes it unique

Abstracts calling convention details into FuncSignature declarations that automatically generate platform-specific prologue/epilogue code, eliminating manual stack frame management while maintaining compatibility with native calling conventions across x86-64 and AArch64.

vs alternatives

Simpler than manual prologue/epilogue writing while more flexible than fixed-format function templates; automatically handles platform differences without conditional code.

label-based code relocation and forward reference resolution

Medium confidence

Implements a label system that enables forward references and code relocation through a two-pass approach: labels are declared during code emission, then resolved during finalization. The CodeHolder maintains a relocation table mapping label references to code offsets. Relocations support multiple types (absolute, relative, section-relative) and are resolved when code is finalized and moved to executable memory, enabling jumps and calls to unresolved targets.

Solves for

I want to emit jumps and calls to labels that haven't been defined yet in the code streamI need to support code relocation when generated code is moved to different memory addressesI want to reference external functions or data from generated code with automatic relocation

Best for

JIT compilers with multi-pass code generation

dynamic code generators needing forward references

developers building code generators with complex control flow

Requires

CodeHolder instance to store labels and relocations

Label declarations via `newLabel()` before use

Finalization pass to resolve all relocations

Limitations

Labels are resolved only at finalization time, not during emission, delaying error detection

Relocation types are limited to common patterns (absolute, relative, section-relative); custom relocations not supported

Forward references require sufficient space for relocation patching (e.g., 5-byte jumps on x86-64), potentially wasting code space

What makes it unique

Uses a deferred relocation model where labels are collected during emission and resolved during finalization, enabling forward references without requiring multiple passes while maintaining compact relocation records.

vs alternatives

Simpler than LLVM's relocation model (fewer relocation types) while supporting the common cases; more efficient than runtime relocation patching due to batch resolution at finalization.

section-based code organization with metadata storage

Medium confidence

Organizes generated code into sections (code, data, read-only data) within a CodeHolder, enabling separation of concerns and metadata storage. Each section has its own buffer, relocation table, and label namespace. This abstraction allows code generators to emit code and data independently, then combine them during finalization. Sections support different protection levels (executable, writable, read-only) and can be linked together.

Solves for

I want to separate generated code from constant data and read-only tablesI need to emit code and data in different sections with different protection levelsI want to organize generated code into logical sections (hot path, cold path, data)

Best for

JIT compilers with complex code organization requirements

dynamic code generators separating code from data

developers building code generators with multiple output sections

Requires

CodeHolder instance

Section declarations via `newSection()` before emission

Manual section switching during code generation

Limitations

Section linking is manual — developers must manage section offsets and cross-section references

No automatic section layout optimization (e.g., cache-aware placement)

Section protection levels are limited to standard types (executable, writable, read-only)

What makes it unique

Provides section abstraction at the CodeHolder level, enabling logical separation of code and data with independent buffers and relocation tables, while maintaining unified finalization and memory allocation.

vs alternatives

Simpler than ELF/Mach-O section models (fewer section types) while supporting the common cases; more flexible than flat code buffers for organizing complex generated code.

x86/x64 instruction database with signature-based encoding

Medium confidence

Implements a comprehensive x86/x64 instruction database (~1500+ instructions) using a table generation system that derives instruction signatures, operand constraints, and opcode patterns from ISA specifications. The X86 Instruction Database encodes each instruction's valid operand combinations, size variants, and encoding rules into lookup tables. At runtime, instruction encoding resolves operand types to the correct opcode and encoding format through signature matching.

Solves for

I want to emit x86-64 instructions with automatic operand validation and encodingI need to understand which operand combinations are valid for a given instructionI want to generate code that uses the correct instruction variant (e.g., mov vs movzx) based on operand types

Best for

x86-64 JIT compiler developers

dynamic code generators targeting x86-64

developers building x86-64 instruction analysis tools

Requires

x86/x64 backend compiled in

Knowledge of x86-64 instruction set and operand constraints

Target architecture must be x86 or x86-64

Limitations

Instruction database is static and pre-generated at build time — new instructions require rebuild

Some esoteric or vendor-specific x86 extensions may not be included

Operand validation is signature-based, not constraint-based, limiting expressiveness

What makes it unique

Uses automated table generation from ISA specifications to derive instruction signatures and opcode patterns, enabling O(1) instruction resolution without hand-coded switch statements; encodes operand constraints into compact lookup structures.

vs alternatives

More maintainable than hand-coded instruction encoders due to automated table generation; faster than regex-based instruction matching due to pre-computed lookup tables.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Related Artifactssharing capabilities

Artifacts that share capabilities with asmjit, ranked by overlap. Discovered automatically through the match graph.

Repository46

llvm

Project moved to: https://github.com/llvm/llvm-project

arm target code generation with conditional execution and neon simdx86 target-specific instruction selection and avx-512 supportselectiondag-based code generation with target-specific loweringglobal instruction selection (gisel) framework for machine-independent code generation

4 shared capabilities

Model47

DeepSeek Coder V2

DeepSeek's 236B MoE model specialized for code.

instruction-following code generation with fine-tuned response formattingefficient inference through sglang framework with mla optimization

2 shared capabilities

Repository44

CodeT5

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

encoder-decoder code generation with instruction tuninginstruction-tuning for natural language-guided code generation

2 shared capabilities

Model22

Qwen: Qwen3 Coder 30B A3B Instruct

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

instruction-following code generation with domain-specific reasoning

1 shared capability

Model44

Codestral

Mistral's dedicated 22B code generation model.

instruction-following code generation with context awareness

1 shared capability

Model20

EssentialAI: Rnj 1 Instruct

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

programming-task instruction following

1 shared capability

Best For

✓JIT compiler developers building scripting engines
✓performance optimization framework authors
✓dynamic instrumentation tool builders requiring sub-microsecond code generation latency
✓cross-platform JIT compiler developers
✓portable dynamic code generation frameworks
✓ISA researchers building multi-target code generators
✓ARM64 JIT compiler developers
✓cross-platform dynamic code generators targeting ARM64

Known Limitations

⚠BaseAssembler provides no instruction reordering or optimization — instructions emit in order
⚠BaseBuilder's node-based IR adds memory overhead for intermediate representation storage
⚠BaseCompiler's register allocation uses greedy algorithms, not graph-coloring, limiting optimization for highly register-constrained scenarios
⚠Cross-architecture code generation requires separate emitter instances per target ISA
⚠Instruction database is static and pre-generated at build time — runtime instruction definition is not supported
⚠X86 instruction encoding handles ~1500+ instructions but excludes some esoteric or vendor-specific extensions

Requirements

C++11 or later compilerTarget architecture support (x86/x64 or AArch64)CodeHolder instance to store generated code and metadataC++11 or laterTarget architecture backend compiled in (x86/x64 or AArch64)Knowledge of target ISA operand constraints and calling conventionsAArch64 backend compiled inKnowledge of AArch64 instruction set and operand constraints

Input / Output

Accepts: operand specifications (registers, immediates, memory references), instruction mnemonics with operand lists, function signatures with calling convention metadata, instruction mnemonic (string or enum), operand list (registers, immediates, memory operands with displacement/scale/index), operand size hints (8-bit, 16-bit, 32-bit, 64-bit), instruction mnemonic (mov, add, ldr, etc.), operand list with types (register, immediate, memory), operand sizes (32-bit, 64-bit), requested memory size (in bytes), protection flags (read, write, execute), alignment requirements, CMake feature flags (ASMJIT_BUILD_X86, ASMJIT_BUILD_ARM, etc.), compiler flags and optimization levels, platform detection results, virtual register requests with type (GP, XMM, YMM, ZMM for x86; GP, FP, NEON for AArch64), instruction sequences using virtual registers, function prologue/epilogue declarations, CodeHolder with generated machine code, alignment requirements (typically page-aligned), instruction sequences with operand dependencies, node insertion/removal/reordering operations, optimization pass specifications, register identifiers (rax, rbx, xmm0, etc.), immediate values (integers, floating-point), memory operand components (base register, index register, scale, displacement), label references for jumps and calls, function signature (argument types, return type, calling convention), clobbered register list, stack frame size requirements, label identifiers (Label objects), relocation type specifications (absolute, relative, etc.), target addresses (code offsets or external references), section type (code, data, read-only), section flags (executable, writable, etc.), code/data to emit to section, instruction mnemonic (mov, add, xor, etc.), operand sizes (8-bit, 16-bit, 32-bit, 64-bit)

Produces: machine code bytes in CodeBuffer, node-based intermediate representation (for Builder), executable function pointers (after JitRuntime finalization), encoded machine code bytes (1-15 bytes for x86, 4 bytes for AArch64), relocation records for unresolved labels or external references, instruction metadata (size, operand read/write info), encoded machine code (4 bytes per instruction), relocation records for immediates or memory operands, instruction metadata, allocated memory pointer, actual allocated size (rounded to page boundary), protection state, compiled asmjit library with selected features, header files with feature-specific declarations, build artifacts (object files, static/shared libraries), physical register assignments (mapped to actual rax, rbx, etc.), spill/reload instructions inserted into code stream, stack frame layout with offsets for spilled values, executable memory pointer (void*) to generated code, JitAllocator handle for memory lifecycle management, page protection state transitions (writable → executable), reordered node graph with optimized instruction sequence, machine code after node-to-buffer finalization, optimization metadata (dead code eliminated, instructions fused, etc.), encoded operand bytes in machine instruction, relocation records for unresolved labels, operand metadata (size, register class, addressing mode), prologue code (stack frame setup, register saves), epilogue code (register restores, stack cleanup), argument/return value location mappings (register or stack offset), relocation records in CodeHolder, patched machine code with resolved addresses, relocation metadata (type, offset, target), section buffers with code/data, section metadata (offset, size, protection level), cross-section relocation records, encoded machine code (1-15 bytes), instruction size

UnfragileRank

Adoption59%(35% weight)

Quality37%(20% weight)

Ecosystem60%(25% weight)

Match Graph10%(15% weight)

Freshness75%(5% weight)

UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.

Type: Repository

13 capabilities

Visit asmjit→

Repository Details

4,488

Stars

572

Forks

C++

Language

Zlib

License

Topics

aarch64asmjitassemblercode-generationcompilercppjitjit-compilationx86x86-64x86-x64

Last commit: Mar 26, 2026

About

Low-latency machine code generation

Alternatives to asmjit

IntelliCode50Extension

AI-assisted development

Compare →

GitHub Copilot Chat53Extension

AI chat features powered by Copilot

Compare →

GitHub Copilot52Extension

Your AI pair programmer

Compare →

Claude Code for VS Code52Extension

Claude Code for VS Code: Harness the power of Claude Code without leaving your IDE

Compare →

Are you the builder of asmjit?

Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.

Claim this artifact →Verification via email

Get the weekly brief

New tools, rising stars, and what's actually worth your time. No spam.

Data Sources

github

Looking for something else?

Search →

Capabilities13 decomposed

multi-level code generation abstraction with direct instruction emission

Medium confidence

Solves for

Best for

JIT compiler developers building scripting engines

performance optimization framework authors

dynamic instrumentation tool builders requiring sub-microsecond code generation latency

Requires

C++11 or later compiler

Target architecture support (x86/x64 or AArch64)

CodeHolder instance to store generated code and metadata

Limitations

BaseAssembler provides no instruction reordering or optimization — instructions emit in order

BaseBuilder's node-based IR adds memory overhead for intermediate representation storage

BaseCompiler's register allocation uses greedy algorithms, not graph-coloring, limiting optimization for highly register-constrained scenarios

What makes it unique

vs alternatives

architecture-agnostic instruction encoding with backend-specific opcode tables

Medium confidence

Solves for

Best for

cross-platform JIT compiler developers

portable dynamic code generation frameworks

ISA researchers building multi-target code generators

Requires

C++11 or later

Target architecture backend compiled in (x86/x64 or AArch64)

Knowledge of target ISA operand constraints and calling conventions

Limitations

Instruction database is static and pre-generated at build time — runtime instruction definition is not supported

X86 instruction encoding handles ~1500+ instructions but excludes some esoteric or vendor-specific extensions

AArch64 backend is less mature than x86/x64 with fewer optimization passes

What makes it unique

vs alternatives

Faster instruction encoding than LLVM's table-driven approach due to simpler operand model; more maintainable than hand-coded switch statements because table generation is automated from ISA specs.

aarch64 instruction database with table-driven encoding

Medium confidence

Solves for

Best for

ARM64 JIT compiler developers

cross-platform dynamic code generators targeting ARM64

developers building code generators for Apple Silicon or AWS Graviton

Requires

AArch64 backend compiled in

Knowledge of AArch64 instruction set and operand constraints

Target architecture must be AArch64

Limitations

AArch64 backend is less mature than x86/x64 with fewer optimization passes

Some AArch64 extensions (SVE, SME) may have limited support

Instruction database is static and pre-generated at build time

What makes it unique

vs alternatives

Enables ARM64 code generation with the same API as x86-64, simplifying cross-platform JIT compiler development; more complete than minimal ARM64 assemblers due to comprehensive instruction coverage.

cross-platform virtual memory abstraction with platform-specific backends

Medium confidence

Solves for

Best for

cross-platform JIT compiler developers

portable dynamic code generation frameworks

developers targeting multiple operating systems

Requires

Operating system with virtual memory support (all modern OSes)

Platform detection at compile time (CMake handles this)

C++11 or later

Limitations

VirtMem abstraction is limited to common operations (allocate, protect, deallocate); advanced features (huge pages, NUMA) not supported

Page size is platform-dependent and not configurable

Protection semantics differ slightly across platforms (e.g., Windows requires explicit decommit)

What makes it unique

vs alternatives

More portable than OS-specific memory APIs while maintaining lower overhead than full abstraction layers; handles W^X enforcement transparently across platforms.

cmake-based modular build system with feature flags

Medium confidence

Solves for

Best for

embedded systems developers with size constraints

developers building minimal JIT runtimes

teams customizing asmjit for specific use cases

Requires

CMake 3.10 or later

C++11 or later compiler

Platform-specific build tools (make, Visual Studio, etc.)

Limitations

Feature flags are compile-time only — runtime feature detection not supported

Disabling features may break code that depends on them, requiring careful testing

CMake configuration is complex with many options, potentially confusing for new users

What makes it unique

vs alternatives

More flexible than monolithic builds while maintaining simpler configuration than autotools; enables binary size optimization for embedded systems.

automatic register allocation with virtual register abstraction

Medium confidence

Solves for

Best for

JIT compiler developers building expression evaluators or bytecode interpreters

dynamic code generation frameworks targeting multiple architectures

developers prioritizing code generation simplicity over micro-optimized register allocation

Requires

BaseCompiler emitter instance

Virtual register declarations via `newReg()` or `newStack()` calls

Target architecture with defined calling conventions (x86-64 or AArch64)

Limitations

Greedy allocation strategy does not perform graph-coloring or interference analysis, potentially generating suboptimal spill code

Virtual register allocation adds ~5-10% overhead compared to hand-optimized assembly

Spill/reload insertion happens after instruction emission, making it difficult to optimize across spill boundaries

What makes it unique

vs alternatives

executable memory management with w^x security enforcement

Medium confidence

Solves for

Best for

security-conscious JIT compiler developers

production runtime environments requiring W^X enforcement

embedded systems with strict memory constraints needing efficient code memory reuse

Requires

Operating system support for virtual memory protection (all modern OSes)

JitRuntime instance to manage allocator lifecycle

CodeHolder with finalized code ready for execution

Limitations

W^X enforcement adds page protection overhead (~1-5ms per code finalization) due to mprotect/VirtualProtect syscalls

Memory fragmentation can occur if many small code blocks are allocated, reducing allocator efficiency

JitAllocator uses simple bump-pointer allocation within pages, not best-fit, potentially wasting space

What makes it unique

vs alternatives

Provides stronger security guarantees than manual mprotect calls (prevents TOCTOU attacks) while maintaining lower overhead than full sandboxing; more portable than OS-specific memory APIs.

node-based intermediate representation with instruction reordering and optimization

Medium confidence

Solves for

Best for

JIT compilers targeting performance-critical code paths

dynamic code generation frameworks with optimization budgets

developers building custom optimization passes for generated code

Requires

BaseBuilder emitter instance (not BaseAssembler)

Understanding of instruction dependencies and side effects

Optional: custom optimization pass implementations

Limitations

Node-based IR adds memory overhead (~40-80 bytes per instruction for node metadata)

Instruction reordering requires dependency analysis, adding compilation latency (~5-15% overhead)

No built-in optimization passes — developers must implement custom passes or use basic reordering

What makes it unique

vs alternatives

unified operand system with type-safe register and memory references

Medium confidence

Solves for

Best for

C++ developers building type-safe code generators

JIT compilers requiring operand validation before instruction emission

developers working with multiple architectures needing unified operand APIs

Requires

C++11 or later for type-safe operand construction

Knowledge of target architecture register classes and addressing modes

Instruction API that accepts operand types

Limitations

Operand validation is architecture-specific and happens at encoding time, not compile time

Memory operand addressing modes are limited to base + index*scale + displacement (no complex expressions)

Operand size inference relies on context (instruction mnemonic), not explicit type annotations

What makes it unique

vs alternatives

function prologue/epilogue generation with calling convention support

Medium confidence

Solves for

Best for

JIT compilers generating callable functions from bytecode or IR

dynamic code generation frameworks targeting multiple platforms

developers building language runtimes with native code generation

Requires

BaseCompiler emitter instance

Target architecture with defined calling convention (x86-64 or AArch64)

FuncSignature declaration with argument and return value types

Limitations

Calling convention support is limited to standard conventions (System V, Windows x64, AArch64 AAPCS); custom conventions not supported

Stack frame layout is determined by the compiler, not user-configurable

Prologue/epilogue generation assumes standard stack alignment; non-standard alignment requires manual adjustment

What makes it unique

vs alternatives

Simpler than manual prologue/epilogue writing while more flexible than fixed-format function templates; automatically handles platform differences without conditional code.

label-based code relocation and forward reference resolution

Medium confidence

Solves for

Best for

JIT compilers with multi-pass code generation

dynamic code generators needing forward references

developers building code generators with complex control flow

Requires

CodeHolder instance to store labels and relocations

Label declarations via `newLabel()` before use

Finalization pass to resolve all relocations

Limitations

Labels are resolved only at finalization time, not during emission, delaying error detection

Relocation types are limited to common patterns (absolute, relative, section-relative); custom relocations not supported

Forward references require sufficient space for relocation patching (e.g., 5-byte jumps on x86-64), potentially wasting code space

What makes it unique

vs alternatives

Simpler than LLVM's relocation model (fewer relocation types) while supporting the common cases; more efficient than runtime relocation patching due to batch resolution at finalization.

section-based code organization with metadata storage

Medium confidence

Solves for

Best for

JIT compilers with complex code organization requirements

dynamic code generators separating code from data

developers building code generators with multiple output sections

Requires

CodeHolder instance

Section declarations via `newSection()` before emission

Manual section switching during code generation

Limitations

Section linking is manual — developers must manage section offsets and cross-section references

No automatic section layout optimization (e.g., cache-aware placement)

Section protection levels are limited to standard types (executable, writable, read-only)

What makes it unique

vs alternatives

Simpler than ELF/Mach-O section models (fewer section types) while supporting the common cases; more flexible than flat code buffers for organizing complex generated code.

x86/x64 instruction database with signature-based encoding

Medium confidence

Solves for

Best for

x86-64 JIT compiler developers

dynamic code generators targeting x86-64

developers building x86-64 instruction analysis tools

Requires

x86/x64 backend compiled in

Knowledge of x86-64 instruction set and operand constraints

Target architecture must be x86 or x86-64

Limitations

Instruction database is static and pre-generated at build time — new instructions require rebuild

Some esoteric or vendor-specific x86 extensions may not be included

Operand validation is signature-based, not constraint-based, limiting expressiveness

What makes it unique

vs alternatives

More maintainable than hand-coded instruction encoders due to automated table generation; faster than regex-based instruction matching due to pre-computed lookup tables.

Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.

Alternatives to asmjit

IntelliCode50Extension

AI-assisted development

Compare →

GitHub Copilot Chat53Extension

AI chat features powered by Copilot

Compare →

GitHub Copilot52Extension

Your AI pair programmer

Compare →

Claude Code for VS Code52Extension

Claude Code for VS Code: Harness the power of Claude Code without leaving your IDE

Compare →

asmjit

Capabilities13 decomposed

multi-level code generation abstraction with direct instruction emission

architecture-agnostic instruction encoding with backend-specific opcode tables

aarch64 instruction database with table-driven encoding

cross-platform virtual memory abstraction with platform-specific backends

cmake-based modular build system with feature flags

automatic register allocation with virtual register abstraction

executable memory management with w^x security enforcement

node-based intermediate representation with instruction reordering and optimization

unified operand system with type-safe register and memory references

function prologue/epilogue generation with calling convention support

label-based code relocation and forward reference resolution

section-based code organization with metadata storage

x86/x64 instruction database with signature-based encoding

Related Artifactssharing capabilities

llvm

DeepSeek Coder V2

CodeT5

Qwen: Qwen3 Coder 30B A3B Instruct

Codestral

EssentialAI: Rnj 1 Instruct

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to asmjit

Are you the builder of asmjit?

Get the weekly brief

Data Sources

asmjit

Capabilities13 decomposed

multi-level code generation abstraction with direct instruction emission

architecture-agnostic instruction encoding with backend-specific opcode tables

aarch64 instruction database with table-driven encoding

cross-platform virtual memory abstraction with platform-specific backends

cmake-based modular build system with feature flags

automatic register allocation with virtual register abstraction

executable memory management with w^x security enforcement

node-based intermediate representation with instruction reordering and optimization

unified operand system with type-safe register and memory references

function prologue/epilogue generation with calling convention support

label-based code relocation and forward reference resolution

section-based code organization with metadata storage

x86/x64 instruction database with signature-based encoding

Related Artifactssharing capabilities

llvm

DeepSeek Coder V2

CodeT5

Qwen: Qwen3 Coder 30B A3B Instruct

Codestral

EssentialAI: Rnj 1 Instruct

Best For

Known Limitations

Requirements

Input / Output

UnfragileRank

Repository Details

About

Categories

Alternatives to asmjit

Are you the builder of asmjit?

Get the weekly brief

Data Sources