Commit Graph

3730 Commits

Author SHA1 Message Date
Benjamin Oldenburg
99d2daeb47 refactor(arm64): use symbolic opcode constants consistently
Replace hardcoded magic numbers with symbolic constants for ARM64
instruction opcodes, matching the style used in x86_64 backend.

Changes:
- arm64-tok.h: Add 93 new opcode constants and helper macros
  - Instruction opcodes: ARM64_ADD_IMM, ARM64_LDR_X, ARM64_B, etc.
  - Helper macros: ARM64_RD(), ARM64_RN(), ARM64_IMM12(), etc.
  - Field encodings: ARM64_SF(), ARM64_S(), ARM64_SH(), etc.

- arm64-asm.c: Refactor all instruction generation functions
  - gen_movz/gen_movn/gen_movk: Use ARM64_MOVZ/MOVN/MOVK
  - gen_add_imm/gen_sub_imm: Use ARM64_ADD_IMM/SUB_IMM
  - gen_dp_reg: Use symbolic opcodes
  - gen_ldst_imm/gen_ldst_pair: Use ARM64_LDR_*/STR_*
  - gen_b/gen_bl/gen_br/gen_blr/gen_ret: Use ARM64_B/BL/BR/BLR/RET
  - gen_cbz/gen_cbnz: Use ARM64_CBZ/CBNZ
  - gen_shift: Use ARM64_LSL_REG/LSR_REG/ASR_REG/ROR_REG
  - gen_barrier: Use ARM64_ISB/DSB/DMB
  - gen_mrs/gen_msr: Use symbolic constants
  - Inline asm save/restore: Use ARM64_STP_X/LDP_X

- arm64-gen.c: Begin systematic refactoring (first batch)
  - arm64_sub_sp: Use ARM64_SUB_IMM with helper macros

Benefits:
- Readability: Self-documenting code (ARM64_LDR_X vs 0xF9400000)
- Maintainability: Easier to spot encoding errors
- Consistency: Matches x86_64 backend style
- Safety: Helper macros prevent bit-shift mistakes

All tests pass with no functional changes.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
0720236ed0 fix: standardize hex literal suffixes for uint64_t operations
Use 'ul' suffix consistently for hex literals used with uint64_t
parameters instead of mixing (uint64_t) casts and suffixes.

Changes:
- arm64_check_offset: 0xffful, 0x1fful (was (uint64_t)0xfff, (uint64_t)0x1ff)
- arm64_ldrx/ldrv/strx/strv: 0xffful (was (uint64_t)0xfff)
- arm64_gen_opic: 0xffful, 0xfff000ul (was (uint64_t)0xfff)

Style: Prefer 'ul' suffix over explicit casts for clarity and
consistency with existing codebase patterns (e.g., 0xffful, 0xfffffful).
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
7c8faac279 fix: add explicit type casts for uint64_t operations
- arm64_check_offset: use (uint64_t)0x1ff for consistency with scaled_mask
- arm64_sub_sp: use 0xffful suffix for uint64_t diff parameter

These changes ensure consistent type handling and avoid implicit
integer promotions when working with 64-bit values.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
44fef0743b style: fix arm64-asm.c to match TCC codebase conventions
- Remove unnecessary braces from single-statement if blocks
- Remove trailing whitespace throughout file
- Remove duplicate comment

Style now matches existing ARM64 backend and TCC conventions:
- Allman style for function definitions
- No braces for single-statement control structures
- Consistent 4-space indentation
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
d9b0c5b920 feat: implement ARM64 extended inline assembly support
Implement full GCC-style extended inline assembly for ARM64 backend:

- Add constraint parsing (constraint_priority, skip_constraint_modifiers)
- Implement register allocation (asm_compute_constraints)
- Add code generation for prolog/epilog and load/store (asm_gen_code)
- Support output/input/read-write operands with r, w, f, x, m, g constraints
- Support immediate constraints (i, I, J, K, L, n)
- Handle clobber lists (registers, memory, cc)
- Support constraint references, early clobber, named operands
- Fix '#' character handling in tccpp.c for ARM64 asm mode

Tests: Add comprehensive test suite with 18 test cases covering all features.

All existing TCC tests continue to pass.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
7fe9c22cf2 arm64-asm.c: reject invalid registers in address operands
parse_addr_operand() silently accepted invalid register names like
[xyz] without error. Now explicitly validates the register and calls
tcc_error() if arm64_parse_regvar() returns -1 or >= 32.

Before: invalid registers caused silent wrong code or confusing errors
After: clear error message 'invalid register in address operand'
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
a702dcce9e arm64-asm.c: fix shift instruction encoding to match ARM ISA
LSL/LSR/ASR immediate shifts are UBFM/SBFM aliases with specific
immr/imms field encodings:
- LSL #shift: immr = (width - shift) & 0x3F, imms = width - 1
- LSR #shift: immr = shift & 0x3F, imms = width - 1
- ASR #shift: immr = shift & 0x3F, imms = width - 1

Fixes:
- immr field now always masked with 0x3F (6 bits), not width-1
- imms field is constant (width-1), not calculated from shift
- ROR uses EXTR format (Rm=shift, Rn=src, Rd=dest), not UBFM format

Based on ARM ARM documentation for UBFM/SBFM/EXTR instructions.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
1153e48335 bt-dll.c: use REDIR_PTR_INDIR macro for __bound_ptr_add
__bound_ptr_add was implemented manually while adjacent __bound_ptr_indir*
functions used the REDIR_PTR_INDIR macro. This consolidates the pattern
for consistency.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
9f5cc97690 tccpe.c: fix typo in pe_add_unwind_info function name
The function was named pe_add_uwwind_info (uwwind → unwind) in two
places (x86_64 and ARM64 versions). Fixed both declarations and their
call sites.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
356e22677a arm64-asm: accept symbolic branch targets 2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
2203a4407a winnt.h: add compile-time CONTEXT size assertions for fallback path
The static assertions in tccrun.c only validate CONTEXT when building
native Windows ARM64 (_WIN64 && __aarch64__). Cross-compilation builds
use the fallback definition without validation, so layout errors would
be silent.

Add matching C_ASSERT() checks after the ARM64_NT_CONTEXT definition
to catch struct layout mismatches during cross-compilation.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
680c2d40e8 arm64-asm.c: remove dead operand type definitions
OPT_VREG, OPT_IM12, OPT_SHIFT, and OPT_REGSET were defined in the enum
and as OP_* bit masks but never used by any parsing function or
instruction handler in arm64-asm.c.

These appear to be artifacts copied from other assembler implementations
(arm-asm.c uses OP_VREG32/OP_VREG64/OP_REGSET32, riscv64-asm.c uses
OP_IM12S) but were never integrated into the ARM64 operand parsing logic.

Removing these unused definitions:
- Eliminates confusion for developers
- Reduces code clutter
- Makes the actual operand types (OPT_REG, OPT_IM, OPT_ADDR, OPT_COND)
  clearer
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
3f26af7b4c arm64-asm.c: consolidate near-identical code generation functions
Three pairs of functions differed only in base opcode constants:
- gen_movz/gen_movn/gen_movk (34 lines → 17 lines core + 9 lines wrappers)
- gen_b/gen_bl (14 lines → 8 lines core + 6 lines wrappers)
- gen_cbz/gen_cbnz (18 lines → 10 lines core + 8 lines wrappers)

Consolidated each into a single core function with base_opcode parameter:
- gen_mov_with_base() - handles MOVZ, MOVN, MOVK
- gen_b_or_bl() - handles B, BL
- gen_cbz_or_cbnz() - handles CBZ, CBNZ

Original functions retained as thin wrappers for backward compatibility.

Net reduction: 20 lines (66 → 46), eliminates code duplication hazard.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
1f60eb4574 arm64-asm.c: deduplicate branch condition mapping
asm_branch() had two identical 15-case switch blocks (30 lines total)
that duplicated condition code mapping. This also duplicated the logic
in the existing parse_condition() helper.

Added get_branch_condition() helper that:
1. Maps branch tokens (TOK_ASM_beq) to condition tokens (TOK_ASM_eq)
2. Calls the existing parse_condition() helper
3. Returns the condition code (0-13) or -1 for non-conditional branches

This reduces code duplication from 30 lines to a single 29-line helper
function, and ensures all condition mapping logic is in one place.
2026-04-04 20:02:33 +07:00
Benjamin Oldenburg
5497f87f59 arm64-asm.c: validate operand types before encoding
Multiple instruction handlers were extracting op->reg without checking
that the operand was actually a register. When parse_operand() failed
to recognize a token, it set op->reg = -1, which when masked with 0x1F
became 31 (xzr/sp), silently encoding wrong instructions.

Now each handler validates operand types before extraction:
- asm_shift: validates op1 and op2 are registers
- asm_data_proc: validates op1, op2, and op3 are registers
- asm_ldst: validates op1 is register, op2 is address
- asm_ldst_pair: validates op1 and op2 are registers, op3 is address

This implements fail-fast behavior to catch typos and invalid operands
immediately rather than producing silently incorrect code.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
a1bf1d187d arm64-asm.c: reject invalid operands in parse_operand()
Previously, parse_operand() would silently accept any unrecognized token
and pass it to asm_expr() as an immediate, causing typos like:
  add x0, x1, xyz    ; 'xyz' is not a valid register
to be silently assembled as a symbol reference instead of erroring.

Now, if a token is not a register, condition code, or valid immediate
prefix (#, :, @, $), an error is emitted for identifier tokens.

This implements fail-fast behavior for invalid operands, making it easier
to catch typos and mistakes in assembly code.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
95c17cae64 winnt.h: remove dead ARM64 CONTEXT fallback code
The fallback CONTEXT definition at lines 2073-2124 was unreachable dead code.
The guard '#if defined(__aarch64__) && !defined(_ARM64_CONTEXT_DECLARED)'
could never be true because:

1. Line 50-51: __aarch64__ automatically defines _ARM64_
2. Line 1426: #if defined(_ARM64_) || defined(__aarch64__) always enters
3. Line 1473: _ARM64_CONTEXT_DECLARED is always defined inside that block
4. Line 2073: The fallback guard is therefore always false

This 52-line duplicate was a maintenance hazard that could silently diverge
from the official ARM64_NT_CONTEXT definition. Remove it entirely.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
aa95cfad10 winnt.h: fix ARM64 CONTEXT Bvr/Wvr register types
The fallback CONTEXT struct incorrectly defined Bvr (Breakpoint Value
Registers) and Wvr (Watchpoint Value Registers) as DWORD (32-bit) instead
of DWORD64 (64-bit).

On ARM64:
- BCR/WCR (Control Registers) are 32-bit ✓
- BVR/WVR (Value Registers) are 64-bit ✓

This mismatch caused struct size and layout errors, potentially corrupting
debug register state when used with Windows debugging APIs.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
040583cb9b winnt.h: define missing PE DLL characteristics 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
f277a57172 tccpe.c: remove duplicate IMAGE_DLLCHARACTERISTICS_* defines
These macros were defined twice (lines ~273 and ~317) with identical
values and #ifndef guards. The duplicates appear to be a copy-paste
oversight from adding ARM64 support.

Remove the redundant second set of defines. The first set (lines 273-284)
already provides the fallback definitions needed when Windows headers
are unavailable.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
5f641e2a25 winnt.h: fix ARM64 CONTEXT struct layout mismatch
The fallback CONTEXT struct for ARM64 had multiple structural issues:
- ContextFlags was DWORD64 (8 bytes) instead of ULONG (4 bytes)
- Missing Cpsr field entirely
- Missing DECLSPEC_ALIGN(16) attribute
- X registers as simple array X[29] instead of union with named struct X[31]

These mismatches caused incorrect struct size and field offsets, leading to
register corruption when used with Windows APIs like GetThreadContext or
RtlRestoreContext.

The fallback struct now matches the official ARM64_NT_CONTEXT layout exactly,
ensuring binary compatibility with Windows ARM64 system calls.
2026-04-04 20:02:32 +07:00
OpenCode
d2c06612a5 arm64-asm: validate register width consistency in data processing instructions
The asm_data_proc function was OR-ing register widths together, which
allowed invalid ARM64 instructions like 'add x0, w1, w2' (mixed widths).
ARM64 requires all registers in data processing instructions to have
the same width (all X or all W).

Fix by validating that all three operand registers have matching widths
and emitting an error if they don't match.
2026-04-04 20:02:32 +07:00
OpenCode
a27bd8a7c3 Remove trailing whitespaces from source files 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
6c5aac0da6 tccpe.c: fix msvcrt.dll handle leak on Windows ARM64
pe_get_process_msvcrt_handle() used LoadLibraryA which increments the
module reference count, but never called FreeLibrary to release it.

Use GetModuleHandleA instead, which returns a handle to the already-
loaded msvcrt.dll module without incrementing the reference count.
This is the correct API for accessing system DLLs that are already
mapped into the process address space.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
4fb1e212c1 winnt.h: fix ARM64 CONTEXT V register type
The fallback CONTEXT struct for ARM64 (used when __aarch64__ is defined
but _ARM64_CONTEXT_DECLARED is not set) incorrectly defined V[32] as
DWORD64 (64-bit) instead of ARM64_NT_NEON128 (128-bit).

This caused register corruption when RtlRestoreContext restores NEON/VFP
registers, as the struct size was 256 bytes instead of the correct
512 bytes.

Fixes potential corruption on toolchains that define __aarch64__ but not
_ARM64_ (e.g., clang on macOS or certain cross-compilation scenarios).
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
da60605fd5 tests: probe builtin support directly in tcctest 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
d317b34c71 win32: make test matrix pass across toolchains 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
c90aae5a9e tests: fix non-fatal Windows ARM64 diagnostics 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
dab39b0fe9 Revert "tests: clean trailing whitespace in win32 reference files"
This reverts commit eebee5340406d853eecb722154fb22c32e54844c.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
3ba2a224fa Revert "tests: avoid trailing spaces in tcctest output"
This reverts commit 2141b3f68f39636c5f3abbee7f1d37e8a2a998f8.
2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
10816e30bb tests: avoid trailing spaces in tcctest output 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
545378ed5b arm64: clarify inline asm support boundary 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
17801cdac8 tests: clean trailing whitespace in win32 reference files 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
0decb1b86f arm64: support mnemonic win32 stack helpers 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
b1763f8629 tests: fix tcctest.c for modern clang (15+) on macOS 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
f220a8c32f win32: recognize MSVC-style ARM64 host macros 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
6cb9d7ae53 arm64: use union type tags in HFA classification 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
684acad263 bcheck: restore atomic never_fatal on Windows 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
98194f55ba Fix arm64 offset mask warnings 2026-04-04 20:02:32 +07:00
Benjamin Oldenburg
e661cb9a62 Restrict arm64 old-style vararg handling to PE 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
55acb9272e Fix backtrace formatting and arm64 old-style calls 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
c24d95db09 Remove Windows ARM64 temp-exe fallback 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
4371ebd682 Fix Windows x64 env/runtime and PE alignment tests 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
396675f74f Fix Windows ARM64 runtime regressions and coverage 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
7e7917c3c9 Restore generic backtrace runtime path 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
a1da6220e3 Fix generic backtrace regression outside Windows 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
177b76b844 Add ARM64 Windows coverage and fix native run guard 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
cf4441c415 Fix Windows ARM64 runtime regressions 2026-04-04 20:02:31 +07:00
Benjamin Oldenburg
4f4b3dda6b Add and stabilize Windows ARM64 support 2026-04-04 20:02:31 +07:00
Stefan
98765e5ebc libtcc.c: Change parameter name of tcc_set_realloc
Some checks failed
build and test / test-x86_64-linux (push) Has been cancelled
build and test / test-x86_64-osx (push) Has been cancelled
build and test / test-aarch64-osx (push) Has been cancelled
build and test / test-x86_64-win32 (push) Has been cancelled
build and test / test-i386-win32 (push) Has been cancelled
build and test / test-armv7-linux (push) Has been cancelled
build and test / test-aarch64-linux (push) Has been cancelled
build and test / test-riscv64-linux (push) Has been cancelled
In libtcc.h there is void tcc_set_realloc(TCCReallocFunc *my_realloc).
Name the parameter in the function definition in libtcc.c accordingly.
2026-03-28 15:44:20 +01:00