refactor(wasm): emit CASM instead of MIR #343

gael-bigot · 2025-10-09T16:18:07Z

This PR refactors the entire WASM crate to emit Cairo-M assembly (CASM) directly instead of MIR.

Changes in codegen API

`generator.rs`

add_function_from_builder helper which allows building functions manually without depending on a MirFunction object

`layout.rs`

make some fields and methods public

builder

make some fields public
make instruction builder helpers public when they depend on Values rather than offsets
add helpers for loading and storing Values

Changes in wasm crate

The biggest conceptual change is switching from a phi node system to a slot system in cfg
ops: required some refactoring, as many opcodes do not have a direct equivalent in CASM

Other

main: add DAG printing in --verbose mode
improve the test caching system, which previously crashed all diff tests if a single one did not pass

Closes CORE-1292


claude · 2025-10-09T16:18:24Z

Claude finished @gael-bigot's task

Code Review for WASM → CASM Refactor

TLDR: This is a substantial architectural improvement that moves WASM compilation directly to CASM, bypassing MIR. The implementation looks solid with good test coverage updates. Main concerns are around error handling robustness and some API design choices.

Review Progress

Analyze core architectural changes in lowering module
Review codegen API modifications
Check test coverage and snapshot updates
Assess performance and security implications
Address reviewer comments
Provide comprehensive feedback

Key Architectural Improvements

Excellent Design Decisions:

Slot-based value management: The switch from phi nodes to pre-allocated slots is cleaner and more predictable for CASM generation
Two-pass label handling: Pre-registering labels and their slots prevents forward reference issues
Caching improvements: The new test caching system (test_utils.rs:60-72) handles poisoned mutexes gracefully, making tests more robust
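To make the slot idea concrete, here is a minimal sketch of how pre-registered label slots could replace phi nodes. It is not the PR's code: `SlotLowering`, `Instr`, and all method names are hypothetical stand-ins, and the toy instructions only loosely resemble CASM.

```rust
use std::collections::HashMap;

/// Toy instruction set standing in for CASM (hypothetical, not the real opcodes).
#[derive(Debug, Clone, PartialEq)]
enum Instr {
    /// Copy the value at frame offset `src` into frame offset `dst`.
    Store { dst: u32, src: u32 },
    Jump(String),
    Label(String),
}

/// Hypothetical lowering context: every label owns a fixed list of frame slots.
#[derive(Default)]
struct SlotLowering {
    next_free: u32,
    label_slots: HashMap<String, Vec<u32>>,
    code: Vec<Instr>,
}

impl SlotLowering {
    /// Pass 1: reserve `arity` slots for a label before any branch to it is lowered.
    fn preregister_label(&mut self, name: &str, arity: u32) {
        let slots: Vec<u32> = (self.next_free..self.next_free + arity).collect();
        self.next_free += arity;
        self.label_slots.insert(name.to_string(), slots);
    }

    /// Pass 2: a branch copies its outgoing values into the target's slots, then jumps.
    fn branch(&mut self, target: &str, values: &[u32]) -> Result<(), String> {
        let slots = self
            .label_slots
            .get(target)
            .ok_or_else(|| format!("label {} was not pre-registered", target))?;
        if slots.len() != values.len() {
            return Err(format!("label {} expects {} values", target, slots.len()));
        }
        for (dst, src) in slots.iter().zip(values) {
            self.code.push(Instr::Store { dst: *dst, src: *src });
        }
        self.code.push(Instr::Jump(target.to_string()));
        Ok(())
    }

    /// Code placed after the label reads its incoming values from the same slots.
    fn place_label(&mut self, name: &str) -> Option<&[u32]> {
        self.code.push(Instr::Label(name.to_string()));
        self.label_slots.get(name).map(|s| s.as_slice())
    }
}
```

Because the slots exist before any branch is lowered, a forward branch never needs to know where its target will end up, which is the property the two-pass label handling relies on.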

API Design:

add_function_from_builder() in generator.rs:351 provides good encapsulation for WASM lowering without exposing internals
Making builder fields public is reasonable for this use case, though consider selective exposure via accessor methods

Code Quality Issues

🔴 Critical Issues

1. Unimplemented BrTable (cfg.rs:211)

Operation::BrTable { targets: _ } => {
    todo!()
}

This will panic at runtime if WASM contains branch tables. Should return a proper error.
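A minimal sketch of what returning an error could look like. The `LoweringError` type, its fields, and the string "instructions" are assumptions made for illustration; the actual `DagToCasmError` variants in the PR may be shaped differently.

```rust
/// Hypothetical stand-in for the crate's lowering error type.
#[derive(Debug, PartialEq)]
enum LoweringError {
    UnsupportedOperation {
        function_name: String,
        operation: String,
    },
}

/// Reduced operation set, just enough to show the match arm.
enum Operation {
    Br(u32),
    BrTable { targets: Vec<u32> },
}

/// Lowers one operation to toy textual instructions (not real CASM syntax).
fn lower_operation(function_name: &str, op: &Operation) -> Result<Vec<String>, LoweringError> {
    match op {
        Operation::Br(depth) => Ok(vec![format!("jmp block_{}", depth)]),
        // Instead of todo!(), report the unsupported construct to the caller.
        Operation::BrTable { targets: _ } => Err(LoweringError::UnsupportedOperation {
            function_name: function_name.to_string(),
            operation: "BrTable".to_string(),
        }),
    }
}
```

With this shape, a module containing a branch table fails lowering with a diagnostic that names the function, rather than aborting the compiler process.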

2. Type System Inconsistencies (ops.rs:129, 237)

Comparisons produce Bool, but their results are used as u32 in conditionals
Type handling needs to be consistent between WASM and Cairo-M semantics

🟡 Design Concerns

1. Error Handling Robustness

Many unwrap() calls in lowering/mod.rs:244-252 could fail more gracefully
Consider propagating a Result instead of panicking on type mismatches
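As an illustration of the suggested pattern, the sketch below checks operand types with `?` instead of `unwrap()`. `ValueType`, `TypeError`, and both functions are hypothetical and are not taken from lowering/mod.rs.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum ValueType {
    U32,
    Bool,
}

/// Hypothetical error type for operand checks.
#[derive(Debug, PartialEq)]
enum TypeError {
    MissingInput { index: usize },
    Mismatch { expected: ValueType, found: ValueType },
}

/// Returns the type at `index` if it matches `expected`, otherwise an error.
fn expect_type(
    inputs: &[ValueType],
    index: usize,
    expected: ValueType,
) -> Result<ValueType, TypeError> {
    // `get` + `ok_or` replaces indexing or `unwrap()`, both of which would panic.
    let found = *inputs.get(index).ok_or(TypeError::MissingInput { index })?;
    if found != expected {
        return Err(TypeError::Mismatch { expected, found });
    }
    Ok(found)
}

/// Validates the operands of a binary u32 operation, propagating the first failure.
fn check_binary_u32(inputs: &[ValueType]) -> Result<(), TypeError> {
    expect_type(inputs, 0, ValueType::U32)?;
    expect_type(inputs, 1, ValueType::U32)?;
    Ok(())
}
```

The caller decides how to report the failure, and the error carries enough context (index, expected and found types) to produce a useful message.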

2. Code Duplication (mod.rs:220)
Parameter allocation logic duplicates FunctionLayout::allocate_parameters_with_sizes() - consider refactoring shared ABI logic.

3. Memory Safety

No bounds checking for array operations in ops.rs:200-234
Memory addresses should be validated against MAX_ADDRESS
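A possible shape for such a check, for the case where the address is known at compile time. The value of `MAX_ADDRESS` below is a placeholder, and `MemoryError` and `checked_address` are hypothetical names; addresses only known at runtime would need an emitted check instead.

```rust
/// Placeholder bound; the real MAX_ADDRESS is defined by Cairo-M, not by this sketch.
const MAX_ADDRESS: u32 = (1 << 28) - 1;

/// Hypothetical error describing a rejected access.
#[derive(Debug, Clone, Copy, PartialEq)]
enum MemoryError {
    OutOfBounds { base: u32, offset: u32, width: u32 },
}

/// Returns the first address of an access of `width` cells at `base + offset`,
/// or an error if any touched cell would lie beyond MAX_ADDRESS.
fn checked_address(base: u32, offset: u32, width: u32) -> Result<u32, MemoryError> {
    let err = MemoryError::OutOfBounds { base, offset, width };
    if width == 0 {
        return Err(err);
    }
    // checked_add guards against u32 wrap-around before comparing to the bound.
    let start = base.checked_add(offset).ok_or(err)?;
    let last = start.checked_add(width - 1).ok_or(err)?;
    if last > MAX_ADDRESS {
        return Err(err);
    }
    Ok(start)
}
```

Checking the last touched cell rather than only the first one matters for multi-cell values, where the start can be in range while the end is not.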

Test Coverage Assessment

✅ Strengths:

Comprehensive snapshot tests covering all major WASM constructs
Good diff testing against WOMIR interpreter
Improved caching prevents cascading test failures
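One common way to keep a shared test cache usable after a panicking test is to recover the guard from a poisoned mutex. The sketch below shows that idiom with a hypothetical `BuildCache`; it is not the code in test_utils.rs, which may handle poisoning differently.

```rust
use std::collections::HashMap;
use std::sync::{Mutex, PoisonError};

/// Hypothetical cache of build artifacts shared between tests.
struct BuildCache {
    entries: Mutex<HashMap<String, String>>,
}

impl BuildCache {
    fn new() -> Self {
        BuildCache { entries: Mutex::new(HashMap::new()) }
    }

    /// Returns the cached value for `key`, building and storing it on a miss.
    fn get_or_build<F: FnOnce() -> String>(&self, key: &str, build: F) -> String {
        // A poisoned lock only means another thread panicked while holding it;
        // into_inner recovers the guard instead of failing every later test.
        if let Some(hit) = self
            .entries
            .lock()
            .unwrap_or_else(PoisonError::into_inner)
            .get(key)
        {
            return hit.clone();
        }
        // Build outside the lock so a panicking build cannot poison the cache.
        let value = build();
        self.entries
            .lock()
            .unwrap_or_else(PoisonError::into_inner)
            .insert(key.to_string(), value.clone());
        value
    }
}
```

Recovering the guard is only safe when a half-finished update cannot leave the map in an inconsistent state, which holds here because each insert is a single operation.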

⚠️ Gaps:

No tests for error conditions (invalid WASM, unsupported ops)
Missing edge case tests for loop nesting depth limits
BrTable operations completely untested

Performance Considerations

Positive:

Direct CASM emission avoids MIR intermediate representation overhead
Two-pass approach minimizes label resolution complexity
Caching system improves test performance

Potential Issues:

Slot allocation may use more memory than phi nodes for simple cases
No optimization passes on generated CASM

Security Considerations

Low Risk:

No obvious security vulnerabilities
Proper bounds checking on loop depth (cfg.rs:397-402)
Good error messages that don't leak sensitive information

Responses to Review Comments

@enitrat on generator.rs:197: The removed code appears to be dead - good cleanup.

@enitrat on layout.rs: Documentation suggestion is good - the current comment doesn't fully explain the allocation behavior.

Recommendations

High Priority

Replace todo!() with proper error: DagToCasmError::UnsupportedOperation for BrTable
Fix type consistency: Standardize bool/u32 handling in comparisons
Add bounds checking: Validate memory addresses in load/store operations

Medium Priority

Refactor parameter allocation: Extract shared logic to avoid duplication
Add error condition tests: Test invalid WASM inputs and unsupported operations
Consider optimization passes: Add basic peephole optimizations for generated CASM

Low Priority

API refinement: Consider accessor methods instead of public fields where appropriate
Documentation: Add more detailed comments explaining the slot allocation strategy

This refactor represents a significant improvement in the WASM compilation pipeline. The core architecture is sound, but addressing the critical issues (especially the todo!()) should be prioritized before merge.

crates/compiler/codegen/src/generator.rs

crates/compiler/codegen/src/layout.rs

enitrat · 2025-10-24T11:00:30Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-10-24T11:07:00Z

crates/wasm/src/lowering/cfg.rs

+                        SolvedBreakTarget::Return(_return_values) => {
+                            // These seem to never be generated by WOMIR
+                            Err(DagToCasmError::InvalidControlFlow {
+                                function_name: self.casm_builder.layout.name.clone(),
+                                reason: "Conditional return not yet implemented".to_string(),
+                                operation_context: "resolving BrIf return".to_string(),
+                            })?;
+                        }
+                    }
                }

                Operation::BrIfZero(target) => {
                    // Inverted conditional branch - condition is the last input
+                    // BrIfZero takes the branch when condition is ZERO
                    let cond_idx = node.inputs.len().checked_sub(1).ok_or_else(|| {
-                        DagToMirError::InvalidControlFlow {
-                            function_name: self.mir_function.name.clone(),
+                        DagToCasmError::InvalidControlFlow {
+                            function_name: self.casm_builder.layout.name.clone(),
                            reason: "BrIfZero without condition input".to_string(),
                            operation_context: "resolving BrIfZero condition".to_string(),
                        }
                    })?;
                    let condition_value = self.get_input_value(&node.inputs[cond_idx])?;
-                    let else_target = self.resolve_break_target(node_idx, node, target)?;
-                    let then_target = self.mir_function.add_basic_block();

-                    // Edge copies on the taken edge
-                    self.record_edge_values(node, target)?;
+                    let branch_values = self.get_branch_values(node)?;

-                    let terminator = Terminator::branch(condition_value, then_target, else_target);
-                    self.get_current_block()?.set_terminator(terminator);
-                    self.set_current_block(then_target);
+                    let resolved_target = self.resolve_break_target(node_idx, node, target)?;
+
+                    match resolved_target {
+                        SolvedBreakTarget::Label(label) => {
+                            let fallthrough_label =
+                                self.casm_builder.emit_new_label_name(".fallthrough");
+
+                            // If condition is non-zero, skip the branch (jump to fallthrough)
+                            self.casm_builder
+                                .jnz(condition_value, fallthrough_label.as_str())?;
+
+                            // Taken path (when zero): store values and jump to target
+                            self.store_to_label_slots(target, &branch_values)?;
+                            self.casm_builder.jump(label.as_str());
+
+                            // Fallthrough path continues here
+                            self.casm_builder.emit_add_label(Label {
+                                name: fallthrough_label,
+                                address: None,
+                            });
+                        }
+                        SolvedBreakTarget::Return(_return_values) => {
+                            // These seem to never be generated by WOMIR
+                            Err(DagToCasmError::InvalidControlFlow {
+                                function_name: self.casm_builder.layout.name.clone(),
+                                reason: "Conditional return not yet implemented".to_string(),
+                                operation_context: "resolving BrIfZero return".to_string(),
+                            })?;


Conditional branches to function exit now raise an error

The new BrIf and BrIfZero handlers bail out with InvalidControlFlow whenever the resolved break target represents a return. The previous MIR-based lowering handled this case by emitting a conditional branch whose taken edge performed the return. In WebAssembly it is valid (and common) to conditionally return from a function by branching to the outermost block, so this path can occur in real DAGs. With the current code such constructs will cause lowering to fail with "Conditional return not yet implemented", making the compiler reject valid input. Consider emitting the same store-and-return sequence used in the unconditional Br case instead of erroring out.
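A rough sketch of that suggestion, using a toy emitter rather than the PR's `casm_builder`. `Emitter`, `BreakTarget`, `Instr`, and the slot layout are all hypothetical; the point is only that the return case can reuse the skip-over structure already emitted for labels.

```rust
/// Toy instructions standing in for CASM (hypothetical).
#[derive(Debug, Clone, PartialEq)]
enum Instr {
    Jnz { cond: u32, label: String },
    Store { dst: u32, src: u32 },
    Jump(String),
    Ret,
    Label(String),
}

/// Simplified counterpart of a resolved break target.
enum BreakTarget {
    Label { name: String, slots: Vec<u32> },
    Return { return_slots: Vec<u32> },
}

#[derive(Default)]
struct Emitter {
    code: Vec<Instr>,
    next_label: u32,
}

impl Emitter {
    fn fresh_label(&mut self, prefix: &str) -> String {
        let name = format!("{}_{}", prefix, self.next_label);
        self.next_label += 1;
        name
    }

    fn store_all(&mut self, slots: &[u32], values: &[u32]) {
        for (dst, src) in slots.iter().zip(values) {
            self.code.push(Instr::Store { dst: *dst, src: *src });
        }
    }

    /// BrIfZero: take the branch (or return) when `cond` is zero.
    fn br_if_zero(&mut self, cond: u32, target: &BreakTarget, values: &[u32]) {
        let fallthrough = self.fresh_label(".fallthrough");
        // Non-zero condition skips the taken path.
        self.code.push(Instr::Jnz { cond, label: fallthrough.clone() });
        match target {
            BreakTarget::Label { name, slots } => {
                self.store_all(slots, values);
                self.code.push(Instr::Jump(name.clone()));
            }
            // Conditional return: store the return values, then return.
            BreakTarget::Return { return_slots } => {
                self.store_all(return_slots, values);
                self.code.push(Instr::Ret);
            }
        }
        self.code.push(Instr::Label(fallthrough));
    }
}
```

For BrIf the same structure would apply with the skip condition inverted, which depends on what conditional jumps the real builder offers.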

Useful? React with 👍 / 👎.

@gael-bigot OK for me after that

gael-bigot added 10 commits October 9, 2025 16:26

refactor: emit casm

8eb64f9

function_from_builder in CodeGenerator

039d89c

revert pub on CasmBuilder fields

a711d6c

helper functions for load/store in builder

0379bac

make branch_if_nonzero_to public for consistency with other helper functions

7279d26

remove useless .wasm

f6c7b20

revert removing labels()

30b8675

remove name parameter in add_function_from_builder

021ed0c

fix label gen

33ed392

code dedup

13933db

enitrat reviewed Oct 9, 2025

crates/compiler/codegen/src/generator.rs (resolved)

enitrat reviewed Oct 9, 2025

crates/compiler/codegen/src/layout.rs (outdated, resolved)

gael-bigot added 3 commits October 10, 2025 10:38

suggestions, fix lint

84a934d

fixes

d5493ec

minor refactoring and cursor comment removal

c83a025

gael-bigot marked this pull request as ready for review October 10, 2025 12:17

chatgpt-codex-connector bot reviewed Oct 24, 2025


enitrat Oct 27, 2025


3 participants
