Aether Development Guidelines

This document provides comprehensive development guidelines for contributing to the Aether programming language interpreter.

First-Time Setup

After cloning the repo, run this once to enable the shared pre-commit hook:

git config core.hooksPath .githooks

The hook runs cargo fmt --check, cargo clippy, and cargo test before every commit. The hook file lives in .githooks/pre-commit (tracked in the repo) so every contributor gets the same checks.

Table of Contents

Post-Feature Checklist
Code Organization
Testing Strategy
Memory Leak Detection
Error Handling
Code Style
Incremental Development
Code Quality
Performance
Debugging
Documentation
Dependencies

Post-Feature Checklist

Every feature — no matter how small — must pass this checklist before committing.

1. Tests

Integration test file exists — tests/<feature>_test.rs with at minimum:
- Happy path (normal usage)
- Edge cases (empty input, zero, null, large values)
- Error cases (wrong type, wrong arity, out-of-bounds)
All existing tests still pass: cargo test -- --test-threads=1
Test count in CLAUDE.md is updated if the suite total changes significantly

2. Example Program

examples/<feature>_demo.ae exists and covers every function/method added
Example runs cleanly: cargo run -- examples/<feature>_demo.ae
Each demo section is labeled (--- 1. feature_name ---) for easy reading

3. Documentation

Component doc updated — the relevant doc in docs/ (e.g. INTERPRETER.md, STDLIB.md) mentions the new function/syntax
CLAUDE.md updated — add the new function to the feature summary table and the relevant test count if it changed
docs/BACKLOG.md updated — mark the item done or add any new items discovered during implementation
If the feature introduces a new env var or config knob: docs/CONFIGURATION.md updated

4. Memory Leak Check

Run cargo test --test gc_test -- --test-threads=1 — all GC tests pass
For features that add a new Value variant or struct with Rc<T>: add a GC test that verifies no cycle is introduced (see Memory Leak Detection)
On macOS, spot-check with leaks: leaks --atExit -- ./target/debug/aether examples/<feature>_demo.ae

5. Code Quality

cargo fmt — no formatting changes needed
cargo clippy — zero new warnings
Commit is atomic and message explains the why

Code Organization

Module Structure

src/
├── main.rs                # Entry point, CLI handling
├── lib.rs                 # Library exports
├── lexer/
│   ├── mod.rs             # Lexer module exports
│   ├── token.rs           # Token type definitions
│   ├── scanner.rs         # Tokenization logic
│   └── lexer_tests.rs     # Lexer tests (14 tests) ✅
├── parser/
│   ├── mod.rs             # Parser module exports
│   ├── ast.rs             # AST node definitions
│   ├── parse.rs           # Recursive descent parser
│   └── parser_tests.rs    # Parser tests (53 tests) ✅
└── interpreter/           # ✅ Complete
    ├── mod.rs             # Interpreter module exports
    ├── value.rs           # Runtime value types (Rc-wrapped)
    ├── environment.rs     # Variable scoping + RuntimeError
    ├── builtins.rs        # Built-in functions
    ├── stdlib.rs          # Stdlib module loader
    ├── io_pool.rs         # I/O thread pool + HttpOptions
    ├── event_loop.rs      # EventLoopQueue for on_ready/event_loop
    ├── interpreter_tests.rs # Interpreter tests (17 tests) ✅
    ├── builtins_tests.rs  # Built-in tests (15 tests) ✅
    └── evaluator/         # Split from evaluator.rs (>1000 lines)
        ├── mod.rs         # Evaluator struct, constructors, public API
        ├── expressions.rs # eval_expr, eval_index, eval_slice, await_value
        ├── statements.rs  # exec_stmt_internal (all Stmt variants)
        ├── functions.rs   # eval_call, call_value, try_submit_io_task
        ├── members.rs     # eval_member, eval_method_call
        ├── modules.rs     # load_module, import resolution
        └── operators.rs   # eval_unary, eval_binary

Test File Convention: Use <module>_tests.rs naming pattern for test files.

File Size Limit: 1000 Lines

Rule: When any source file (.rs) grows beyond 1000 lines, split it into a sub-module directory.

For source files — convert foo.rs into foo/mod.rs and extract logical groups into sibling files:

# Before (foo.rs exceeds 1000 lines)
src/interpreter/evaluator.rs

# After (split into module)
src/interpreter/evaluator/
├── mod.rs          # Struct definition, constructors, public API
├── expressions.rs  # eval_expr and sub-expression handlers
├── operators.rs    # Arithmetic, comparison, logical operators
├── statements.rs   # exec_stmt_internal, all statement handlers
├── functions.rs    # eval_call, call_value, exec_async_body
├── members.rs      # eval_member, method dispatch, collection methods
└── modules.rs      # Module loading and import resolution

Each sub-file is part of the same module — all impl Foo { ... } blocks work together. Use pub(super) for shared types (e.g. ControlFlow) and use super::*; or explicit imports in sub-files.
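A minimal sketch of how such a split behaves, using inline modules as stand-ins for the sibling files so that it fits in one listing. The `run`, `exec_stmt`, and `calls` names and the simplified `ControlFlow` are hypothetical, not Aether's real API; only the visibility pattern is the point.

```rust
// Stand-in for src/interpreter/evaluator/mod.rs. In the repo each inner
// `mod` below would be its own sibling file (operators.rs, statements.rs).
mod evaluator {
    /// Shared type: `pub(super)` exposes it to the parent module only.
    pub(super) enum ControlFlow {
        Next,
        Return(i64),
    }

    #[derive(Default)]
    pub struct Evaluator {
        calls: usize, // private, yet visible to every sub-file of this module
    }

    impl Evaluator {
        /// Public API stays in mod.rs and delegates to the sub-files.
        pub fn run(&mut self, operands: Option<(i64, i64)>) -> Option<i64> {
            match self.exec_stmt(operands) {
                ControlFlow::Return(v) => Some(v),
                ControlFlow::Next => None,
            }
        }

        pub fn calls(&self) -> usize {
            self.calls
        }
    }

    // Stand-in for evaluator/operators.rs
    mod operators {
        use super::*;

        impl Evaluator {
            pub(super) fn eval_binary(&mut self, lhs: i64, rhs: i64) -> i64 {
                self.calls += 1;
                lhs + rhs
            }
        }
    }

    // Stand-in for evaluator/statements.rs
    mod statements {
        use super::*;

        impl Evaluator {
            pub(super) fn exec_stmt(&mut self, operands: Option<(i64, i64)>) -> ControlFlow {
                match operands {
                    Some((lhs, rhs)) => ControlFlow::Return(self.eval_binary(lhs, rhs)),
                    None => ControlFlow::Next,
                }
            }
        }
    }
}
```

Each `impl Evaluator` block adds methods to the same type, and `pub(super)` keeps the helpers out of the crate's public surface while letting sibling files call them.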

For test files — when an integration test file exceeds 1000 lines, create a sub-directory:

# Before (array_methods_test.rs exceeds 1000 lines)
tests/array_methods_test.rs

# After
tests/array_methods/
├── push_pop_test.rs
├── sort_concat_test.rs
└── slice_spread_test.rs

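Cargo only auto-discovers top-level files in tests/ (plus tests/<name>/main.rs), so tests in a sub-directory must be pulled in from a wrapper file, either through mod declarations or the #[path] attribute.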

Why 1000 lines?

Files beyond this size are hard to navigate and review
Logical splits make it easier to find where to add new functionality
Smaller files have faster incremental compile times
Split boundaries become natural documentation of responsibility

Module Responsibilities

Lexer: Converts source code into tokens
Parser: Converts tokens into an Abstract Syntax Tree
Interpreter: Evaluates the AST and executes code
Each module should be independently testable

Feature Implementation Decision Tree

Should this be a Built-in (Rust) or Stdlib (Aether)?

When adding new functionality to Aether, use this decision tree:

Built-in (Rust) if it:

Requires access to interpreter internals (e.g., type(), len())
Is performance-critical (e.g., operators, indexing, loops)
Needs unsafe code or FFI (e.g., file I/O, system calls)
Is a core I/O operation (e.g., print(), println())
Implements primitive operations that can’t be expressed in Aether

Standard Library (Aether) if it:

Can be built using existing primitives
Contains user-modifiable logic
Benefits from being readable by users
Is a higher-level utility (e.g., map(), filter(), range())
Could be implemented by a user in their own code

Examples:

✅ Built-in: print(), len(), type(), array.push(), arithmetic operators
✅ Stdlib: map(), filter(), range(), abs(), join(), reverse()

Rule of thumb: If you can write it in Aether without accessing interpreter internals, put it in the standard library!

Decision Process

Does it need interpreter internals? ──Yes──> Built-in (Rust)
           │
           No
           │
Is it performance-critical (10x+ difference)? ──Yes──> Built-in (Rust)
           │
           No
           │
Can users understand/modify it? ──Yes──> Stdlib (Aether)
           │
           No
           │
        Built-in (Rust)

Testing Strategy

Test-Driven Development

Write tests first before implementing features
Test each component in isolation before integration
Follow the red-green-refactor cycle:
- Red: Write a failing test
- Green: Write minimal code to pass the test
- Refactor: Improve code while keeping tests green

Test Organization

Tests are organized in separate <module>_tests.rs files:

// In src/lexer/lexer_tests.rs
use super::scanner::Scanner;
use super::token::{Token, TokenKind};

#[test]
fn test_tokenize_integer() {
    let mut scanner = Scanner::new("42");
    let tokens = scanner.scan_tokens().unwrap();
    assert_eq!(tokens.len(), 2); // integer + EOF
    assert_eq!(tokens[0].kind, TokenKind::Integer(42));
}

Module configuration:

// In src/lexer/mod.rs
#[cfg(test)]
mod lexer_tests;

Test Coverage Goals

Lexer: Test all token types, edge cases, error conditions
Parser: Test valid syntax, operator precedence, error recovery
Interpreter: Test all operations, type checking, runtime errors
Integration: Test complete programs end-to-end

Running Tests

cargo test                    # Run all tests
cargo test lexer              # Run lexer tests only
cargo test -- --nocapture     # Show output during tests
cargo test -- --test-threads=1 # Run tests sequentially

Memory Leak Detection

Aether uses Rc<T> for garbage collection. Rc cannot collect reference cycles — if object A holds an Rc to B and B holds an Rc back to A, both will leak. This section describes how to detect and prevent leaks.

When to Run a Memory Check

Run a memory check whenever you:

Add a new Value variant that contains Rc<T> or RefCell<T>
Add a struct that holds back-references to the environment or another value
Implement a feature where a closure or instance can reference itself

Option 1: GC Test Suite (always)

cargo test --test gc_test -- --test-threads=1

tests/gc_test.rs contains Rc-cycle and drop tests. Add a new test for each new Value variant:

#[test]
fn test_file_lines_drops_cleanly() {
    // FileLines should not hold a cycle — just a BufReader<File>
    let src = r#"let it = lines_iter("/tmp/small.txt")"#;
    // If this completes without OOM / leak, the value drops cleanly
    run(src).unwrap();
}

Option 2: macOS `leaks` tool (quick spot-check)

macOS ships a leaks command that attaches to a process at exit and reports memory leaks:

# Build first
cargo build

# Run with leak detection
leaks --atExit -- ./target/debug/aether examples/file_io_demo.ae

# Expected output (clean):
# Process 12345: 0 leaks for 0 total leaked bytes

If leaks are reported, the output shows the allocation call stack. A single “leaked” allocation from Rust’s global allocator at startup is normal and can be ignored; leaked Value/Rc allocations are real bugs.

Option 3: Valgrind (Linux CI)

# Install valgrind, then:
valgrind --leak-check=full --error-exitcode=1 \
    ./target/debug/aether examples/file_io_demo.ae

Preventing Rc Cycles — Rules

Pattern	Safe?	Fix
`Rc<Environment>` in closure	✅ Closure captures env before the fn is defined, so no cycle	—
`Instance` holding a method that closes over `self`	⚠️ Potential cycle	Use `Weak<T>` for the self-reference
New `Value` variant holding `Rc<Value>`	⚠️ Review carefully	Ensure no path where `v` contains `Rc<v>`
`FileLines(Rc<RefCell<FileIterState>>)`	✅ No Value references	—

Rule: If a Value variant can transitively point back to itself through Rc chains, break the cycle with Weak<T> on the back-pointer.
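The rule can be illustrated with a small, self-contained sketch. The `Instance` struct here is a hypothetical stand-in, not Aether's real type; it carries both kinds of back-pointer so the two outcomes can be compared side by side.

```rust
use std::cell::RefCell;
use std::rc::{Rc, Weak};

/// Hypothetical stand-in for an instance that can reference itself.
struct Instance {
    // Strong back-pointer: can form a cycle.
    strong_self: RefCell<Option<Rc<Instance>>>,
    // Weak back-pointer: never keeps the instance alive.
    weak_self: RefCell<Weak<Instance>>,
}

fn new_instance() -> Rc<Instance> {
    Rc::new(Instance {
        strong_self: RefCell::new(None),
        weak_self: RefCell::new(Weak::new()),
    })
}

/// Returns true if the instance is still alive after its only owner is dropped.
fn leaks_with_strong_backpointer() -> bool {
    let inst = new_instance();
    *inst.strong_self.borrow_mut() = Some(Rc::clone(&inst)); // cycle: inst -> inst
    let probe = Rc::downgrade(&inst);
    drop(inst);
    probe.upgrade().is_some()
}

/// Same setup, but the self-reference is a `Weak`, so no cycle forms.
fn leaks_with_weak_backpointer() -> bool {
    let inst = new_instance();
    *inst.weak_self.borrow_mut() = Rc::downgrade(&inst);
    let probe = Rc::downgrade(&inst);
    drop(inst);
    probe.upgrade().is_some()
}
```

The first function deliberately leaks one small allocation to demonstrate the failure mode; the second shows that the same shape with `Weak<T>` is freed as soon as the owner goes away.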

Checking Rc Strong Counts in Tests

For precise control, check Rc::strong_count and a Weak probe in unit tests:

use std::rc::Rc;

#[test]
fn test_no_cycle_after_scope_exit() {
    let arr = Rc::new(vec![1i64, 2, 3]);
    assert_eq!(Rc::strong_count(&arr), 1, "unexpected extra owner");
    let weak = Rc::downgrade(&arr);
    drop(arr); // Only owner dropped
    assert!(weak.upgrade().is_none(), "Rc was not freed: possible cycle");
}

Error Handling

Use Result Types

// Good: Return Result for operations that can fail
pub fn parse(tokens: &[Token]) -> Result<Expr, ParseError> {
    // parsing logic
}

// Bad: Using unwrap() or panic!() in production code
pub fn parse(tokens: &[Token]) -> Expr {
    tokens.first().unwrap() // Don't do this!
}
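As a runnable illustration of the "Good" pattern, here is a minimal sketch with stand-in types. `Token`, `Expr`, `ParseError`, and `parse_primary` are simplified assumptions for the example and do not mirror the real parser.

```rust
#[derive(Debug, Clone, PartialEq)]
enum Token {
    Integer(i64),
    Plus,
}

#[derive(Debug, PartialEq)]
enum Expr {
    Integer(i64),
}

#[derive(Debug, PartialEq)]
enum ParseError {
    UnexpectedEof,
    UnexpectedToken(Token),
}

/// Parses a single primary expression from the front of the token slice.
fn parse_primary(tokens: &[Token]) -> Result<Expr, ParseError> {
    // `ok_or` turns the missing-token case into an error instead of a panic.
    let token = tokens.first().ok_or(ParseError::UnexpectedEof)?;
    match token {
        Token::Integer(n) => Ok(Expr::Integer(*n)),
        other => Err(ParseError::UnexpectedToken(other.clone())),
    }
}
```

Empty input and unexpected tokens both come back as `Err` values the caller can report, where the `unwrap()` version would abort the interpreter.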

Custom Error Types

Define clear, specific error types:

#[derive(Debug)]
pub enum LexerError {
    UnexpectedCharacter(char, usize, usize),
    UnterminatedString(usize, usize),
    InvalidNumber(String, usize, usize),
}

#[derive(Debug)]
pub enum RuntimeError {
    UndefinedVariable(String),
    TypeMismatch { expected: String, got: String },
    DivisionByZero,
}

Error Messages

Be specific: “Undefined variable ‘x’ at line 10” vs “Error”
Be helpful: Suggest fixes when possible
Include context: Line numbers, column numbers, surrounding code
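One way to produce messages like these is a `Display` implementation on the error type. The sketch below reuses the `LexerError` variants shown earlier and assumes the two `usize` fields are line and column, which the source does not state; the message wording is illustrative.

```rust
use std::fmt;

#[derive(Debug)]
enum LexerError {
    UnexpectedCharacter(char, usize, usize),
    UnterminatedString(usize, usize),
    InvalidNumber(String, usize, usize),
}

impl fmt::Display for LexerError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            // Assumption: the usize pair is (line, column).
            LexerError::UnexpectedCharacter(ch, line, col) => {
                write!(f, "Unexpected character '{}' at line {}, column {}", ch, line, col)
            }
            LexerError::UnterminatedString(line, col) => write!(
                f,
                "Unterminated string starting at line {}, column {} (missing closing quote)",
                line, col
            ),
            LexerError::InvalidNumber(text, line, col) => {
                write!(f, "Invalid number '{}' at line {}, column {}", text, line, col)
            }
        }
    }
}

// Lets the error flow through `Box<dyn Error>` and `?` in callers.
impl std::error::Error for LexerError {}
```

Keeping the position data in the variant and the wording in `Display` means the CLI and the REPL can share one message format.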

Code Style

Rust Idioms

Use match for exhaustive pattern matching
Prefer if let for single-pattern matches
Use iterators instead of explicit loops where appropriate
Leverage the type system for safety
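A short sketch of the first three idioms on a cut-down `Value` enum. The enum and the type-name strings are simplified assumptions, not the interpreter's real definitions.

```rust
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Int(i64),
    Float(f64),
    Null,
}

/// `match` is exhaustive: adding a variant forces this function to be updated.
fn type_name(value: &Value) -> &'static str {
    match value {
        Value::Int(_) => "int",
        Value::Float(_) => "float",
        Value::Null => "null",
    }
}

/// `if let` reads better than `match` when only one pattern matters.
fn as_int(value: &Value) -> Option<i64> {
    if let Value::Int(n) = value {
        Some(*n)
    } else {
        None
    }
}

/// An iterator chain replaces an index loop with a mutable accumulator.
fn sum_ints(values: &[Value]) -> i64 {
    values.iter().filter_map(as_int).sum()
}
```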

Naming Conventions

snake_case for functions and variables
PascalCase for types and enums
SCREAMING_SNAKE_CASE for constants
Clear, descriptive names (e.g., tokenize_number vs tn)

Documentation

/// Tokenizes a string of Aether source code into tokens.
///
/// # Arguments
/// * `input` - The source code to tokenize
///
/// # Returns
/// A vector of tokens or a lexer error
///
/// # Example
/// ```
/// let tokens = tokenize("let x = 42")?;
/// ```
pub fn tokenize(input: &str) -> Result<Vec<Token>, LexerError> {
    // implementation
}

Incremental Development

Build in Small Steps

Don’t create all files at once
Implement one feature completely before moving to the next
Verify each step works before proceeding

Phase 1 Example

Define basic token types (integers, operators)
Write tests for tokenizing integers
Implement integer tokenization
Write tests for tokenizing operators
Implement operator tokenization
Test complete tokenization of simple expressions
Only then move to the next feature

Commit Frequency

Commit after each working feature
Keep commits focused and atomic
Write clear commit messages explaining the “why”

Code Quality

Before Committing

cargo fmt                              # Format code
cargo clippy                           # Run linter
cargo test -- --test-threads=1        # Run all tests sequentially
cargo test --test gc_test -- --test-threads=1  # Memory/GC tests
cargo build --release                  # Ensure release build works

See the Post-Feature Checklist for the full per-feature gate.

Clippy Warnings

Address all clippy warnings
Use #[allow(clippy::...)] sparingly and with justification
Prefer fixing the code over silencing warnings

Code Review Checklist

All tests pass (--test-threads=1)
New code has tests AND an example program
GC / memory leak check done (see Memory Leak Detection)
Relevant docs updated (CLAUDE.md, component doc, BACKLOG.md)
Error handling is robust
No clippy warnings
Code follows Rust idioms
Commit messages are descriptive

Performance

Start Simple

Correctness first, performance second
Profile before optimizing
Don’t prematurely optimize

When Optimizing

Use cargo bench for benchmarks
Profile with cargo flamegraph or similar tools
Document why optimizations are necessary

Key Design Decisions (with rationale)

`Rc<Stmt>` for function bodies — not `Box<Stmt>`

Function bodies are stored as Rc<Stmt> so that cloning a Value::Function (which happens on every function call because functions live in the environment) only increments a reference count rather than deep-copying the entire AST. Changing this back to Box<Stmt> would cause a ~41% slowdown in recursive workloads.
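The effect can be seen in a small sketch. `Stmt` and `Function` here are hypothetical, heavily simplified stand-ins; the real types carry far more data.

```rust
use std::rc::Rc;

// Note: no `Clone` derive. Sharing through Rc means the AST never needs one.
#[derive(Debug)]
enum Stmt {
    Block(Vec<Stmt>),
    Return,
}

#[derive(Debug, Clone)]
struct Function {
    body: Rc<Stmt>,
}

/// Counts AST nodes: the work a deep copy of a `Box<Stmt>` body would repeat
/// on every clone.
fn node_count(stmt: &Stmt) -> usize {
    match stmt {
        Stmt::Block(stmts) => 1 + stmts.iter().map(node_count).sum::<usize>(),
        Stmt::Return => 1,
    }
}

/// Cloning the function bumps the Rc strong count and never touches the AST.
fn clone_for_call(func: &Function) -> Function {
    func.clone()
}
```

Both the original and the clone point at the same allocation, so the cost of a call does not grow with the size of the function body.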

`Rc<Vec<Value>>` for arrays — not `Vec<Value>`

Arrays use reference counting so cloning is O(1). Rc::make_mut is used for mutations: it mutates in-place when there is only one owner, and copies on write when shared. Never use (**arr).to_vec() to clone-then-mutate — use Rc::make_mut instead.
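A minimal sketch of the copy-on-write behaviour, using `Rc<Vec<i64>>` in place of `Rc<Vec<Value>>` to stay self-contained. The helper name is hypothetical; `Rc::make_mut` and `Rc::as_ptr` are standard library calls.

```rust
use std::rc::Rc;

/// Pushes onto the array and reports whether `Rc::make_mut` had to copy.
fn push_reports_copy(arr: &mut Rc<Vec<i64>>, item: i64) -> bool {
    let before = Rc::as_ptr(arr);
    // In place when `arr` is the only owner; clones the Vec when it is shared.
    Rc::make_mut(arr).push(item);
    before != Rc::as_ptr(arr)
}
```

With a single owner the function returns `false` because the mutation happened in place. Once a second `Rc` points at the same array, the next push returns `true`, and the other owner keeps seeing the old contents.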

`std::mem::swap` for call frames — not `env.clone()`

Function call setup swaps the current environment pointer with a fresh call frame (std::mem::swap). This is O(1). The previous approach (saved_env = self.environment.clone()) was O(n) in the size of the environment. Never reintroduce saved_env = self.environment.clone() for call frame management.
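The swap pattern might look roughly like this. `Environment`, `call_with_frame`, and `lookup` are hypothetical and much simpler than the real evaluator, which also has to deal with scope chains and early returns.

```rust
use std::collections::HashMap;

#[derive(Default)]
struct Environment {
    vars: HashMap<String, i64>,
}

struct Evaluator {
    environment: Environment,
}

impl Evaluator {
    /// Runs `body` with `frame` installed as the current environment.
    fn call_with_frame<R>(
        &mut self,
        mut frame: Environment,
        body: impl FnOnce(&mut Self) -> R,
    ) -> R {
        // O(1): after the swap, `frame` holds the caller's environment.
        std::mem::swap(&mut self.environment, &mut frame);
        let result = body(self);
        // Swap back to restore the caller's environment.
        std::mem::swap(&mut self.environment, &mut frame);
        result
    }

    fn lookup(&self, name: &str) -> Option<i64> {
        self.environment.vars.get(name).copied()
    }
}
```

No map is copied at any point: the two swaps only exchange the structs' contents, which is why the cost does not depend on how many variables the caller has defined.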

`Rc<String>` for strings

Same reasoning as arrays — cheap clone, shared immutable data.

Running Benchmarks

cargo bench --bench interpreter_bench

Benchmarks are in benches/interpreter_bench.rs and cover:

arithmetic_loop_10k — tight numeric loops
fibonacci_20 — deep recursive calls
scope_lookups_5k — variable lookup through scopes
string_ops_1k — string concatenation
array_ops_1k — push/index operations
many_fn_calls_5k — function call overhead

Memory Safety

Aether uses Rc<T> (reference counting) for GC, not Arc<T> (atomic ref counting), because the interpreter is single-threaded. Rc is cheaper but not thread-safe — do not add Send/Sync bounds or use Arc without good reason.

Avoiding Rc cycles (memory leaks): Closures capture the environment before the function is defined in it, so there is no cycle between a function value and the environment it lives in. If you add a new feature that allows an object to reference itself, use Weak<T> for the back-pointer to break the cycle.

Debugging

Debug Output

// Use Debug trait for development
#[derive(Debug)]
pub struct Token {
    pub kind: TokenKind,
    pub lexeme: String,
    pub line: usize,
}

// Enable detailed debugging
println!("{:?}", token);  // Compact
println!("{:#?}", token); // Pretty-printed

REPL for Testing

Use the REPL to quickly test language features
Add debug commands (e.g., _tokens, _ast) to inspect internals

Documentation

Keep Updated

Update docs/DESIGN.md when making design changes
Document design decisions and trade-offs
Keep CLAUDE.md current with project status

Code Comments

Explain why, not what
Document complex algorithms
Add examples for non-obvious code

Dependencies

Minimize Dependencies

Prefer standard library when possible
Only add dependencies that provide significant value
Review and understand dependencies before adding

Useful Dependencies

Consider these for the interpreter:

clap - Command-line argument parsing
rustyline - REPL readline support
colored - Terminal colors for errors

Continuous Integration

Future CI Setup

When setting up CI, include:

Run tests on multiple platforms
Check formatting (cargo fmt --check)
Run clippy (cargo clippy -- -D warnings)
Build documentation (cargo doc --no-deps)

Common Pitfalls When Extending Aether

1. Forgetting Rc Wrappers

Problem: Creating values without using helper methods.

❌ Wrong:

Value::String("hello".to_string())  // Compilation error - expects Rc<String>
Value::Array(vec![Value::Int(1)])   // Compilation error - expects Rc<Vec<Value>>

✅ Right:

Value::string("hello".to_string())  // Uses helper method
Value::array(vec![Value::Int(1)])   // Uses helper method

Why: Strings and Arrays are wrapped in Rc<T> for garbage collection. Always use the helper methods Value::string() and Value::array().
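For orientation, helper constructors of this kind could be as small as the sketch below. The cut-down `Value` enum and the exact signatures are assumptions; check src/interpreter/value.rs for the real definitions.

```rust
use std::rc::Rc;

#[derive(Debug, Clone, PartialEq)]
enum Value {
    Int(i64),
    String(Rc<String>),
    Array(Rc<Vec<Value>>),
}

impl Value {
    /// Wraps the string in an Rc so callers never build the variant by hand.
    fn string(s: impl Into<String>) -> Self {
        Value::String(Rc::new(s.into()))
    }

    /// Wraps the items in an Rc; cloning the resulting value is then O(1).
    fn array(items: Vec<Value>) -> Self {
        Value::Array(Rc::new(items))
    }
}
```

Routing construction through helpers keeps the `Rc` wrapping in one place, so a later change of representation touches the helpers and not every call site.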

2. Not Using --test-threads=1

Problem: Running tests without limiting thread count causes memory pressure.

❌ Wrong:

cargo test  # May cause OOM on large test suites

✅ Right:

cargo test -- --test-threads=1  # Sequential execution, lower memory usage

Why: Parallel test execution can consume excessive memory (135 GB observed without GC). Sequential execution is safer.

3. Adding Built-ins Instead of Stdlib

Problem: Implementing features in Rust that could be written in Aether.

❌ Wrong: Adding max() function to src/interpreter/builtins.rs

✅ Right: Implementing max() in stdlib/math.ae

Why: Stdlib functions are:

User-readable and modifiable
Easier to maintain and test
Prove the language is expressive enough

Rule: If it can be written in Aether, it belongs in stdlib!

4. Forgetting Optional Parameter Support

Problem: Not handling null for optional parameters.

❌ Wrong:

fn range(start, end) {
    // Assumes both parameters are always provided
    let i = start
    while (i < end) { ... }
}

✅ Right:

fn range(start, end) {
    // Handle single argument: range(n) -> range(0, n)
    if (end == null) {
        end = start
        start = 0
    }
    let i = start
    while (i < end) { ... }
}

Why: Aether supports optional parameters. Functions should handle null for missing arguments.

5. Not Testing Edge Cases

Problem: Only testing the happy path.

❌ Insufficient:

#[test]
fn test_division() {
    assert_eq!(eval("10 / 2").unwrap(), Value::Int(5));
}

✅ Comprehensive:

#[test]
fn test_division() {
    assert_eq!(eval("10 / 2").unwrap(), Value::Int(5));
    // Compare floats with a tolerance, never with ==
    match eval("10.0 / 3.0").unwrap() {
        Value::Float(f) => assert!((f - 10.0 / 3.0).abs() < 1e-9),
        other => panic!("expected Float, got {:?}", other),
    }
    assert!(eval("10 / 0").is_err());  // Division by zero
    assert!(eval("10 / null").is_err()); // Type error
}

Why: Edge cases include:

Empty arrays/strings
Null values
Type mismatches
Division by zero
Out of bounds access
Negative numbers

Always test error conditions, not just success cases!

6. Mutating Rc-wrapped Values Incorrectly

Problem: Trying to mutate shared data directly, or cloning the whole Vec to work around it.

❌ Wrong:

if let Value::Array(arr) = &mut value {
    arr.push(Value::Int(42));  // Error: cannot borrow data in an Rc as mutable
}

✅ Right:

if let Value::Array(arr) = &mut value {
    // In place when this is the only owner, copy-on-write when shared
    Rc::make_mut(arr).push(Value::Int(42));
}

Why: Rc<T> provides shared ownership but not interior mutability. Rc::make_mut hands out a mutable reference safely: it mutates in place when there is only one owner and clones the data only when it is shared. This matches the array rule under Key Design Decisions: never clone-then-mutate.

7. Not Using TDD

Problem: Writing implementation before tests.

❌ Wrong:

Write entire feature
Write tests later
Find bugs
Fix and repeat

✅ Right (TDD workflow):

Write failing test
Write minimal code to pass
Refactor
Repeat

Why: TDD ensures:

Clear requirements before coding
All code is tested
Easier debugging (smaller changes)
Better design (testable code)

Learning Resources

Rust Interpreter Resources

“Crafting Interpreters” by Robert Nystrom
“Writing An Interpreter In Go” by Thorsten Ball
Rust Book: https://doc.rust-lang.org/book/
Rust by Example: https://doc.rust-lang.org/rust-by-example/

Last Updated: April 17, 2026
Phase: 5 Complete (base)
Status: Comprehensive development guidelines with TDD workflow and common pitfalls
