evaluclaude-harness/AGENTS.md

# Evaluclaude Harness - Agent Instructions

## Project Overview

This is a CLI tool for generating evaluation tests for codebases using Claude. The core philosophy is "Zero-to-evals in one command."

## Commands

```bash
# Build the project
npm run build

# Run typecheck
npm run typecheck

# Run tests
npm test

# Run the CLI
npm start -- intro <path>
```

## Project Structure

```
src/
├── cli/              # Commander.js CLI
├── introspector/     # Tree-sitter codebase parsing (NO LLM)
│   ├── parsers/      # Language-specific parsers
│   ├── scanner.ts    # File discovery
│   ├── git.ts        # Git integration
│   └── summarizer.ts # Main analysis logic
└── index.ts          # Main exports
```

## Key Principles

1. **Tree-sitter for introspection**: Never send raw code to Claude for structure extraction
2. **Claude generates specs, not code**: EvalSpec JSON is generated by Claude, test code is rendered deterministically
3. **Git-aware incremental**: Only re-analyze changed files

## Dependencies

- `tree-sitter`: Native AST parsing
- `tree-sitter-python`: Python grammar
- `tree-sitter-typescript`: TypeScript grammar
- `commander`: CLI framework
- `glob`: File pattern matching

## Testing

Use vitest for testing. Test files go in `tests/` directory.