mirror of
https://github.com/harivansh-afk/sandbox-agent.git
synced 2026-04-15 07:04:48 +00:00
75 lines
5.4 KiB
Markdown
75 lines
5.4 KiB
Markdown
# Instructions
|
|
|
|
## SDK Modes
|
|
|
|
There are two ways to work with the SDKs:
|
|
|
|
- **Embedded**: Spawns the `sandbox-agent` server as a subprocess on a unique port and communicates with it locally. Useful for local development or when running the SDK and agent in the same environment.
|
|
- **Server**: Connects to a remotely running `sandbox-agent` server. The server is typically running inside a sandbox (e.g., Docker, E2B, Daytona, Vercel Sandboxes) and the SDK connects to it over HTTP.
|
|
|
|
## Agent Schemas
|
|
|
|
Agent schemas (Claude Code, Codex, OpenCode, Amp) are available for reference in `resources/agent-schemas/artifacts/json-schema/`.
|
|
|
|
Extraction methods:
|
|
- **Claude**: Uses `claude --output-format json --json-schema` CLI command
|
|
- **Codex**: Uses `codex app-server generate-json-schema` CLI command
|
|
- **OpenCode**: Fetches from GitHub OpenAPI spec
|
|
- **Amp**: Scrapes from `https://ampcode.com/manual/appendix?preview#message-schema`
|
|
|
|
All extractors have fallback schemas for when CLI/URL is unavailable.
|
|
|
|
Research on how different agents operate (CLI flags, streaming formats, HITL patterns, etc.) is in `research/agents/`. When adding or making changes to agent docs, follow the same structure as existing files.
|
|
|
|
Universal schema guidance:
|
|
- The universal schema should cover the full feature set of all agents.
|
|
- Conversions must be best-effort overlap without being lossy; preserve raw payloads when needed.
|
|
- **The mock agent acts as the reference implementation** for correct event behavior. Real agents should use synthetic events to match the mock agent's event patterns (e.g., emitting both daemon synthetic and agent native `session.started` events, proper `item.started` → `item.delta` → `item.completed` sequences).
|
|
|
|
## Spec Tracking
|
|
|
|
- Keep CLI subcommands in sync with every HTTP endpoint.
|
|
- Update `CLAUDE.md` to keep CLI endpoints in sync with HTTP API changes.
|
|
- When adding or modifying CLI commands, update `docs/cli.mdx` to reflect the changes.
|
|
- When changing the HTTP API, update the TypeScript SDK and CLI together.
|
|
- Do not make breaking changes to API endpoints.
|
|
- When changing API routes, ensure the HTTP/SSE test suite has full coverage of every route.
|
|
- When agent schema changes, ensure API tests cover the new schema and event shapes end-to-end.
|
|
- When the universal schema changes, update mock-agent events to cover the new fields or event types.
|
|
- Update `docs/conversion.md` whenever agent-native schema terms, synthetic events, identifier mappings, or conversion logic change.
|
|
- Never use synthetic data or mocked responses in tests.
|
|
- Never manually write agent types; always use generated types in `resources/agent-schemas/`. If types are broken, fix the generated types.
|
|
- The universal schema must provide consistent behavior across providers; avoid requiring frontend/client logic to special-case agents.
|
|
- The UI must reflect every field in AgentCapabilities; keep it in sync with `docs/session-transcript-schema.mdx` and `agent_capabilities_for`.
|
|
- When parsing agent data, if something is unexpected or does not match the schema, bail out and surface the error rather than trying to continue with partial parsing.
|
|
- When defining the universal schema, choose the option most compatible with native agent APIs, and add synthetics to fill gaps for other agents.
|
|
- Use `docs/session-transcript-schema.mdx` as the source of truth for schema terminology and keep it updated alongside schema changes.
|
|
- On parse failures, emit an `agent.unparsed` event (source=daemon, synthetic=true) and treat it as a test failure. Preserve raw payloads when `include_raw=true`.
|
|
- Track subagent support in `docs/conversion.md`. For now, normalize subagent activity into normal message/tool flow, but revisit explicit subagent modeling later.
|
|
- Keep the FAQ in `README.md` and `frontend/packages/website/src/components/FAQ.tsx` in sync. When adding or modifying FAQ entries, update both files.
|
|
|
|
### CLI ⇄ HTTP endpoint map (keep in sync)
|
|
|
|
- `sandbox-agent api agents list` ↔ `GET /v1/agents`
|
|
- `sandbox-agent api agents install` ↔ `POST /v1/agents/{agent}/install`
|
|
- `sandbox-agent api agents modes` ↔ `GET /v1/agents/{agent}/modes`
|
|
- `sandbox-agent api sessions list` ↔ `GET /v1/sessions`
|
|
- `sandbox-agent api sessions create` ↔ `POST /v1/sessions/{sessionId}`
|
|
- `sandbox-agent api sessions send-message` ↔ `POST /v1/sessions/{sessionId}/messages`
|
|
- `sandbox-agent api sessions send-message-stream` ↔ `POST /v1/sessions/{sessionId}/messages/stream`
|
|
- `sandbox-agent api sessions terminate` ↔ `POST /v1/sessions/{sessionId}/terminate`
|
|
- `sandbox-agent api sessions events` / `get-messages` ↔ `GET /v1/sessions/{sessionId}/events`
|
|
- `sandbox-agent api sessions events-sse` ↔ `GET /v1/sessions/{sessionId}/events/sse`
|
|
- `sandbox-agent api sessions reply-question` ↔ `POST /v1/sessions/{sessionId}/questions/{questionId}/reply`
|
|
- `sandbox-agent api sessions reject-question` ↔ `POST /v1/sessions/{sessionId}/questions/{questionId}/reject`
|
|
- `sandbox-agent api sessions reply-permission` ↔ `POST /v1/sessions/{sessionId}/permissions/{permissionId}/reply`
|
|
|
|
## Post-Release Testing
|
|
|
|
After cutting a release, verify the release works correctly. Run `/project:post-release-testing` to execute the testing agent.
|
|
|
|
## Git Commits
|
|
|
|
- Do not include any co-authors in commit messages (no `Co-Authored-By` lines)
|
|
- Use conventional commits style (e.g., `feat:`, `fix:`, `docs:`, `chore:`, `refactor:`)
|
|
- Keep commit messages to a single line
|