mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-15 06:04:40 +00:00

History

Mario Zechner 9544a8edf9 Display tool call metrics: Add ⚒ counter to token usage display - Show tool call count alongside token metrics in TUI and console renderers - TUI: Display at bottom with format "↑X ↓Y ⚒Z" - Console: Show metrics after assistant messages complete - Counter increments on each tool_call event		2025-08-09 20:10:15 +02:00
..
src	Display tool call metrics: Add ⚒ counter to token usage display	2025-08-09 20:10:15 +02:00
package-lock.json	v0.5.4 - Fix CLI execution when imported and JSON import issues	2025-08-09 19:38:00 +02:00
package.json	v0.5.4 - Fix CLI execution when imported and JSON import issues	2025-08-09 19:38:00 +02:00
README.md	Initial monorepo setup with npm workspaces and dual TypeScript configuration	2025-08-09 17:18:38 +02:00
tsconfig.build.json	Initial monorepo setup with npm workspaces and dual TypeScript configuration	2025-08-09 17:18:38 +02:00

README.md

pi-agent

A general-purpose agent with tool calling and session persistence, modeled after Claude Code but extremely hackable and minimal. It comes with a built-in TUI (also modeled after Claude Code) for interactive use.

Everything is designed to be easy:

Writing custom UIs on top of it (via JSON mode in any language or the TypeScript API)
Using it for inference steps in deterministic programs (via JSON mode in any language or the TypeScript API)
Providing your own system prompts and tools
Working with various LLM providers or self-hosted LLMs

Installation

npm install -g @mariozechner/pi-agent

This installs the pi-agent command globally.

Quick Start

By default, pi-agent uses OpenAI's API with model gpt-5-mini and authenticates using the OPENAI_API_KEY environment variable. Any OpenAI-compatible endpoint works, including Ollama, vLLM, OpenRouter, Groq, Anthropic, etc.

# Single message
pi-agent "What is 2+2?"

# Multiple messages processed sequentially
pi-agent "What is 2+2?" "What about 3+3?"

# Interactive chat mode (no messages = interactive)
pi-agent

# Continue most recently modified session in current directory
pi-agent --continue "Follow up question"

# GPT-OSS via Groq
pi-agent --base-url https://api.groq.com/openai/v1 --api-key $GROQ_API_KEY --model openai/gpt-oss-120b

# GLM 4.5 via OpenRouter
pi-agent --base-url https://openrouter.ai/api/v1 --api-key $OPENROUTER_API_KEY --model z-ai/glm-4.5

# Claude via Anthropic (no prompt caching support - see https://docs.anthropic.com/en/api/openai-sdk)
pi-agent --base-url https://api.anthropic.com/v1 --api-key $ANTHROPIC_API_KEY --model claude-opus-4-1-20250805

Usage Modes

Single-Shot Mode

Process one or more messages and exit:

pi-agent "First question" "Second question"

Interactive Mode

Start an interactive chat session:

pi-agent

Type messages and press Enter to send
Type exit or quit to end session
Press Escape to interrupt while processing
Press CTRL+C to clear the text editor
Press CTRL+C twice quickly to exit

JSON Mode

JSON mode enables programmatic integration by outputting events as JSONL (JSON Lines).

Single-shot mode: Outputs a stream of JSON events for each message, then exits.

pi-agent --json "What is 2+2?" "And the meaning of life?"
# Outputs: {"type":"session_start","sessionId":"bb6f0acb-80cf-4729-9593-bcf804431a53","model":"gpt-5-mini","api":"completions","baseURL":"https://api.openai.com/v1","systemPrompt":"You are a helpful assistant."} {"type":"user_message","text":"What is 2+2?"} {"type":"assistant_start"} {"type":"token_usage","inputTokens":314,"outputTokens":16,"totalTokens":330,"cacheReadTokens":0,"cacheWriteTokens":0} {"type":"assistant_message","text":"2 + 2 = 4"} {"type":"user_message","text":"And the meaning of life?"} {"type":"assistant_start"} {"type":"token_usage","inputTokens":337,"outputTokens":331,"totalTokens":668,"cacheReadTokens":0,"cacheWriteTokens":0} {"type":"assistant_message","text":"Short answer (pop-culture): 42.\n\nMore useful answers:\n- Philosophical...

Interactive mode: Accepts JSON commands via stdin and outputs JSON events to stdout.

# Start interactive JSON mode
pi-agent --json
# Now send commands via stdin

# Pipe one or more initial messages in
(echo '{"type": "message", "content": "What is 2+2?"}'; cat) | pi-agent --json
# Outputs: {"type":"session_start","sessionId":"bb64cfbe-dd52-4662-bd4a-0d921c332fd1","model":"gpt-5-mini","api":"completions","baseURL":"https://api.openai.com/v1","systemPrompt":"You are a helpful assistant."} {"type":"user_message","text":"What is 2+2?"} {"type":"assistant_start"} {"type":"token_usage","inputTokens":314,"outputTokens":16,"totalTokens":330,"cacheReadTokens":0,"cacheWriteTokens":0} {"type":"assistant_message","text":"2 + 2 = 4"}

Commands you can send via stdin in interactive JSON mode:

{"type": "message", "content": "Your message here"}  // Send a message to the agent
{"type": "interrupt"}                                 // Interrupt current processing

Configuration

Command Line Options

--base-url <url>        API base URL (default: https://api.openai.com/v1)
--api-key <key>         API key (or set OPENAI_API_KEY env var)
--model <model>         Model name (default: gpt-4o-mini)
--api <type>            API type: "completions" or "responses" (default: completions)
--system-prompt <text>  System prompt (default: "You are a helpful assistant.")
--continue              Continue previous session
--json                  JSON mode
--help, -h              Show help message

Environment Variables

OPENAI_API_KEY - OpenAI API key (used if --api-key not provided)

Session Persistence

Sessions are automatically saved to ~/.pi/sessions/ and include:

Complete conversation history
Tool call results
Token usage statistics

Use --continue to resume the last session:

pi-agent "Start a story about a robot"
# ... later ...
pi-agent --continue "Continue the story"

Tools

The agent includes built-in tools for file system operations:

read_file - Read file contents
list_directory - List directory contents
bash - Execute shell commands
glob - Find files by pattern
ripgrep - Search file contents

These tools are automatically available when using the agent through the pi command for code navigation tasks.

JSON Mode Events

When using --json, the agent outputs these event types:

session_start - New session started with metadata
user_message - User input
assistant_start - Assistant begins responding
assistant_message - Assistant's response
thinking - Reasoning/thinking (for models that support it)
tool_call - Tool being called
tool_result - Result from tool
token_usage - Token usage statistics
error - Error occurred
interrupted - Processing was interrupted

The complete TypeScript type definition for AgentEvent can be found in src/agent.ts.

Build an Interactive UI with JSON Mode

Build custom UIs in any language by spawning pi-agent in JSON mode and communicating via stdin/stdout.

import { spawn } from 'child_process';
import { createInterface } from 'readline';

// Start the agent in JSON mode
const agent = spawn('pi-agent', ['--json']);

// Create readline interface for parsing JSONL output from agent
const agentOutput = createInterface({input: agent.stdout, crlfDelay: Infinity});

// Create readline interface for user input
const userInput = createInterface({input: process.stdin, output: process.stdout});

// State tracking
let isProcessing = false, lastUsage, isExiting = false;

// Handle each line of JSON output from agent
agentOutput.on('line', (line) => {
    try {
      const event = JSON.parse(line);

      // Handle all event types
      switch (event.type) {
        case 'session_start':
          console.log(`Session started (${event.model}, ${event.api}, ${event.baseURL})`);
          console.log('Press CTRL + C to exit');
          promptUser();
          break;

        case 'user_message':
          // Already shown in prompt, skip
          break;

        case 'assistant_start':
          isProcessing = true;
          console.log('\n[assistant]');
          break;

        case 'thinking':
          console.log(`[thinking]\n${event.text}\n`);
          break;

        case 'tool_call':
          console.log(`[tool] ${event.name}(${event.args.substring(0, 50)})\n`);
          break;

        case 'tool_result':
            const lines = event.result.split('\n');
            const truncated = lines.length - 5 > 0 ? `\n.  ... (${lines.length - 5} more lines truncated)` : '';
            console.log(`[tool result]\n${lines.slice(0, 5).join('\n')}${truncated}\n`);
          break;

        case 'assistant_message':
          console.log(event.text.trim());
          isProcessing = false;
          promptUser();
          break;

        case 'token_usage':
          lastUsage = event;
          break;

        case 'error':
          console.error('\n❌ Error:', event.message);
          isProcessing = false;
          promptUser();
          break;

        case 'interrupted':
          console.log('\n⚠️  Interrupted by user');
          isProcessing = false;
          promptUser();
          break;
      }
    } catch (e) {
      console.error('Failed to parse JSON:', line, e);
    }
});

// Send a message to the agent
function sendMessage(content) {
  agent.stdin.write(`${JSON.stringify({type: 'message', content: content})}\n`);
}

// Send interrupt signal
function interrupt() {
  agent.stdin.write(`${JSON.stringify({type: 'interrupt'})}\n`);
}

// Prompt for user input
function promptUser() {
  if (isExiting) return;

  if (lastUsage) {
    console.log(`\nin: ${lastUsage.inputTokens}, out: ${lastUsage.outputTokens}, cache read: ${lastUsage.cacheReadTokens}, cache write: ${lastUsage.cacheWriteTokens}`);
  }

  userInput.question('\n[user]\n> ', (answer) => {
    answer = answer.trim();
    if (answer) {
      sendMessage(answer);
    } else {
      promptUser();
    }
  });
}

// Handle Ctrl+C
process.on('SIGINT', () => {
  if (isProcessing) {
    interrupt();
  } else {
    agent.kill();
    process.exit(0);
  }
});

// Handle agent exit
agent.on('close', (code) => {
  isExiting = true;
  userInput.close();
  console.log(`\nAgent exited with code ${code}`);
  process.exit(code);
});

// Handle errors
agent.on('error', (err) => {
  console.error('Failed to start agent:', err);
  process.exit(1);
});

// Start the conversation
console.log('Pi Agent Interactive Chat');

Architecture

The agent is built with:

agent.ts - Core Agent class and API functions
cli.ts - CLI entry point, argument parsing, and JSON mode handler
args.ts - Custom typed argument parser
session-manager.ts - Session persistence
tools/ - Tool implementations
renderers/ - Output formatters (console, TUI, JSON)

Development

# Run from source
npx tsx src/cli.ts "Hello"

# Build
npm run build

# Run built version
dist/cli.js "Hello"

Use as a Library

import { Agent, ConsoleRenderer } from '@mariozechner/pi-agent';

const agent = new Agent({
  apiKey: process.env.OPENAI_API_KEY,
  baseURL: 'https://api.openai.com/v1',
  model: 'gpt-5-mini',
  api: 'completions',
  systemPrompt: 'You are a helpful assistant.'
}, new ConsoleRenderer());

await agent.ask('What is 2+2?');

README.md Unescape Escape