co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-15 21:03:19 +00:00

Author	SHA1	Message	Date
Mario Zechner	d5fd685901	Enable more biome lints and fix things	2025-12-21 22:56:20 +01:00
Mario Zechner	ace3563f0e	Add tests for Gemini 3 models with thinkingLevel	2025-12-21 20:34:39 +01:00
Peter Steinberger	117af076c4	feat(ai): interrupt tool batch on queued messages	2025-12-20 21:34:53 +01:00
Mario Zechner	6a319f9c3c	Add OAuth providers to test suite and improve test coverage Tests updated: - abort.test.ts: Add Google Gemini CLI, add retries - agent.test.ts: Add OAuth providers (Anthropic, GitHub Copilot, Gemini CLI, Antigravity), add retries, remove timeouts - context-overflow.test.ts: Handle Cerebras 429 status code - image-tool-result.test.ts: Add OAuth providers - overflow.ts: Detect 429 as overflow for Cerebras Removed obsolete debug/one-off tests: - copilot-initiator.test.ts - gemini-3-flash-tool-calling.test.ts - google-thought-signature.test.ts - mistral-debug.test.ts - mistral-empty-assistant.test.ts - mistral-sdk.test.ts	2025-12-20 21:34:19 +01:00
Mario Zechner	fb1fdb6006	Fix orphaned tool calls by inserting synthetic empty results When a user interrupts a tool call flow (sends a message without providing tool results), APIs like OpenAI Responses and Anthropic fail because: - OpenAI requires tool outputs for function calls - OpenAI requires reasoning items to have their following items - Anthropic requires non-empty content for error tool results Instead of filtering out orphaned tool calls (which breaks thinking signatures), we now insert synthetic empty tool results with isError: true and content 'No result provided'. This preserves the conversation structure and satisfies all API requirements.	2025-12-20 21:34:19 +01:00
Mario Zechner	95fcda5887	Broader testing, more providers.	2025-12-20 21:34:19 +01:00
Mario Zechner	c359023c3f	Add Google Gemini CLI and Antigravity OAuth providers - Add google-gemini-cli provider: free Gemini 2.0/2.5 via Cloud Code Assist - Add google-antigravity provider: free Gemini 3, Claude, GPT-OSS via sandbox - Move OAuth infrastructure from coding-agent to ai package - Fix thinking signature handling for cross-model handoff - Fix OpenAI message ID length limit (max 64 chars) - Add GitHub Copilot overflow pattern detection - Add OAuth provider tests for context overflow and streaming	2025-12-20 21:34:18 +01:00
Mario Zechner	575dcb2676	Fix X-Initiator header logic for GitHub Copilot Check last message role instead of any message in history. This matches the original correct implementation from PR #200. fixes #209	2025-12-19 05:08:28 +01:00
Mario Zechner	84018b0707	fix(ai): correct Gemini tool result format and improve type safety - Fix tool result format for Gemini 3 Flash Preview compatibility - Use 'output' key for successful results (not 'result') - Use 'error' key for error results (not 'isError') - Per Google SDK documentation for FunctionResponse.response - Improve type safety in google.ts provider - Add ImageContent import and use proper type guards - Replace 'as any' casts with proper typing - Import and use Schema type for tool parameters - Add proper typing for index deletion in error handler - Add comprehensive test for Gemini 3 Flash tool calling - Tests successful tool call and result handling - Tests error tool result handling - Verifies fix for issue #213 Fixes #213	2025-12-18 13:43:39 +00:00
Mario Zechner	4894fa411c	Release v0.23.2 Fixed Claude models via GitHub Copilot re-answering all previous prompts. fixes #209	2025-12-17 17:56:00 +01:00
Mario Zechner	e1ce9c1f49	Fix image limits test to use realistic payload sizes Previous test used compressed 8k images (0.01MB) which was meaningless. Now tests with actual large noise images that don't compress. Realistic payload limits discovered: - Anthropic: 6 x 3MB = ~18MB total (not 32MB as documented) - OpenAI: 2 x 15MB = ~30MB total - Gemini: 10 x 20MB = ~200MB total (very permissive) - Mistral: 4 x 10MB = ~40MB total - xAI: 1 x 20MB (strict request size limit) - Groq: 5 x 5760px images (5 image + pixel limit) - zAI: 2 x 15MB = ~30MB (50MB request limit) - OpenRouter: 2 x 5MB = ~10MB total Also fixed GEMINI_API_KEY env var (was GOOGLE_API_KEY). Related to #120	2025-12-16 23:48:59 +01:00
Mario Zechner	043a8416b0	Update image limits test with comprehensive 8k stress test results Tested max 8kx8k images per provider: - Anthropic: 100 (explicit limit, fails at 101) - OpenAI: 100-200 (100 works, 200 times out) - Mistral: 8 (explicit limit, fails at 9) - xAI: 100-150 (100 works, 150 times out) - Groq: 0 (8k exceeds 33M pixel limit) - zAI: 400 (context window limited at 500) - OpenRouter: 40 (context window limited at 50) - Gemini: untested (no API key in test env) Key finding: Anthropic's 'many images' rule does NOT cause API errors. 100 x 8kx8k images work fine. Anthropic likely auto-resizes internally. Related to #120	2025-12-16 23:01:46 +01:00
Mario Zechner	f1df52ccfd	Add comprehensive image limits test suite for all vision-capable providers Tests max image count, size, dimensions, and 8k stress test for: - Anthropic, OpenAI, Gemini, Mistral, OpenRouter, xAI, Groq, zAI Key finding: Anthropic's 'many images' rule (>20 images = 2000px max) does NOT cause API errors. 100 x 8k images work fine. Anthropic likely auto-resizes internally. Related to #120	2025-12-16 22:21:48 +01:00
Mario Zechner	3c9c47d3bb	ai: add image limits test suite Tests provider-specific image limitations across all supported providers: - Maximum number of images in context - Maximum image size (bytes) - Maximum image dimensions Discovered limits (Dec 2025): - Anthropic: 100 images, 5MB per image, 8000px max dimension - OpenAI: 500 images, >=25MB per image - Gemini: ~2500 images, >=40MB per image - Mistral: 8 images, ~15MB per image - OpenRouter: ~40 images (context limited), ~15MB per image	2025-12-16 20:04:34 +01:00
Ahmed Kamal	c2dea0ce8b	Add X-Initiator header for GitHub Copilot (#200)	2025-12-16 14:05:22 +01:00
Mario Zechner	f8550a536e	Read GitHub Copilot token from oauth.json in test	2025-12-15 19:16:08 +01:00
Mario Zechner	5a59b8d18d	Add GitHub Copilot test to packages/ai	2025-12-15 19:12:43 +01:00
Mario Zechner	3d35e7c469	Fix branch selector for single message and --no-session mode - Allow branch selector to open with single user message (changed <= 1 to === 0 check) - Support in-memory branching for --no-session mode (no files created) - Add isEnabled() getter to SessionManager - Update sessionFile getter to return null when sessions disabled - Update SessionSwitchEvent types to allow null session files - Add branching tests for single message and --no-session scenarios fixes #163	2025-12-10 22:41:32 +01:00
Mario Zechner	76312ea7e8	Fix Mistral 400 errors after aborted assistant messages - Skip empty assistant messages (no content, no tool calls) to avoid Mistral's 'Assistant message must have either content or tool_calls' error - Remove synthetic assistant bridge message after tool results (Mistral no longer requires this as of Dec 2024) - Add test for empty assistant message handling Follow-up to #165	2025-12-10 21:13:33 +01:00
Mario Zechner	99b4b1aca0	Add Mistral as AI provider - Add Mistral to KnownProvider type and model generation - Implement Mistral-specific compat handling in openai-completions: - requiresToolResultName: tool results need name field - requiresAssistantAfterToolResult: synthetic assistant message between tool/user - requiresThinkingAsText: thinking blocks as <thinking> text - requiresMistralToolIds: tool IDs must be exactly 9 alphanumeric chars - Add MISTRAL_API_KEY environment variable support - Add Mistral tests across all test files - Update documentation (README, CHANGELOG) for both ai and coding-agent packages - Remove client IDs from gemini.md, reference upstream source instead Closes #165	2025-12-10 20:36:19 +01:00
Mario Zechner	5a9d844f9a	Simplify compaction: remove proactive abort, use Agent.continue() for retry - Add agentLoopContinue() to pi-ai for resuming from existing context - Add Agent.continue() method and transport.continue() interface - Simplify AgentSession compaction to two cases: overflow (auto-retry) and threshold (no retry) - Remove proactive mid-turn compaction abort - Merge turn prefix summary into main summary - Add isCompacting property to AgentSession and RPC state - Block input during compaction in interactive mode - Show compaction count on session resume - Rename RPC.md to rpc.md for consistency Related to #128	2025-12-09 21:43:49 +01:00
Mario Zechner	238c5d34e4	Fix tsgo type issues: update tsgo, fix ReasoningEffort import, remove broken enum-test	2025-12-08 22:59:13 +01:00
Mario Zechner	00370cab39	Add xhigh thinking level for OpenAI codex-max models - Add 'xhigh' to ThinkingLevel type in ai and agent packages - Map xhigh to reasoning_effort: 'max' for OpenAI providers - Add thinkingXhigh color token to theme schema and built-in themes - Show xhigh option only when using codex-max models - Update CHANGELOG for both ai and coding-agent packages closes #143	2025-12-08 21:12:54 +01:00
Mario Zechner	b813a8b92b	Implement tool result truncation with actionable notices (#134) - read: actionable notices with offset for continuation - First line > 30KB: return empty + bash command suggestion - Hit limit: '[Showing lines X-Y of Z. Use offset=N to continue]' - bash: tail truncation with temp file - Notice includes line range + temp file path - Edge case: last line > 30KB shows partial - grep: pre-truncate match lines to 500 chars - '[... truncated]' suffix on long lines - Notice for match limit and line truncation - find/ls: result/entry limit notices - '[N results limit reached. Use limit=M for more]' - All notices now in text content (LLM sees them) - TUI simplified (notices render as part of output) - Never return partial lines (except bash edge case)	2025-12-07 01:11:31 +01:00
Mario Zechner	86e5a70ec4	Add totalTokens field to Usage type - Added totalTokens field to Usage interface in pi-ai - Anthropic: computed as input + output + cacheRead + cacheWrite - OpenAI/Google: uses native total_tokens/totalTokenCount - Fixed openai-completions to compute totalTokens when reasoning tokens present - Updated calculateContextTokens() to use totalTokens field - Added comprehensive test covering 13 providers fixes #130	2025-12-06 22:46:02 +01:00
Mario Zechner	a325c1c7d1	Add context overflow detection utilities Extract overflow detection logic into reusable utilities: - isContextOverflowError() to detect overflow from error messages - isContextOverflowFromUsage() to detect overflow from token usage - Patterns for Anthropic, OpenAI, Google, xAI, Groq, OpenRouter, llama.cpp, LM Studio Fixes #129	2025-12-06 21:24:15 +01:00
Mario Zechner	4afb3231e4	Fix up prompt for image-tool-result for Gemini	2025-11-20 17:09:32 +01:00
Mario Zechner	a11c1aa4ff	Release v0.7.17	2025-11-18 17:49:12 +01:00
Mario Zechner	84dcab219b	Add image support in tool results across all providers Tool results now use content blocks and can include both text and images. All providers (Anthropic, Google, OpenAI Completions, OpenAI Responses) correctly pass images from tool results to LLMs. - Update ToolResultMessage type to use content blocks - Add placeholder text for image-only tool results in Google/Anthropic - OpenAI providers send tool result + follow-up user message with images - Fix Anthropic JSON parsing for empty tool arguments - Add comprehensive tests for image-only and text+image tool results - Update README with tool result content blocks API	2025-11-12 10:45:56 +01:00
Mario Zechner	bc8d994a7b	Fix token statistics on abort for Anthropic provider - Add handling for message_start event to capture initial token usage - Fix message_delta to use assignment (=) instead of addition (+=) since Anthropic sends cumulative token counts, not incremental - Add comprehensive tests for all providers (Google, OpenAI Completions, OpenAI Responses, Anthropic) - Document OpenAI limitation: token stats only available at stream end Fixes issue where aborted streams had zero token counts despite Anthropic sending input tokens in the initial message_start event.	2025-10-26 21:22:24 +01:00
Mario Zechner	55dc0b6e08	Add timestamp to messages	2025-10-26 00:43:43 +02:00
Mario Zechner	4e7a340460	Add Unicode surrogate sanitization for all providers Fixes issue where unpaired Unicode surrogates in tool results cause JSON serialization errors in API providers, particularly Anthropic. - Add sanitizeSurrogates() utility function to remove unpaired surrogates - Apply sanitization in all provider convertMessages() functions: - User message text content (string and text blocks) - Assistant message text and thinking blocks - Tool result output - System prompts - Valid emoji (properly paired surrogates) are preserved - Add comprehensive test suite covering all 8 providers Previously only Google and Groq handled unpaired surrogates correctly. Now all providers (Anthropic, OpenAI Completions/Responses, Google, xAI, Groq, Cerebras, zAI) sanitize text before API submission.	2025-10-13 14:26:54 +02:00
Mario Zechner	b129154cc8	Add ToolRenderResult interface for custom tool rendering - Changed ToolRenderer return type from TemplateResult to ToolRenderResult - ToolRenderResult = { content: TemplateResult, isCustom: boolean } - isCustom: true = no card wrapper, false = wrap in card - Updated all existing tool renderers to return new format - Updated Messages.ts to handle custom rendering This enables tools to render without default card chrome when needed.	2025-10-11 04:40:42 +02:00
Mario Zechner	51f5448a5c	Remove tool calls for which there are no results in subsequent user messages.	2025-10-01 22:18:30 +02:00
Mario Zechner	2296dc4052	refactor(ai): improve error handling and stop reason types - Add 'aborted' as a distinct stop reason separate from 'error' - Change AssistantMessage.error to errorMessage for clarity - Update error event to include reason field ('error' | 'aborted') - Map provider-specific safety/refusal reasons to 'error' stop reason - Reorganize utility functions into utils/ directory - Rename agent.ts to agent-loop.ts for better clarity - Fix error handling in all providers to properly distinguish abort from error	2025-09-18 19:57:13 +02:00
Mario Zechner	39c626b6c9	feat(ai): add partial JSON parsing for streaming tool calls - Added partial-json package for parsing incomplete JSON during streaming - Tool call arguments now contain partially parsed JSON during toolcall_delta events - Enables progressive UI updates (e.g., showing file paths before content is complete) - Arguments are always valid objects (minimum empty {}), never undefined - Full validation still occurs at toolcall_end when arguments are complete - Updated all providers (Anthropic, OpenAI Completions/Responses) to use parseStreamingJson - Added comprehensive documentation and examples in README - Added test to verify arguments are always defined during streaming	2025-09-16 12:23:34 +02:00
Mario Zechner	197259c88a	Fix NodeJS compat	2025-09-16 02:19:47 +02:00
Mario Zechner	e8370436d7	Replace Zod with TypeBox for schema validation - Switch from Zod to TypeBox for tool parameter schemas - TypeBox schemas can be serialized/deserialized as JSON - Use AJV for runtime validation instead of Zod's parse - Add StringEnum helper for Google API compatibility (avoids anyOf/const patterns) - Export Type and Static from main package for convenience - Update all tests and documentation to reflect TypeBox usage	2025-09-16 01:10:40 +02:00
Mario Zechner	35fe8f21e9	feat(ai): Implement Zod-based tool validation and improve Agent API - Replace JSON Schema with Zod schemas for tool parameter definitions - Add runtime validation for all tool calls at provider level - Create shared validation module with detailed error formatting - Update Agent API with comprehensive event system - Add agent tests with calculator tool for multi-turn execution - Add abort test to verify proper handling of aborted requests - Update documentation with detailed event flow examples - Rename generate.ts to stream.ts for clarity	2025-09-09 14:58:54 +02:00
Mario Zechner	594b0dac6c	Stop GPT-OSS 20b from being dumb ..	2025-09-09 04:31:09 +02:00
Mario Zechner	98a876f3a0	Fix streaming for z-ai in anthropic provider, add preliminary support for tool call streaming. Only reporting argument string deltas, not partial JSON objects	2025-09-09 04:26:56 +02:00
Mario Zechner	2bdb87dfe7	chore: bump version to 0.5.31	2025-09-07 00:09:34 +02:00
Mario Zechner	d073953ef7	feat(ai): Add zAI provider support - Add 'zai' as a KnownProvider type - Add ZAI_API_KEY environment variable mapping - Generate 4 zAI models (glm-4.5-air, glm-4.5v, etc.) using anthropic-messages API - Add comprehensive test coverage for zAI provider in generate.test.ts and empty.test.ts - Models support reasoning/thinking capabilities and tool calling	2025-09-07 00:09:15 +02:00
Mario Zechner	6679a83b32	fix(ai): Sanitize tool call IDs for Anthropic API compatibility - Anthropic API requires tool call IDs to match pattern ^[a-zA-Z0-9_-]+$ - OpenAI Responses API generates IDs with pipe character (|) which breaks Anthropic - Added sanitizeToolCallId() to replace invalid characters with underscores - Fixes cross-provider handoffs from OpenAI Responses to Anthropic - Added test to verify the fix works	2025-09-04 05:17:08 +02:00
Mario Zechner	acf0f5aee2	Clean-up	2025-09-03 00:01:32 +02:00
Mario Zechner	66cefb236e	Massive refactor of API - Switch to function based API - Anthropic SDK style async generator - Fully typed with escape hatches for custom models	2025-09-02 23:59:36 +02:00
Mario Zechner	004de3c9d0	feat(ai): Add new streaming generate API with AsyncIterable interface - Implement QueuedGenerateStream class that extends AsyncIterable with finalMessage() method - Add new types: GenerateStream, GenerateOptions, GenerateOptionsUnified, GenerateFunction - Create generateAnthropic function-based implementation replacing class-based approach - Add comprehensive test suite for the new generate API - Support streaming events with text, thinking, and tool call deltas - Map ReasoningEffort to provider-specific options - Include apiKey in options instead of constructor parameter	2025-09-02 18:07:46 +02:00
Mario Zechner	be07c08a75	test(ai): Add empty assistant message tests - Test providers handling empty assistant messages in conversation flow - Pattern: user message -> empty assistant -> user message - All providers handle empty assistant messages gracefully - Tests ensure providers can continue conversation after empty response	2025-09-02 02:10:07 +02:00
Mario Zechner	0ac05a0676	test(ai): Add empty message tests for all providers - Test handling of empty content arrays - Test handling of empty string content - Test handling of whitespace-only content - All providers handle these edge cases gracefully	2025-09-02 02:03:06 +02:00
Mario Zechner	efaa5cdb39	feat(ai): Fetch Anthropic, Google, and OpenAI models from models.dev instead of OpenRouter - Updated generate-models.ts to fetch these providers directly from models.dev API - OpenRouter now only used for xAI and other third-party providers - Fixed test model IDs to match new model names from models.dev - Removed unused import from google.ts	2025-09-02 01:18:59 +02:00
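
Commit 6679a83b32 above describes sanitizing tool call IDs so that IDs produced by the OpenAI Responses API (which can contain a pipe character) satisfy the Anthropic pattern ^[a-zA-Z0-9_-]+$. A minimal sketch of that idea follows, assuming a plain character-class replacement. Only the function name and the pattern come from the commit message; the body, the constant name, and the sample ID are illustrative guesses, and the repository's actual implementation may differ.

```typescript
// Hypothetical sketch of the sanitization described in commit 6679a83b32.
// Not copied from the repository; the real sanitizeToolCallId() may differ.

// Matches every character outside the set allowed by ^[a-zA-Z0-9_-]+$.
const INVALID_TOOL_CALL_ID_CHARS = /[^a-zA-Z0-9_-]/g;

// Replace each disallowed character with an underscore.
function sanitizeToolCallId(id: string): string {
  return id.replace(INVALID_TOOL_CALL_ID_CHARS, "_");
}

// Made-up ID containing a pipe, as the commit says OpenAI Responses can emit.
console.log(sanitizeToolCallId("call_abc|fc_123")); // call_abc_fc_123
```

IDs that already match the pattern pass through unchanged, so applying the function to every tool call ID during a cross-provider handoff should be harmless under these assumptions.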


72 commits