co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-22 02:03:42 +00:00

Author	SHA1	Message	Date
Mario Zechner	111a31e4db	fix(ai): apply cache_control to string user messages	2026-02-02 19:19:12 +01:00
Mario Zechner	abfd04b5c5	feat(ai): add cacheRetention stream option	2026-02-01 09:32:10 +01:00
Mario Zechner	af813f9048	fix(ai): default tool call arguments fixes #1065	2026-01-30 01:13:16 +01:00
Mario Zechner	1b6a147579	feat(ai): add PI_CACHE_RETENTION env var for extended prompt caching Adds support for extended cache retention via PI_CACHE_RETENTION=long: - Anthropic: 5m -> 1h TTL - OpenAI: in-memory -> 24h retention Only applies to direct API calls (api.anthropic.com, api.openai.com). Proxies and other providers are unaffected. fixes #967	2026-01-29 02:22:06 +01:00
Mario Zechner	8b5c81f21f	fix(ai): preserve input token counts from message_start in Anthropic provider Proxies like Portkey omit input_tokens in message_delta events (it's nullable per the SDK). The previous code unconditionally overwrote usage fields, causing input token counts to reset to 0. Now only updates usage fields when they are present (not null), preserving the correct input_tokens value captured from message_start. Fixes #1045	2026-01-29 00:06:51 +01:00
mom	ee7c0a7d18	fix(ai): handle sensitive stop_reason from Anthropic API (fixes #978 )	2026-01-28 02:18:16 +00:00
Mario Zechner	0d24ddbb03	fix(ai): use model.api instead of hardcoding api type in streaming functions - anthropic.ts: use model.api instead of hardcoding 'anthropic-messages' - openai-responses.ts: use model.api instead of hardcoding 'openai-responses' - gitlab-duo: simplify to use actual model IDs, export MODELS array	2026-01-25 00:52:34 +01:00
Mario Zechner	c725135a76	refactor(ai): register api providers	2026-01-24 23:15:11 +01:00
Mario Zechner	d2be6486a4	feat(ai): add headers option to StreamOptions for custom HTTP headers - Added headers field to base StreamOptions interface - Updated all providers to merge options.headers with defaults - Forward headers and onPayload through streamSimple/completeSimple - Bedrock not supported (uses AWS SDK auth)	2026-01-20 01:08:31 +01:00
Mario Zechner	2c7c23b865	fix(ai): normalize tool call ids and handoff tests fixes #821	2026-01-19 00:10:49 +01:00
Mario Zechner	a5f1016da2	fix(ai): normalize tool names case-insensitively against CC tool list - Replace hardcoded pi->CC tool mappings with single CC tool name list - Case-insensitive lookup: if tool name matches CC tool, use CC casing - Remove broken find->Glob mapping (round-trip failed) - Add test coverage for tool name normalization	2026-01-17 21:03:47 +01:00
Mario Zechner	fd268479a4	feat(ai): Add Amazon Bedrock provider (#494 ) Adds support for Amazon Bedrock with Claude models including: - Full streaming support via Converse API - Reasoning/thinking support for Claude models - Cross-region inference model ID handling - Multiple AWS credential sources (profile, IAM keys, API keys) - Image support in messages and tool results - Unicode surrogate sanitization Also adds 'Adding a New Provider' documentation to AGENTS.md and README. Co-authored-by: nickchan2 <nickchan2@users.noreply.github.com>	2026-01-13 00:32:59 +01:00
Mario Zechner	0138eee6f7	Fix tool mapping	2026-01-12 17:56:13 +01:00
Danila Poyarkov	934e7e470b	Avoid cross-provider thought signatures (#654 ) * Avoid cross-provider thought signatures * Fix Google thought signature replay Filter thought signatures to same provider with base64 validation and rename the transform helper for clarity.	2026-01-12 16:38:53 +01:00
Mario Zechner	ec83d91473	fix(ai): resolve OAuth tool names via context	2026-01-10 13:45:08 +01:00
Mario Zechner	60f5a03576	Add [Unreleased] section for next cycle	2026-01-09 20:24:50 +01:00
Helmut Januschka	b4351040a7	pi pi pi pew (#594 )	2026-01-09 12:43:00 +01:00
Mario Zechner	f745321169	Clean-up.	2026-01-09 05:23:08 +01:00
Mario Zechner	f5e6bcac1b	Remove Anthropic OAuth support	2026-01-09 05:10:33 +01:00
Mario Zechner	9f97f0c8da	getApiKeyFromEnv -> getEnvApiKey	2025-12-25 02:38:10 +01:00
Mario Zechner	d93cbf8c32	WIP: remove setApiKey, resolveApiKey	2025-12-24 23:34:23 +01:00
Mario Zechner	29379ea0a6	Fix thinking tag leakage by converting unsigned blocks to plain text Closes #302	2025-12-24 18:15:19 +01:00
Mario Zechner	0fc6689dfb	fix(ai): re-enable SDK retries for Anthropic provider The SDK default of 2 retries handles transient HTTP failures quickly, while coding-agent retries handle persistent errors with user feedback.	2025-12-20 09:56:11 +01:00
Mario Zechner	fd5134f88c	Release v0.22.2	2025-12-15 22:09:14 +01:00
Mario Zechner	a7e3b8625b	Release v0.22.1	2025-12-15 21:53:27 +01:00
Mario Zechner	bb445d24f1	Auto-retry on transient provider errors (overloaded, rate limit, 5xx) - Add retry logic with exponential backoff (2s, 4s, 8s) in AgentSession - Disable Anthropic SDK built-in retries (maxRetries: 0) to allow app-level handling - TUI shows retry status with Escape to cancel - RPC mode: add set_auto_retry, abort_retry commands and auto_retry_start/end events - Configurable via settings.json: retry.enabled, retry.maxRetries, retry.baseDelayMs - Exclude context overflow errors from retry (handled by compaction) fixes #157	2025-12-10 23:36:46 +01:00
Lukas Pitschl	a248e2547a	fix(ai): remove global process.env.ANTHROPIC_API_KEY deletion (#164 ) * fix(ai): remove global process.env.ANTHROPIC_API_KEY deletion The code was deleting process.env.ANTHROPIC_API_KEY to prevent the SDK from using it when OAuth tokens were provided. However, this was a global mutation that affected the entire Node.js process, causing the API key to be unavailable after the first prompt. The Anthropic SDK constructor already handles credential selection via parameters (apiKey: null, authToken: token for OAuth vs apiKey: key for regular keys), so the environment variable deletion was unnecessary. * Update CHANGELOG.md for API key fix	2025-12-10 18:12:16 +01:00
Mario Zechner	8bec289dc6	Remove provider-level tool validation, add validateToolCall helper	2025-12-08 18:04:33 +01:00
Markus Ylisiurunen	0196308266	add option to skip provider tool call validation	2025-12-07 17:24:06 +02:00
Mario Zechner	86e5a70ec4	Add totalTokens field to Usage type - Added totalTokens field to Usage interface in pi-ai - Anthropic: computed as input + output + cacheRead + cacheWrite - OpenAI/Google: uses native total_tokens/totalTokenCount - Fixed openai-completions to compute totalTokens when reasoning tokens present - Updated calculateContextTokens() to use totalTokens field - Added comprehensive test covering 13 providers fixes #130	2025-12-06 22:46:02 +01:00
Mario Zechner	de39f1f493	Add custom headers support for models.json Fixes #39 - Added headers field to Model type (provider and model level) - Model headers override provider headers when merged - Supported in all APIs: - Anthropic: defaultHeaders - OpenAI (completions/responses): defaultHeaders - Google: httpOptions.headers - Enables bypassing Cloudflare bot detection for proxied endpoints - Updated documentation with examples Also fixed: - Mistral/Chutes syntax error (iif -> if) - process.env.ANTHROPIC_API_KEY bug (use delete instead of = undefined)	2025-11-20 17:05:31 +01:00
Mario Zechner	387cc97bac	Fix Anthropic API rejection when resubmitting aborted thinking blocks - Convert thinking blocks with missing/empty signatures to text blocks - Prevents 400 error: 'Invalid signature in thinking block' - Occurs when stream is aborted mid-thinking and message is resubmitted	2025-11-18 14:36:57 +01:00
Mario Zechner	bf1a7d8571	Add 'pi' command alias and fix getApiKey import	2025-11-12 14:31:25 +01:00
Mario Zechner	00d8286523	Handle FinishReason.NO_IMAGE and fix optional chaining - Add NO_IMAGE to error finish reasons in Google provider - Fix non-null assertion after optional chaining in Anthropic provider - Migrate biome config to 2.3.5 - Ignore Tailwind CSS file from biome checks - Bump all packages to version 0.6.0	2025-11-12 10:58:03 +01:00
Mario Zechner	84dcab219b	Add image support in tool results across all providers Tool results now use content blocks and can include both text and images. All providers (Anthropic, Google, OpenAI Completions, OpenAI Responses) correctly pass images from tool results to LLMs. - Update ToolResultMessage type to use content blocks - Add placeholder text for image-only tool results in Google/Anthropic - OpenAI providers send tool result + follow-up user message with images - Fix Anthropic JSON parsing for empty tool arguments - Add comprehensive tests for image-only and text+image tool results - Update README with tool result content blocks API	2025-11-12 10:45:56 +01:00
Mario Zechner	bc8d994a7b	Fix token statistics on abort for Anthropic provider - Add handling for message_start event to capture initial token usage - Fix message_delta to use assignment (=) instead of addition (+=) since Anthropic sends cumulative token counts, not incremental - Add comprehensive tests for all providers (Google, OpenAI Completions, OpenAI Responses, Anthropic) - Document OpenAI limitation: token stats only available at stream end Fixes issue where aborted streams had zero token counts despite Anthropic sending input tokens in the initial message_start event.	2025-10-26 21:22:24 +01:00
Mario Zechner	55dc0b6e08	Add timestamp to messages	2025-10-26 00:43:43 +02:00
Mario Zechner	4e7a340460	Add Unicode surrogate sanitization for all providers Fixes issue where unpaired Unicode surrogates in tool results cause JSON serialization errors in API providers, particularly Anthropic. - Add sanitizeSurrogates() utility function to remove unpaired surrogates - Apply sanitization in all provider convertMessages() functions: - User message text content (string and text blocks) - Assistant message text and thinking blocks - Tool result output - System prompts - Valid emoji (properly paired surrogates) are preserved - Add comprehensive test suite covering all 8 providers Previously only Google and Groq handled unpaired surrogates correctly. Now all providers (Anthropic, OpenAI Completions/Responses, Google, xAI, Groq, Cerebras, zAI) sanitize text before API submission.	2025-10-13 14:26:54 +02:00
Mario Zechner	0496651308	Add Anthropic prompt caching, pluggable storage, and CORS proxy support Storage Architecture: - New pluggable storage system with backends (LocalStorage, ChromeStorage, IndexedDB) - SettingsRepository for app settings (proxy config, etc.) - ProviderKeysRepository for API key management - AppStorage with global accessors (getAppStorage, setAppStorage, initAppStorage) Transport Refactoring: - Renamed DirectTransport → ProviderTransport (calls LLM providers with optional CORS proxy) - Renamed ProxyTransport → AppTransport (uses app server with user auth) - Updated TransportMode: "direct" → "provider", "proxy" → "app" CORS Proxy Integration: - ProviderTransport checks proxy.enabled/proxy.url from storage - When enabled, modifies model baseUrl to route through proxy: {proxyUrl}/?url={originalBaseUrl} - ProviderKeyInput test function also honors proxy settings - Settings dialog with Proxy tab (Switch toggle, URL input, explanatory description) Anthropic Prompt Caching: - System prompt cached with cache_control markers (both OAuth and regular API keys) - Last user message cached to cache conversation history - Saves 90% on input tokens for cached content (10x cost reduction) Settings Dialog Improvements: - Configurable tab system with SettingsTab base class - ApiKeysTab and ProxyTab as custom elements - Switch toggle for proxy enable (instead of Checkbox) - Explanatory paragraphs for each tab - ApiKeyPromptDialog reuses ProviderKeyInput component Removed: - Deprecated ApiKeysDialog (replaced by ProviderKeyInput in SettingsDialog) - Old storage-adapter and key-store (replaced by new storage architecture)	2025-10-05 23:00:36 +02:00
Mario Zechner	2296dc4052	refactor(ai): improve error handling and stop reason types - Add 'aborted' as a distinct stop reason separate from 'error' - Change AssistantMessage.error to errorMessage for clarity - Update error event to include reason field ('error' \| 'aborted') - Map provider-specific safety/refusal reasons to 'error' stop reason - Reorganize utility functions into utils/ directory - Rename agent.ts to agent-loop.ts for better clarity - Fix error handling in all providers to properly distinguish abort from error	2025-09-18 19:57:13 +02:00
Mario Zechner	39c626b6c9	feat(ai): add partial JSON parsing for streaming tool calls - Added partial-json package for parsing incomplete JSON during streaming - Tool call arguments now contain partially parsed JSON during toolcall_delta events - Enables progressive UI updates (e.g., showing file paths before content is complete) - Arguments are always valid objects (minimum empty {}), never undefined - Full validation still occurs at toolcall_end when arguments are complete - Updated all providers (Anthropic, OpenAI Completions/Responses) to use parseStreamingJson - Added comprehensive documentation and examples in README - Added test to verify arguments are always defined during streaming	2025-09-16 12:23:34 +02:00
Mario Zechner	e8370436d7	Replace Zod with TypeBox for schema validation - Switch from Zod to TypeBox for tool parameter schemas - TypeBox schemas can be serialized/deserialized as JSON - Use AJV for runtime validation instead of Zod's parse - Add StringEnum helper for Google API compatibility (avoids anyOf/const patterns) - Export Type and Static from main package for convenience - Update all tests and documentation to reflect TypeBox usage	2025-09-16 01:10:40 +02:00
Mario Zechner	73d2119606	fix: Adjust max tokens for Anthropic and improve Google tools handling - Reduce default max tokens for Anthropic to 1/3 of model max - Fix Google provider to properly handle empty tools array - Ensure toolConfig is undefined when no tools are present	2025-09-15 00:34:52 +02:00
Mario Zechner	35fe8f21e9	feat(ai): Implement Zod-based tool validation and improve Agent API - Replace JSON Schema with Zod schemas for tool parameter definitions - Add runtime validation for all tool calls at provider level - Create shared validation module with detailed error formatting - Update Agent API with comprehensive event system - Add agent tests with calculator tool for multi-turn execution - Add abort test to verify proper handling of aborted requests - Update documentation with detailed event flow examples - Rename generate.ts to stream.ts for clarity	2025-09-09 14:58:54 +02:00
Mario Zechner	98a876f3a0	Fix streaming for z-ai in anthropic provider, add preliminary support for tool call streaming. Only reporting argument string deltas, not partial JSON objects	2025-09-09 04:26:56 +02:00
Mario Zechner	6679a83b32	fix(ai): Sanitize tool call IDs for Anthropic API compatibility - Anthropic API requires tool call IDs to match pattern ^[a-zA-Z0-9_-]+$ - OpenAI Responses API generates IDs with pipe character (\|) which breaks Anthropic - Added sanitizeToolCallId() to replace invalid characters with underscores - Fixes cross-provider handoffs from OpenAI Responses to Anthropic - Added test to verify the fix works	2025-09-04 05:17:08 +02:00
Mario Zechner	66cefb236e	Massive refactor of API - Switch to function based API - Anthropic SDK style async generator - Fully typed with escape hatches for custom models	2025-09-02 23:59:36 +02:00
Mario Zechner	2cfd8ff3c3	fix(ai): Use API type instead of model for message compatibility checks - Add getApi() method to all providers to identify the API type - Add api field to AssistantMessage to track which API generated it - Update transformMessages to check API compatibility instead of model - Fixes issue where OpenAI Responses API failed when switching models - Preserves thinking blocks and signatures when staying within same API	2025-09-02 00:20:06 +02:00
Mario Zechner	a62231987c	fix(ai): Add anthropic-dangerous-direct-browser-access header - Required header for browser-based access to Anthropic API - Added to both OAuth and regular API key authentication - Ensures full browser compatibility	2025-09-01 22:02:50 +02:00
Mario Zechner	da43e625f8	fix(ai): Add dangerouslyAllowBrowser flag for Anthropic client - Enables browser support for Anthropic SDK - Required for browser-based applications using the AI library	2025-09-01 21:55:52 +02:00

1 2

64 commits