co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-20 09:01:49 +00:00

Author	SHA1	Message	Date
Mario Zechner	0d24ddbb03	fix(ai): use model.api instead of hardcoding api type in streaming functions - anthropic.ts: use model.api instead of hardcoding 'anthropic-messages' - openai-responses.ts: use model.api instead of hardcoding 'openai-responses' - gitlab-duo: simplify to use actual model IDs, export MODELS array	2026-01-25 00:52:34 +01:00
Mario Zechner	177c694406	feat: custom provider support with streamSimple - Add resetApiProviders() to clear and re-register built-in providers - Add createAssistantMessageEventStream() factory for extensions - Add streamSimple support in ProviderConfig for custom API implementations - Call resetApiProviders() on /reload to clean up extension providers - Add custom-provider.md documentation - Add custom-provider.ts example with full Anthropic implementation - Update extensions.md with streamSimple config option	2026-01-24 23:15:11 +01:00
Mario Zechner	c725135a76	refactor(ai): register api providers	2026-01-24 23:15:11 +01:00
Markus Ylisiurunen	151099e17e	fix(ai): handle openai responses arguments.done events	2026-01-24 12:05:58 +01:00
Markus Ylisiurunen	c6e966bd1c	adjust azure responses metadata and handoff gating	2026-01-24 12:05:58 +01:00
Markus Ylisiurunen	bd7049b7d1	fix(ai): port openai responses handoff guard	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	5edec3a40a	fix(ai): preserve codex tool strictness	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	284ff81035	refactor(ai): share openai responses logic	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	085c378d34	add Azure deployment name map and refresh generated models	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	391c93800c	switch azure responses to base url config and v1 api	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	01f559efc0	guard azure responses deltas before content parts	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	3112526051	remove service tier from `azure-openai-responses`; add link to changelog entry	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	856012296b	add Azure OpenAI Responses provider with deployment-aware model mapping	2026-01-24 12:04:34 +01:00
Mario Zechner	72de8f26a1	Merge pull request #917 from williballenthin/fix-call-arguments-done fix(ai): handle call arguments done on OpenAI-compatible endpoints	2026-01-24 03:14:02 +01:00
Danila Poyarkov	6d0c544e18	fix: Bun compatibility for build scripts and runtime detection	2026-01-23 19:31:16 +03:00
Willi Ballenthin	fb364c89bf	fix(ai): handle call arguments done on OpenAI-compatible endpoints fix bug encountered when running GLM-4.7-Flash hosted by LM Studio, in which the provider sends tool call arguments via `response.function_call_arguments.done` events instead of streaming them via `response.function_call_arguments.delta` events. The final `response.output_item.done` event then contains empty `{}` arguments. The code only handled delta events, so tool calls failed with validation errors like `"must have required property 'command'"`. Full disclosure, Opus triaged the bug and provided the fix (by adding logging statements to the req/resp to the upstream provider (LM Studio)). I'm to provide prompts/transcripts, and acknowledge that I'm not an expert in Pi internals at this time.	2026-01-23 13:48:54 +01:00
Michael Renner	6289c144bf	fix(ai): batch tool-result images after consecutive tool results (#902 ) Fixes 400 errors when reading multiple images via GitHub Copilot's Claude models. Claude requires tool_use -> tool_result adjacency with no user messages interleaved. Before: assistant(tool_calls) -> tool -> user(images) -> tool -> user(images) After: assistant(tool_calls) -> tool -> tool -> user(all images)	2026-01-22 13:10:10 +01:00
Mario Zechner	b712d1ca43	fix(ai, web-ui): browser compatibility for pi-ai, update tsgo for decorator support - Update @typescript/native-preview to 7.0.0-dev.20260120.1 (supports experimentalDecorators) - Replace top-level node:fs, node:os, node:path imports with dynamic imports in stream.ts - Replace top-level node:os import with dynamic import in openai-codex-responses.ts - Replace top-level node:crypto, node:http imports with dynamic imports in openai-codex.ts - Replace Buffer.from with atob for browser-compatible base64 decoding fixes #873	2026-01-22 01:33:46 +01:00
Mario Zechner	d327b9c768	fix(ai): handle same-provider different-model handoff in OpenAI Responses API When switching between OpenAI models (e.g., gpt-5-mini to gpt-5.2-codex), function_call IDs with fc_ prefix trigger pairing validation errors because OpenAI tracks which fc_xxx IDs were paired with rs_xxx reasoning items. The fix omits the id field for function_calls from different models, which avoids the pairing validation while keeping call_id for matching with function_call_output. Fixes #886	2026-01-22 00:58:49 +01:00
Mario Zechner	d2be6486a4	feat(ai): add headers option to StreamOptions for custom HTTP headers - Added headers field to base StreamOptions interface - Updated all providers to merge options.headers with defaults - Forward headers and onPayload through streamSimple/completeSimple - Bedrock not supported (uses AWS SDK auth)	2026-01-20 01:08:31 +01:00
Mario Zechner	2d27a2c728	fix(ai): skip errored/aborted assistant messages in transform-messages Fixes OpenAI Responses 400 error 'reasoning without following item' by skipping errored/aborted assistant messages entirely rather than filtering at the provider level. This covers openai-responses, openai-codex-responses, and future providers. Removes strictResponsesPairing compat option (no longer needed). Closes #838	2026-01-19 15:55:29 +01:00
Mario Zechner	2c7c23b865	fix(ai): normalize tool call ids and handoff tests fixes #821	2026-01-19 00:10:49 +01:00
Mario Zechner	d43930c818	feat(ai): add strictResponsesPairing for Azure OpenAI Responses API Split OpenAICompat into OpenAICompletionsCompat and OpenAIResponsesCompat for type-safe API-specific compat settings. Added strictResponsesPairing option to suppress orphaned reasoning/tool calls on incomplete turns, fixing 400 errors on Azure's Responses API which requires strict pairing. Closes #768	2026-01-18 20:15:33 +01:00
Mario Zechner	4068bc556a	chore: simplify codex prompt handling	2026-01-17 21:53:01 +01:00
Mario Zechner	a5f1016da2	fix(ai): normalize tool names case-insensitively against CC tool list - Replace hardcoded pi->CC tool mappings with single CC tool name list - Case-insensitive lookup: if tool name matches CC tool, use CC casing - Remove broken find->Glob mapping (round-trip failed) - Add test coverage for tool name normalization	2026-01-17 21:03:47 +01:00
Mario Zechner	0f3a0f78bc	fix(ai): prevent orphaned tool results after errored assistant messages When an assistant message has stopReason 'error', its tool calls are now excluded from pending tool tracking. This prevents synthetic tool results from being generated for calls that will be dropped by provider-specific converters (e.g., Codex drops tool calls from errored messages). Previously, this mismatch caused OpenAI to reject requests with 'No tool call found for function call output with call_id ...' errors. fixes #812	2026-01-17 20:20:39 +01:00
Mario Zechner	5d3e7d5aaa	fix(ai): preserve unsigned tool call context for Gemini 3 with anti-mimicry note Instead of skipping unsigned tool calls entirely (which lobotomizes context), convert them to text with an explicit note telling the model this is historical context from a different model and not a format to mimic. This preserves tool call/result context when switching from providers without thought signatures (e.g. Claude via Antigravity) to Gemini 3.	2026-01-16 23:42:39 +01:00
Mario Zechner	fbb74bb29e	fix(ai): filter empty error assistant messages in transformMessages When 429/500 errors occur during tool execution, empty assistant messages with stopReason='error' get persisted. These break the tool_use -> tool_result chain for Claude/Gemini APIs. Added centralized filtering in transformMessages to skip assistant messages with empty content and no tool calls. Provider-level filters remain for defense-in-depth.	2026-01-16 22:35:50 +01:00
Pablo Tovar	ba8059a502	fix: sanitize bedrock tool call ids (#781 )	2026-01-16 17:51:48 +01:00
Mario Zechner	f900eb591d	Fix provider feature detection to use model.provider, not just URL Previously, OpenAI-compatible provider settings (like developer role support) were detected only from the baseUrl. When using custom URLs (e.g., proxies), detection failed. Now checks model.provider first, then falls back to URL. Fixes #774	2026-01-16 12:41:23 +01:00
Mario Zechner	c08801e4c5	Add retry logic to OpenAI Codex provider Fixes #733	2026-01-16 03:15:59 +01:00
Mario Zechner	be26d362fa	Fix alt+backspace in Kitty mode and clamp Codex effort (refs #752 )	2026-01-16 01:30:46 +01:00
Mario Zechner	6484ae279d	Finalize OpenAI Codex compatibility (#737 ) - align Codex Responses provider with Pi static instructions - simplify Codex request/stream handling and cleanup exports - keep legacy OpenCode Codex prompt for testing until Pi prompt is allowlisted	2026-01-16 00:58:36 +01:00
Melih Mucuk	cceb5908d9	fix: opencode provider uses system role instead of developer (#755 ) * fix: opencode provider uses system role instead of developer for /v1 endpoint * changelog updated	2026-01-15 21:26:31 +01:00
Roshan Singh	b18f401d9e	fix(ai): avoid unsigned Gemini 3 tool calls (#741 )	2026-01-15 13:12:39 +01:00
Burak Varlı	9a438465eb	fix(ai): signature support for non-Anthropic models in Amazon Bedrock provider (#727 ) * Add Amazon Bedrock models test suite for agent package Tests basic prompts, multi-turn conversations with thinking, and synthetic thinking signatures across all Bedrock models. Known issues are categorized and skipped: - Models requiring inference profile (5) - Invalid model IDs for us-east-1 region (6) - Max tokens config exceeds model limit (2) - No signature support in reasoningContent (10) - Rejects reasoning content in user messages (25) - Validates signature format - Anthropic newer models (7) * Fix Bedrock signature support for non-Anthropic models Only include the signature field in reasoningContent.reasoningText for Anthropic Claude models. Other models (OpenAI, Qwen, Minimax, Moonshot, etc.) reject this field with: "This model doesn't support the reasoningContent.reasoningText.signature field" This fix enables multi-turn conversations with thinking content for 10 additional Bedrock models that previously failed. https://buildwithpi.ai/session?7e39c05f66ea358da3f993c267fe3e29 * Add a CHANGELOG entry	2026-01-14 19:21:35 +01:00
Markus Ylisiurunen	653025e6ca	Fix OpenAI responses timeout option (#706 )	2026-01-14 00:07:11 +01:00
Pablo Tovar	b74535dc85	fix: apply message transforms for bedrock tool calls (#707 ) Ensure Bedrock uses transformMessages before conversion.	2026-01-14 00:05:47 +01:00
Mario Zechner	09d409cc92	Fix z.ai thinking/reasoning params, fixes #688 Z.ai uses thinking: { type: "enabled" \| "disabled" } instead of OpenAI's reasoning_effort. Added thinkingFormat compat flag to handle this. Thinking is now explicitly enabled/disabled based on user setting.	2026-01-13 18:34:07 +01:00
Markus Ylisiurunen	00ba005e50	set the prompt cache key to session id (#698 )	2026-01-13 18:29:36 +01:00
Mario Zechner	28072cb31f	Add more models to stream.test.ts for Vercel, set infinite timeout on OpenAI responses, closes #690	2026-01-13 17:08:56 +01:00
Mario Zechner	3c60ffa677	Fix tool call ID normalization for cross-provider switches to Anthropic/GitHub Copilot	2026-01-13 04:07:10 +01:00
Mario Zechner	8af8d0d672	Add MiniMax provider support (#656 by @dannote) - Add minimax to KnownProvider and Api types - Add MINIMAX_API_KEY to getEnvApiKey() - Generate MiniMax-M2 and MiniMax-M2.1 models - Add context overflow detection pattern - Add tests to all required test files - Update README and CHANGELOG with attribution Also fixes: - Bedrock duplicate toolResult ID when content has multiple blocks - Sandbox extension unused parameter lint warning	2026-01-13 02:27:09 +01:00
Ahmed Kamal	ff15414258	Improve Gemini CLI provider retries and headers (#670 ) Improve Gemini CLI provider retries and headers - Add Antigravity endpoint fallback (tries daily sandbox then prod when baseUrl is unset) - Parse retry delays from headers (Retry-After, x-ratelimit-reset, x-ratelimit-reset-after) before body parsing - Derive stable sessionId from first user message for cache affinity - Retry empty SSE streams with backoff without duplicate start/done events - Add anthropic-beta header for Claude thinking models only	2026-01-13 01:04:53 +01:00
Danila Poyarkov	9e4ae98358	Improve Google Cloud Code Assist error handling (#665 ) * Improve Cloud Code Assist error messages - Extract just the message from verbose JSON error responses - Extract cause from generic 'fetch failed' errors for better diagnostics * Make 'other side closed' network error retryable * Make 'other side closed' network error retryable	2026-01-13 00:41:20 +01:00
Mario Zechner	d442bbcc19	feat(ai): Add prompt caching for Claude models on Bedrock Adds cache points to system prompt and last user message for: - Claude 3.5 Haiku - Claude 3.7 Sonnet - Claude 4.x models (Opus, Sonnet, Haiku) Uses Bedrock's cachePoint blocks with 5-minute TTL.	2026-01-13 00:38:12 +01:00
Mario Zechner	fd268479a4	feat(ai): Add Amazon Bedrock provider (#494 ) Adds support for Amazon Bedrock with Claude models including: - Full streaming support via Converse API - Reasoning/thinking support for Claude models - Cross-region inference model ID handling - Multiple AWS credential sources (profile, IAM keys, API keys) - Image support in messages and tool results - Unicode surrogate sanitization Also adds 'Adding a New Provider' documentation to AGENTS.md and README. Co-authored-by: nickchan2 <nickchan2@users.noreply.github.com>	2026-01-13 00:32:59 +01:00
Markus Ylisiurunen	4f216d318f	Apply service tier pricing (#675 )	2026-01-12 23:56:51 +01:00
nathyong	7b2c627079	Insert cache point on openrouter+anthropic completions (#584 ) Co-authored-by: nathyong <nathyong@noreply.github.com>	2026-01-12 23:29:33 +01:00
Markus Ylisiurunen	7b79e8ec51	Add service tier option for OpenAI Responses API (#672 ) * add service tier option for OpenAI responses * add serviceTier option for OpenAI Responses requests	2026-01-12 23:20:18 +01:00

1 2 3 4

193 commits