co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-15 20:03:05 +00:00

Author	SHA1	Message	Date
Mario Zechner	c808de605a	feat(ai): add Hugging Face provider support - Add huggingface to KnownProvider type - Add HF_TOKEN env var mapping - Process huggingface models from models.dev (14 models) - Use openai-completions API with compat settings - Add tests for all provider test suites - Update documentation fixes #994	2026-01-29 02:40:14 +01:00
Mario Zechner	1b6a147579	feat(ai): add PI_CACHE_RETENTION env var for extended prompt caching Adds support for extended cache retention via PI_CACHE_RETENTION=long: - Anthropic: 5m -> 1h TTL - OpenAI: in-memory -> 24h retention Only applies to direct API calls (api.anthropic.com, api.openai.com). Proxies and other providers are unaffected. fixes #967	2026-01-29 02:22:06 +01:00
Mario Zechner	605f6f494b	fix(ai): normalize pipe-separated tool call IDs for cross-provider handoff - Handle pipe-separated IDs from OpenAI Responses API in openai-completions provider - Strip trailing underscores after truncation in openai-responses-shared (OpenAI Codex rejects them) - Add regression tests for tool call ID normalization fixes #1022	2026-01-29 01:28:12 +01:00
Mario Zechner	25707f9ad4	fix(ai): 429 rate limit errors no longer trigger auto-compaction 429 (Too Many Requests) was incorrectly classified as context overflow, triggering compaction instead of retry with backoff. The original logic assumed token-based rate limiting correlates with context overflow, but these are different concepts: - Rate limiting (429): requests/tokens per time period (throughput) - Context overflow: single request exceeds context window (size) Now 429 errors are handled by the existing retry logic with exponential backoff, while 400/413 remain as potential context overflow indicators. fixes #1038	2026-01-29 00:43:38 +01:00
Mario Zechner	8b5c81f21f	fix(ai): preserve input token counts from message_start in Anthropic provider Proxies like Portkey omit input_tokens in message_delta events (it's nullable per the SDK). The previous code unconditionally overwrote usage fields, causing input token counts to reset to 0. Now only updates usage fields when they are present (not null), preserving the correct input_tokens value captured from message_start. Fixes #1045	2026-01-29 00:06:51 +01:00
Mario Zechner	4f9deddd47	fix(ai): detect DeepSeek URLs and disable unsupported developer role fixes #1048	2026-01-28 23:55:54 +01:00
mom	ee7c0a7d18	fix(ai): handle sensitive stop_reason from Anthropic API (fixes #978)	2026-01-28 02:18:16 +00:00
williamtwomey	41d2c7ff38	OpenAI completions toolChoice fix (#998) * openai completions tools fix * Reset generated file --------- Co-authored-by: williamtwomey <ai@shadylawn.net>	2026-01-28 03:03:15 +01:00
Daniel Tatarkin	9f3eef65f8	fix(ai): filter deprecated OpenCode models from generation (#970) Add status === 'deprecated' check for OpenCode Zen models, matching the existing pattern used for GitHub Copilot models. This removes deprecated models like glm-4.7-free and minimax-m2.1-free from the generated model catalog.	2026-01-26 23:56:13 +01:00
Mario Zechner	2bbc1a7659	Add [Unreleased] section for next cycle	2026-01-26 16:55:24 +01:00
Mario Zechner	9b903656ae	fix(ai): correct provider error message typo (closes #958)	2026-01-26 15:37:57 +01:00
Mario Zechner	676de103e1	Update models/package-lock.json after adding deps	2026-01-25 19:29:02 +01:00
haoqixu	1e718e63ea	feat(ai): Support HTTP proxy through environment variables	2026-01-25 15:52:45 +08:00
jake	dac7474da2	feat(ai): add OpenRouter provider routing support Allows custom models to specify which upstream providers OpenRouter should route requests to via the `openRouterRouting` field in model definitions. Supported fields: - `only`: list of provider slugs to exclusively use - `order`: list of provider slugs to try in order	2026-01-25 03:34:49 +01:00
Mario Zechner	a6d878e804	fix(ai): default tool call arguments to empty object for Google providers When Google providers return tool calls without an args field (common for no-argument tools), the arguments field was undefined. This breaks subsequent API calls that require tool_use.input to be present. Now defaults to {} when args is missing. Related: clawdbot/clawdbot#1509	2026-01-25 03:18:02 +01:00
Mario Zechner	0d24ddbb03	fix(ai): use model.api instead of hardcoding api type in streaming functions - anthropic.ts: use model.api instead of hardcoding 'anthropic-messages' - openai-responses.ts: use model.api instead of hardcoding 'openai-responses' - gitlab-duo: simplify to use actual model IDs, export MODELS array	2026-01-25 00:52:34 +01:00
Mario Zechner	177c694406	feat: custom provider support with streamSimple - Add resetApiProviders() to clear and re-register built-in providers - Add createAssistantMessageEventStream() factory for extensions - Add streamSimple support in ProviderConfig for custom API implementations - Call resetApiProviders() on /reload to clean up extension providers - Add custom-provider.md documentation - Add custom-provider.ts example with full Anthropic implementation - Update extensions.md with streamSimple config option	2026-01-24 23:15:11 +01:00
Mario Zechner	c725135a76	refactor(ai): register api providers	2026-01-24 23:15:11 +01:00
Mario Zechner	3256d3c083	refactor(oauth): add provider registry	2026-01-24 23:15:11 +01:00
Markus Ylisiurunen	151099e17e	fix(ai): handle openai responses arguments.done events	2026-01-24 12:05:58 +01:00
Markus Ylisiurunen	c6e966bd1c	adjust azure responses metadata and handoff gating	2026-01-24 12:05:58 +01:00
Markus Ylisiurunen	bd7049b7d1	fix(ai): port openai responses handoff guard	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	5edec3a40a	fix(ai): preserve codex tool strictness	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	284ff81035	refactor(ai): share openai responses logic	2026-01-24 12:05:40 +01:00
Markus Ylisiurunen	085c378d34	add Azure deployment name map and refresh generated models	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	391c93800c	switch azure responses to base url config and v1 api	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	01f559efc0	guard azure responses deltas before content parts	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	3112526051	remove service tier from `azure-openai-responses`; add link to changelog entry	2026-01-24 12:04:34 +01:00
Markus Ylisiurunen	856012296b	add Azure OpenAI Responses provider with deployment-aware model mapping	2026-01-24 12:04:34 +01:00
Mario Zechner	72de8f26a1	Merge pull request #917 from williballenthin/fix-call-arguments-done fix(ai): handle call arguments done on OpenAI-compatible endpoints	2026-01-24 03:14:02 +01:00
Danila Poyarkov	6d0c544e18	fix: Bun compatibility for build scripts and runtime detection	2026-01-23 19:31:16 +03:00
Willi Ballenthin	fb364c89bf	fix(ai): handle call arguments done on OpenAI-compatible endpoints fix bug encountered when running GLM-4.7-Flash hosted by LM Studio, in which the provider sends tool call arguments via `response.function_call_arguments.done` events instead of streaming them via `response.function_call_arguments.delta` events. The final `response.output_item.done` event then contains empty `{}` arguments. The code only handled delta events, so tool calls failed with validation errors like `"must have required property 'command'"`. Full disclosure, Opus triaged the bug and provided the fix (by adding logging statements to the req/resp to the upstream provider (LM Studio)). I'm to provide prompts/transcripts, and acknowledge that I'm not an expert in Pi internals at this time.	2026-01-23 13:48:54 +01:00
Michael Renner	6289c144bf	fix(ai): batch tool-result images after consecutive tool results (#902) Fixes 400 errors when reading multiple images via GitHub Copilot's Claude models. Claude requires tool_use -> tool_result adjacency with no user messages interleaved. Before: assistant(tool_calls) -> tool -> user(images) -> tool -> user(images) After: assistant(tool_calls) -> tool -> tool -> user(all images)	2026-01-22 13:10:10 +01:00
Mario Zechner	df8b3544c3	Add [Unreleased] section for next cycle	2026-01-22 01:43:26 +01:00
Mario Zechner	b712d1ca43	fix(ai, web-ui): browser compatibility for pi-ai, update tsgo for decorator support - Update @typescript/native-preview to 7.0.0-dev.20260120.1 (supports experimentalDecorators) - Replace top-level node:fs, node:os, node:path imports with dynamic imports in stream.ts - Replace top-level node:os import with dynamic import in openai-codex-responses.ts - Replace top-level node:crypto, node:http imports with dynamic imports in openai-codex.ts - Replace Buffer.from with atob for browser-compatible base64 decoding fixes #873	2026-01-22 01:33:46 +01:00
Mario Zechner	d327b9c768	fix(ai): handle same-provider different-model handoff in OpenAI Responses API When switching between OpenAI models (e.g., gpt-5-mini to gpt-5.2-codex), function_call IDs with fc_ prefix trigger pairing validation errors because OpenAI tracks which fc_xxx IDs were paired with rs_xxx reasoning items. The fix omits the id field for function_calls from different models, which avoids the pairing validation while keeping call_id for matching with function_call_output. Fixes #886	2026-01-22 00:58:49 +01:00
Mario Zechner	693112e395	feat(ai): add originator option to loginOpenAICodex	2026-01-20 01:28:51 +01:00
Mario Zechner	d2be6486a4	feat(ai): add headers option to StreamOptions for custom HTTP headers - Added headers field to base StreamOptions interface - Updated all providers to merge options.headers with defaults - Forward headers and onPayload through streamSimple/completeSimple - Bedrock not supported (uses AWS SDK auth)	2026-01-20 01:08:31 +01:00
Mario Zechner	2b04aefa6d	feat(ai): add AWS ECS/IRSA credential detection for Bedrock, fixes #848 Added support for additional AWS credential environment variables: - AWS_CONTAINER_CREDENTIALS_RELATIVE_URI (ECS task roles) - AWS_CONTAINER_CREDENTIALS_FULL_URI (ECS task roles) - AWS_WEB_IDENTITY_TOKEN_FILE (IRSA for Kubernetes) Also fixed undefined currentModel variable in OAuth error handling.	2026-01-19 16:10:10 +01:00
Mario Zechner	2d27a2c728	fix(ai): skip errored/aborted assistant messages in transform-messages Fixes OpenAI Responses 400 error 'reasoning without following item' by skipping errored/aborted assistant messages entirely rather than filtering at the provider level. This covers openai-responses, openai-codex-responses, and future providers. Removes strictResponsesPairing compat option (no longer needed). Closes #838	2026-01-19 15:55:29 +01:00
Mario Zechner	0e5977ad39	Add [Unreleased] section for next cycle	2026-01-19 00:23:57 +01:00
Mario Zechner	2c7c23b865	fix(ai): normalize tool call ids and handoff tests fixes #821	2026-01-19 00:10:49 +01:00
Mario Zechner	d43930c818	feat(ai): add strictResponsesPairing for Azure OpenAI Responses API Split OpenAICompat into OpenAICompletionsCompat and OpenAIResponsesCompat for type-safe API-specific compat settings. Added strictResponsesPairing option to suppress orphaned reasoning/tool calls on incomplete turns, fixing 400 errors on Azure's Responses API which requires strict pairing. Closes #768	2026-01-18 20:15:33 +01:00
Mario Zechner	632495338f	Add [Unreleased] section for next cycle	2026-01-17 22:02:59 +01:00
Mario Zechner	4068bc556a	chore: simplify codex prompt handling	2026-01-17 21:53:01 +01:00
Mario Zechner	a5f1016da2	fix(ai): normalize tool names case-insensitively against CC tool list - Replace hardcoded pi->CC tool mappings with single CC tool name list - Case-insensitive lookup: if tool name matches CC tool, use CC casing - Remove broken find->Glob mapping (round-trip failed) - Add test coverage for tool name normalization	2026-01-17 21:03:47 +01:00
Mario Zechner	0f3a0f78bc	fix(ai): prevent orphaned tool results after errored assistant messages When an assistant message has stopReason 'error', its tool calls are now excluded from pending tool tracking. This prevents synthetic tool results from being generated for calls that will be dropped by provider-specific converters (e.g., Codex drops tool calls from errored messages). Previously, this mismatch caused OpenAI to reject requests with 'No tool call found for function call output with call_id ...' errors. fixes #812	2026-01-17 20:20:39 +01:00
Pablo Tovar	cd43b8a9ca	fix: ensure max_tokens > thinking.budget_tokens for bedrock claude (#797) Bedrock Claude models require max_tokens to exceed thinking.budget_tokens. This constraint was handled for anthropic-messages API but missing for bedrock-converse-stream, causing compaction failures. Extracted adjustMaxTokensForThinking() helper that: - Adds thinking budget on top of desired output tokens - Reduces thinking budget if insufficient room (min 1024 output tokens) - Applied to both anthropic-messages and bedrock-converse-stream APIs	2026-01-17 10:55:30 +01:00
Mario Zechner	7f9cf543d5	Add [Unreleased] section for next cycle	2026-01-17 00:32:57 +01:00
Mario Zechner	5d3e7d5aaa	fix(ai): preserve unsigned tool call context for Gemini 3 with anti-mimicry note Instead of skipping unsigned tool calls entirely (which lobotomizes context), convert them to text with an explicit note telling the model this is historical context from a different model and not a format to mimic. This preserves tool call/result context when switching from providers without thought signatures (e.g. Claude via Antigravity) to Gemini 3.	2026-01-16 23:42:39 +01:00
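
Note on commit 25707f9ad4 (fixes #1038): the message separates rate limiting (429, a throughput limit) from context overflow (400/413, a size limit). The split can be sketched roughly as below. This is an illustration only, not the repository's code: the names ErrorAction, classifyHttpStatus and backoffDelayMs, the "other" fallback, and the default delay values are all assumptions made for this example.

```typescript
// Hypothetical sketch; none of these names are taken from the repository.
type ErrorAction = "retry-with-backoff" | "maybe-context-overflow" | "other";

function classifyHttpStatus(status: number): ErrorAction {
  // 429 limits requests/tokens per time period, so the request should be retried later.
  if (status === 429) return "retry-with-backoff";
  // 400 and 413 may indicate that a single request exceeded the context window.
  if (status === 400 || status === 413) return "maybe-context-overflow";
  // Anything else is left to whatever handling the caller already has.
  return "other";
}

// Exponential backoff: baseMs * 2^attempt, capped at maxMs (defaults are assumed values).
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 30000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}
```

Under these assumptions a caller would retry after backoffDelayMs(attempt) when the result is "retry-with-backoff", and would consider compaction only for "maybe-context-overflow".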

435 commits