co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-16 06:02:42 +00:00

Author	SHA1	Message	Date
Mario Zechner	ee53b53689	fix(ai): enable xhigh for anthropic opus 4.6	2026-02-05 22:57:43 +01:00
Mario Zechner	c07277b9ac	fix(ai): set opus 4.6 context window to 200k	2026-02-05 22:25:26 +01:00
Mario Zechner	712d0c6ada	fix(ai,coding-agent): fix Bedrock Opus 4.6 model IDs, cache pricing, and add EU profile - Remove :0 suffix from Opus 4.6 Bedrock model IDs (not valid for this model) - Fix us/eu Opus 4.6 cache pricing (0.5/6.25 instead of 1.5/18.75) - Add missing eu.anthropic.claude-opus-4-6-v1 inference profile - Fix coding-agent default Bedrock model ID to match catalog	2026-02-05 22:21:22 +01:00
Mario Zechner	898ad73d8a	Add [Unreleased] section for next cycle	2026-02-05 21:21:19 +01:00
Mario Zechner	4c4d787b1a	feat(ai): add adaptive thinking support for Claude Opus 4.6 - Add adaptive thinking mode (type: 'adaptive') for Opus 4.6+ - Add effort parameter ('low', 'medium', 'high', 'max') for adaptive thinking - thinkingEnabled now auto-detects: adaptive for 4.6+, budget-based for older - streamSimple/completeSimple map ThinkingLevel to effort levels for Opus 4.6 - Add tests for Opus 4.6 adaptive thinking and GPT-5.3 Codex - Update @anthropic-ai/sdk to 0.73.0 - Update @aws-sdk/client-bedrock-runtime to 3.983.0 - Update @google/genai to 1.40.0 - Remove fast-xml-parser override (no longer needed)	2026-02-05 21:14:11 +01:00
Mario Zechner	b07b72ba2b	feat(ai): add xhigh thinking level support for gpt-5.3 models	2026-02-05 20:54:57 +01:00
Mario Zechner	b94c17885d	feat(ai): add Claude Opus 4.6 and GPT-5.3 Codex models	2026-02-05 20:34:56 +01:00
Mario Zechner	0404a93e33	Add [Unreleased] section for next cycle	2026-02-04 14:25:32 +01:00
Mario Zechner	150aeebf7d	fix(ai): respect codex baseUrl (closes #1244 )	2026-02-04 12:30:31 +01:00
Mario Zechner	6cc1676eae	Add [Unreleased] section for next cycle	2026-02-04 02:33:53 +01:00
Burak Varlı	be1d5a0299	chore(ai): clean up Bedrock-specific workarounds from `generate-models.ts` We had some workarounds in `generate-models.ts` initially - mainly to make cross-region inference work for Amazon Bedrock provider, but now these are upstreamed into models.dev and we no longer need those.	2026-02-03 22:04:49 +00:00
Mario Zechner	c983bfdb1e	Add [Unreleased] section for next cycle	2026-02-03 17:30:37 +01:00
Mario Zechner	2e1c5ebdee	fix(ai): relax xhigh model check fixes #1209	2026-02-03 13:03:46 +01:00
Mario Zechner	ff9a3f0660	Add [Unreleased] section for next cycle	2026-02-03 02:19:00 +01:00
Mario Zechner	0aa0b5fdba	Add [Unreleased] section for next cycle	2026-02-02 19:36:52 +01:00
Mario Zechner	111a31e4db	fix(ai): apply cache_control to string user messages	2026-02-02 19:19:12 +01:00
Mario Zechner	419c07fb19	Add [Unreleased] section for next cycle	2026-02-02 00:51:29 +01:00
Mario Zechner	ff0eb3ecd4	fix(ai): omit strict for unsupported openai completions	2026-02-02 00:44:55 +01:00
Mario Zechner	469fb5d27c	fix(ai): OAuth login/refresh now respects HTTP proxy env vars Extracted HTTP proxy setup to shared module and imported it from both stream.ts and oauth/index.ts. This ensures fetch() calls during OAuth flows (token exchange, refresh, project discovery) go through the proxy. fixes #1132	2026-02-01 19:08:13 +01:00
Mario Zechner	8f7ef85833	fix(ai): pass through cacheRetention in buildBaseOptions fixes #1154	2026-02-01 17:37:45 +01:00
Mario Zechner	abfd04b5c5	feat(ai): add cacheRetention stream option	2026-02-01 09:32:10 +01:00
Mario Zechner	e9ca0be769	feat(ai): add PI_AI_ANTIGRAVITY_VERSION env var override Allows users to override the Antigravity User-Agent version when Google updates their version requirements, avoiding the need to wait for a package release. Fixes #1129	2026-02-01 09:32:10 +01:00
Mario Zechner	aa83170e0f	Add [Unreleased] section for next cycle	2026-02-01 02:34:06 +01:00
4h9fbZ	993c45a059	feat(coding-agent): add Qwen CLI OAuth provider	2026-02-01 01:51:55 +01:00
Mario Zechner	030a61d88c	feat: add maxDelayMs setting to cap server-requested retry delays When a provider (e.g., Google Gemini CLI) requests a retry delay longer than maxDelayMs (default: 60s), the request fails immediately with an informative error instead of waiting silently for hours. The error is then handled by agent-level auto-retry, which shows the delay to the user and allows aborting with Escape. - Add maxRetryDelayMs to StreamOptions (packages/ai) - Add maxRetryDelayMs to AgentOptions (packages/agent) - Add retry.maxDelayMs to settings (packages/coding-agent) - Update _isRetryableError to match 'retry delay' errors fixes #1123	2026-02-01 00:50:41 +01:00
Mario Zechner	0091857f8b	Add [Unreleased] section for next cycle	2026-01-30 11:48:16 +01:00
Mario Zechner	2cee7e17de	Add [Unreleased] section for next cycle	2026-01-30 03:27:09 +01:00
Ben Vargas	e045a9f142	feat(ai): add Vercel AI Gateway routing support (#1051 ) * feat(ai): add Vercel AI Gateway routing support Add vercelGatewayRouting to OpenAICompletionsCompat, parallel to openRouterRouting. When a model targets ai-gateway.vercel.sh and has vercelGatewayRouting configured, the openai-completions provider passes providerOptions.gateway with only/order in the request body. Changes: - types.ts: VercelGatewayRouting interface + field on OpenAICompletionsCompat - openai-completions.ts: buildParams passes providerOptions.gateway, detectCompat/getCompat include the new field - model-registry.ts: VercelGatewayRoutingSchema for models.json validation - test: updated Required<OpenAICompletionsCompat> in test fixture * docs(coding-agent): add vercelGatewayRouting to custom models documentation	2026-01-30 01:44:51 +01:00
Mario Zechner	af813f9048	fix(ai): default tool call arguments fixes #1065	2026-01-30 01:13:16 +01:00
Mario Zechner	52532c7c00	fix(ai): update Antigravity User-Agent to 1.15.8 fixes #1079	2026-01-29 23:10:49 +01:00
Mario Zechner	87ab5c5c3b	feat(ai): add Kimi For Coding provider support - Add kimi-coding provider using Anthropic Messages API - API endpoint: https://api.kimi.com/coding/v1 - Environment variable: KIMI_API_KEY - Models: kimi-k2-thinking (text), k2p5 (text + image) - Add context overflow detection pattern for Kimi errors - Add tests for all standard test suites	2026-01-29 04:12:28 +01:00
Mario Zechner	c808de605a	feat(ai): add Hugging Face provider support - Add huggingface to KnownProvider type - Add HF_TOKEN env var mapping - Process huggingface models from models.dev (14 models) - Use openai-completions API with compat settings - Add tests for all provider test suites - Update documentation fixes #994	2026-01-29 02:40:14 +01:00
Mario Zechner	1b6a147579	feat(ai): add PI_CACHE_RETENTION env var for extended prompt caching Adds support for extended cache retention via PI_CACHE_RETENTION=long: - Anthropic: 5m -> 1h TTL - OpenAI: in-memory -> 24h retention Only applies to direct API calls (api.anthropic.com, api.openai.com). Proxies and other providers are unaffected. fixes #967	2026-01-29 02:22:06 +01:00
Mario Zechner	605f6f494b	fix(ai): normalize pipe-separated tool call IDs for cross-provider handoff - Handle pipe-separated IDs from OpenAI Responses API in openai-completions provider - Strip trailing underscores after truncation in openai-responses-shared (OpenAI Codex rejects them) - Add regression tests for tool call ID normalization fixes #1022	2026-01-29 01:28:12 +01:00
Mario Zechner	25707f9ad4	fix(ai): 429 rate limit errors no longer trigger auto-compaction 429 (Too Many Requests) was incorrectly classified as context overflow, triggering compaction instead of retry with backoff. The original logic assumed token-based rate limiting correlates with context overflow, but these are different concepts: - Rate limiting (429): requests/tokens per time period (throughput) - Context overflow: single request exceeds context window (size) Now 429 errors are handled by the existing retry logic with exponential backoff, while 400/413 remain as potential context overflow indicators. fixes #1038	2026-01-29 00:43:38 +01:00
Mario Zechner	8b5c81f21f	fix(ai): preserve input token counts from message_start in Anthropic provider Proxies like Portkey omit input_tokens in message_delta events (it's nullable per the SDK). The previous code unconditionally overwrote usage fields, causing input token counts to reset to 0. Now only updates usage fields when they are present (not null), preserving the correct input_tokens value captured from message_start. Fixes #1045	2026-01-29 00:06:51 +01:00
Mario Zechner	4f9deddd47	fix(ai): detect DeepSeek URLs and disable unsupported developer role fixes #1048	2026-01-28 23:55:54 +01:00
mom	ee7c0a7d18	fix(ai): handle sensitive stop_reason from Anthropic API (fixes #978 )	2026-01-28 02:18:16 +00:00
williamtwomey	41d2c7ff38	OpenAI completions toolChoice fix (#998 ) * openai completions tools fix * Reset generated file --------- Co-authored-by: williamtwomey <ai@shadylawn.net>	2026-01-28 03:03:15 +01:00
Daniel Tatarkin	9f3eef65f8	fix(ai): filter deprecated OpenCode models from generation (#970 ) Add status === 'deprecated' check for OpenCode Zen models, matching the existing pattern used for GitHub Copilot models. This removes deprecated models like glm-4.7-free and minimax-m2.1-free from the generated model catalog.	2026-01-26 23:56:13 +01:00
Mario Zechner	2bbc1a7659	Add [Unreleased] section for next cycle	2026-01-26 16:55:24 +01:00
Mario Zechner	9b903656ae	fix(ai): correct provider error message typo (closes #958 )	2026-01-26 15:37:57 +01:00
Mario Zechner	676de103e1	Update models/package-lock.json after adding deps	2026-01-25 19:29:02 +01:00
haoqixu	1e718e63ea	feat(ai): Support HTTP proxy through environment variables	2026-01-25 15:52:45 +08:00
jake	dac7474da2	feat(ai): add OpenRouter provider routing support Allows custom models to specify which upstream providers OpenRouter should route requests to via the `openRouterRouting` field in model definitions. Supported fields: - `only`: list of provider slugs to exclusively use - `order`: list of provider slugs to try in order	2026-01-25 03:34:49 +01:00
Mario Zechner	a6d878e804	fix(ai): default tool call arguments to empty object for Google providers When Google providers return tool calls without an args field (common for no-argument tools), the arguments field was undefined. This breaks subsequent API calls that require tool_use.input to be present. Now defaults to {} when args is missing. Related: clawdbot/clawdbot#1509	2026-01-25 03:18:02 +01:00
Mario Zechner	0d24ddbb03	fix(ai): use model.api instead of hardcoding api type in streaming functions - anthropic.ts: use model.api instead of hardcoding 'anthropic-messages' - openai-responses.ts: use model.api instead of hardcoding 'openai-responses' - gitlab-duo: simplify to use actual model IDs, export MODELS array	2026-01-25 00:52:34 +01:00
Mario Zechner	177c694406	feat: custom provider support with streamSimple - Add resetApiProviders() to clear and re-register built-in providers - Add createAssistantMessageEventStream() factory for extensions - Add streamSimple support in ProviderConfig for custom API implementations - Call resetApiProviders() on /reload to clean up extension providers - Add custom-provider.md documentation - Add custom-provider.ts example with full Anthropic implementation - Update extensions.md with streamSimple config option	2026-01-24 23:15:11 +01:00
Mario Zechner	c725135a76	refactor(ai): register api providers	2026-01-24 23:15:11 +01:00
Mario Zechner	3256d3c083	refactor(oauth): add provider registry	2026-01-24 23:15:11 +01:00

1 2 3 4 5 ...

466 commits