co-mono

mirror of https://github.com/getcompanion-ai/co-mono.git synced 2026-04-16 16:00:58 +00:00

Author	SHA1	Message	Date
haoqixu	1e718e63ea	feat(ai): Support HTTP proxy through environment variables	2026-01-25 15:52:45 +08:00
Mario Zechner	c725135a76	refactor(ai): register api providers	2026-01-24 23:15:11 +01:00
Markus Ylisiurunen	856012296b	add Azure OpenAI Responses provider with deployment-aware model mapping	2026-01-24 12:04:34 +01:00
Danila Poyarkov	6d0c544e18	fix: Bun compatibility for build scripts and runtime detection	2026-01-23 19:31:16 +03:00
Mario Zechner	b712d1ca43	fix(ai, web-ui): browser compatibility for pi-ai, update tsgo for decorator support - Update @typescript/native-preview to 7.0.0-dev.20260120.1 (supports experimentalDecorators) - Replace top-level node:fs, node:os, node:path imports with dynamic imports in stream.ts - Replace top-level node:os import with dynamic import in openai-codex-responses.ts - Replace top-level node:crypto, node:http imports with dynamic imports in openai-codex.ts - Replace Buffer.from with atob for browser-compatible base64 decoding fixes #873	2026-01-22 01:33:46 +01:00
Mario Zechner	d2be6486a4	feat(ai): add headers option to StreamOptions for custom HTTP headers - Added headers field to base StreamOptions interface - Updated all providers to merge options.headers with defaults - Forward headers and onPayload through streamSimple/completeSimple - Bedrock not supported (uses AWS SDK auth)	2026-01-20 01:08:31 +01:00
Mario Zechner	2b04aefa6d	feat(ai): add AWS ECS/IRSA credential detection for Bedrock, fixes #848 Added support for additional AWS credential environment variables: - AWS_CONTAINER_CREDENTIALS_RELATIVE_URI (ECS task roles) - AWS_CONTAINER_CREDENTIALS_FULL_URI (ECS task roles) - AWS_WEB_IDENTITY_TOKEN_FILE (IRSA for Kubernetes) Also fixed undefined currentModel variable in OAuth error handling.	2026-01-19 16:10:10 +01:00
Pablo Tovar	cd43b8a9ca	fix: ensure max_tokens > thinking.budget_tokens for bedrock claude (#797 ) Bedrock Claude models require max_tokens to exceed thinking.budget_tokens. This constraint was handled for anthropic-messages API but missing for bedrock-converse-stream, causing compaction failures. Extracted adjustMaxTokensForThinking() helper that: - Adds thinking budget on top of desired output tokens - Reduces thinking budget if insufficient room (min 1024 output tokens) - Applied to both anthropic-messages and bedrock-converse-stream APIs	2026-01-17 10:55:30 +01:00
Jian Zhang	558a77b45f	feat(ai): add support for MiniMax China (minimax-cn) provider (#725 ) Co-authored-by: Jian Zhang <jzhang@yanhuangdata.com>	2026-01-14 15:41:47 +01:00
Timo Lins	65eb738c90	Rename to `vercel-ai-gateway` for clarity	2026-01-13 16:42:34 +01:00
Timo Lins	164a69a601	Add Vercel AI Gateway support	2026-01-13 16:42:34 +01:00
Mario Zechner	8af8d0d672	Add MiniMax provider support (#656 by @dannote) - Add minimax to KnownProvider and Api types - Add MINIMAX_API_KEY to getEnvApiKey() - Generate MiniMax-M2 and MiniMax-M2.1 models - Add context overflow detection pattern - Add tests to all required test files - Update README and CHANGELOG with attribution Also fixes: - Bedrock duplicate toolResult ID when content has multiple blocks - Sandbox extension unused parameter lint warning	2026-01-13 02:27:09 +01:00
Mario Zechner	fd268479a4	feat(ai): Add Amazon Bedrock provider (#494 ) Adds support for Amazon Bedrock with Claude models including: - Full streaming support via Converse API - Reasoning/thinking support for Claude models - Cross-region inference model ID handling - Multiple AWS credential sources (profile, IAM keys, API keys) - Image support in messages and tool results - Unicode surrogate sanitization Also adds 'Adding a New Provider' documentation to AGENTS.md and README. Co-authored-by: nickchan2 <nickchan2@users.noreply.github.com>	2026-01-13 00:32:59 +01:00
jhyang	d2882c2643	Resolve os.homedir() lazily instead of at module load time - Move homedir() calls into functions for lazy evaluation - Add GOOGLE_APPLICATION_CREDENTIALS support for Vertex AI	2026-01-09 16:09:54 +08:00
Mario Zechner	97d0189eae	Add OpenCode Zen provider support	2026-01-09 06:58:20 +01:00
Melih Mucuk	0f27eae77e	feat: add thinkingBudgets option to customize token budgets	2026-01-07 15:13:26 +03:00
Mario Zechner	edb0da9611	feat(ai,agent,coding-agent): add sessionId for provider session-based caching - Add sessionId to StreamOptions for providers that support session-based caching - OpenAI Codex provider uses sessionId for prompt_cache_key and routing headers - Agent class now accepts and forwards sessionId to stream functions - coding-agent passes session ID from SessionManager and updates on session changes - Update ai package README with table of contents, OpenAI Codex OAuth docs, and env vars table - Increase Codex instructions cache TTL from 15 minutes to 24 hours - Add tests for sessionId forwarding in ai and agent packages	2026-01-06 11:08:42 +01:00
Mario Zechner	0b9e3ada0c	fix: clean up Codex thinking level handling - Remove per-thinking-level model variants (gpt-5.2-codex-high, etc.) - Remove thinkingLevels from Model type - Provider clamps reasoning effort internally - Omit reasoning field when thinking is off fixes #472	2026-01-05 21:58:26 +01:00
Mario Zechner	9a147559c0	Merge branch 'openai-codex'	2026-01-05 05:33:48 +01:00
Anton Kuzmenko	63093bf7e4	fix(ai): cache vertex adc credentials file check	2026-01-04 17:16:40 -08:00
Anton Kuzmenko	a3f30e085a	fix(ai): add vertex ai dummy value for configured credentials	2026-01-04 12:59:19 -08:00
Ahmed Kamal	1650041a63	feat(ai): add OpenAI Codex OAuth + responses provider	2026-01-04 21:11:19 +02:00
Mario Zechner	c9a85342ea	Fix google-vertex models showing without auth configured	2026-01-03 17:09:02 +01:00
Mario Zechner	8df22faedf	fix(ai): ensure maxTokens > thinkingBudget for Claude thinking models Claude requires max_tokens > thinking.budget_tokens. When caller specifies a small maxTokens (e.g. compaction with ~13k tokens) and reasoning is enabled with high budget (16k tokens), the constraint was violated. Fix: In mapOptionsForApi, add thinkingBudget on top of caller's maxTokens (capped at model.maxTokens). If still not enough room, reduce thinkingBudget to leave space for output. Applied to both anthropic-messages and google-gemini-cli APIs. Also adds test utilities for OAuth credential resolution and tests for compaction with thinking models. fixes #413	2026-01-03 02:45:30 +01:00
Anton Kuzmenko	3b61d83d29	Fix google-vertex build	2026-01-03 01:11:03 +01:00
Anton Kuzmenko	214e7dae15	Add Vertex AI provider with ADC support - Implement google-vertex provider in packages/ai - Support ADC (Application Default Credentials) via @google/generative-ai - Add Gemini model catalog for Vertex AI - Update packages/coding-agent to handle google-vertex provider	2026-01-03 01:11:03 +01:00
Mario Zechner	ecd240f636	Define own GoogleThinkingLevel type instead of importing from @google/genai - Add GoogleThinkingLevel type mirroring Google's ThinkingLevel enum - Update GoogleGeminiCliOptions and GoogleOptions to use our type - Cast to any when assigning to Google SDK's ThinkingConfig	2025-12-30 22:42:25 +01:00
Mario Zechner	251fea752c	Fix API key priority and compaction bugs - getEnvApiKey: ANTHROPIC_OAUTH_TOKEN now takes precedence over ANTHROPIC_API_KEY - findCutPoint: Stop scan-backwards loop at session header (was decrementing past it causing null preparation) - generateSummary/generateTurnPrefixSummary: Throw on stopReason=error instead of returning empty string - Test files: Fix API key priority order, use keepRecentTokens=1 for small test conversations	2025-12-30 22:42:17 +01:00
Mario Zechner	9f97f0c8da	getApiKeyFromEnv -> getEnvApiKey	2025-12-25 02:38:10 +01:00
Mario Zechner	030788140a	WIP: Remove global state from pi-ai OAuth/API key handling - Remove setApiKey, resolveApiKey, and global apiKeys Map from stream.ts - Rename getApiKey to getApiKeyFromEnv (only checks env vars) - Remove OAuth storage layer (storage.ts deleted) - OAuth login/refresh functions now return credentials instead of saving - getOAuthApiKey/refreshOAuthToken now take credentials as params - Add test/oauth.ts helper for ai package tests - Simplify root npm run check (single biome + tsgo pass) - Remove redundant check scripts from most packages - Add web-ui and coding-agent examples to biome/tsgo includes coding-agent still has compile errors - needs refactoring for new API	2025-12-25 01:01:03 +01:00
Mario Zechner	d93cbf8c32	WIP: remove setApiKey, resolveApiKey	2025-12-24 23:34:23 +01:00
Luke Foster	ee9b498380	Add Gemini 3 preview models to google-gemini-cli provider - Add gemini-3-pro-preview and gemini-3-flash-preview to Cloud Code Assist - Handle thinkingLevel config for Gemini 3 (vs thinkingBudget for Gemini 2.x) - Gemini 3 Pro: LOW/HIGH levels only - Gemini 3 Flash: all four levels (MINIMAL/LOW/MEDIUM/HIGH)	2025-12-20 22:10:47 -06:00
Mario Zechner	c359023c3f	Add Google Gemini CLI and Antigravity OAuth providers - Add google-gemini-cli provider: free Gemini 2.0/2.5 via Cloud Code Assist - Add google-antigravity provider: free Gemini 3, Claude, GPT-OSS via sandbox - Move OAuth infrastructure from coding-agent to ai package - Fix thinking signature handling for cross-model handoff - Fix OpenAI message ID length limit (max 64 chars) - Add GitHub Copilot overflow pattern detection - Add OAuth provider tests for context overflow and streaming	2025-12-20 21:34:18 +01:00
Mario Zechner	36e17933d5	feat(ai): add Google Cloud Code Assist provider - Add new API type 'google-cloud-code-assist' for Gemini CLI / Antigravity auth - Extract shared Google utilities to google-shared.ts - Implement streaming provider for Cloud Code Assist endpoint - Add 7 models: gemini-3-pro-high/low, gemini-3-flash, claude-sonnet/opus, gpt-oss Models use OAuth authentication and have sh cost (uses Google account quota). OAuth flow will be implemented in coding-agent in a follow-up.	2025-12-20 10:20:30 +01:00
Mario Zechner	7e38897673	feat: add xhigh thinking level support for gpt-5.2 models - Add supportsXhigh() function to ai package for checking xhigh support - Clamp xhigh to high for OpenAI models that don't support it - Update coding-agent to use centralized supportsXhigh() - gpt-5.2, gpt-5.2-codex now show xhigh in thinking selector Closes #236	2025-12-19 20:07:24 +01:00
Markus Ylisiurunen	d690310587	Fix Gemini 3 Flash Preview thinking levels (#212 ) * use the correct Gemini 3 Flash Preview thinking levels * fix a build error * add changelog entry * regenerate models * make less assumptions about future models	2025-12-18 13:03:28 +01:00
Mario Zechner	fbda78bfb3	Fix reasoning disabled by default for all providers Previously, when reasoning was not specified, some providers like Gemini with 'dynamic thinking' enabled by default would still use thinking. Now explicitly sets thinkingEnabled: false (Anthropic) and thinking: { enabled: false } (Google) when reasoning is undefined. Closes #180	2025-12-15 22:42:08 +01:00
cau1k	ccae7a4e0e	feat: initial impl - add GitHub Copilot model discovery (env token fallback, headers, compat) plus fallback list and quoted provider keys in generated map - surface Copilot provider end-to-end (KnownProvider/default, env+OAuth token refresh/save, enterprise base URL swap, available only when creds/env exist) - tweak interactive OAuth UI to render instruction text and prompt placeholders gpt-5.2-high took about 35 minutes. It had a lot of trouble with `npm check` and went off on a "let's adjust every tsconfig" side quest. Device code flow works, but the ai/scripts/generate-models.ts impl is wrong as models from months ago are missing and only those deprecated are accessible in the /models picker.	2025-12-14 17:18:13 -05:00
Markus Ylisiurunen	6b48fa58d7	Support thinking level configuration for Gemini 3 Pro models (#176 ) * support Google thinking level configuration for Gemini 3 Pro models * relax model ID check for gemini 3 pro	2025-12-13 02:09:54 +01:00
Mario Zechner	99b4b1aca0	Add Mistral as AI provider - Add Mistral to KnownProvider type and model generation - Implement Mistral-specific compat handling in openai-completions: - requiresToolResultName: tool results need name field - requiresAssistantAfterToolResult: synthetic assistant message between tool/user - requiresThinkingAsText: thinking blocks as <thinking> text - requiresMistralToolIds: tool IDs must be exactly 9 alphanumeric chars - Add MISTRAL_API_KEY environment variable support - Add Mistral tests across all test files - Update documentation (README, CHANGELOG) for both ai and coding-agent packages - Remove client IDs from gemini.md, reference upstream source instead Closes #165	2025-12-10 20:36:19 +01:00
Mario Zechner	00370cab39	Add xhigh thinking level for OpenAI codex-max models - Add 'xhigh' to ThinkingLevel type in ai and agent packages - Map xhigh to reasoning_effort: 'max' for OpenAI providers - Add thinkingXhigh color token to theme schema and built-in themes - Show xhigh option only when using codex-max models - Update CHANGELOG for both ai and coding-agent packages closes #143	2025-12-08 21:12:54 +01:00
Mario Zechner	cac353b3fe	Limit max output tokens to 32k	2025-10-30 15:47:36 +01:00
Mario Zechner	b6b64dff86	Better proxy handling.	2025-10-28 00:21:54 +01:00
Mario Zechner	35fe8f21e9	feat(ai): Implement Zod-based tool validation and improve Agent API - Replace JSON Schema with Zod schemas for tool parameter definitions - Add runtime validation for all tool calls at provider level - Create shared validation module with detailed error formatting - Update Agent API with comprehensive event system - Add agent tests with calculator tool for multi-turn execution - Add abort test to verify proper handling of aborted requests - Update documentation with detailed event flow examples - Rename generate.ts to stream.ts for clarity	2025-09-09 14:58:54 +02:00

44 commits