mirror of
https://github.com/getcompanion-ai/co-mono.git
synced 2026-04-22 02:03:42 +00:00
fix(ai): 429 rate limit errors no longer trigger auto-compaction
429 (Too Many Requests) was incorrectly classified as context overflow, triggering compaction instead of retry with backoff. The original logic assumed token-based rate limiting correlates with context overflow, but these are different concepts: - Rate limiting (429): requests/tokens per time period (throughput) - Context overflow: single request exceeds context window (size) Now 429 errors are handled by the existing retry logic with exponential backoff, while 400/413 remain as potential context overflow indicators. fixes #1038
This commit is contained in:
parent
a373dce747
commit
25707f9ad4
2 changed files with 8 additions and 7 deletions
|
|
@ -4,6 +4,7 @@
|
|||
|
||||
### Fixed
|
||||
|
||||
- Fixed 429 rate limit errors incorrectly triggering auto-compaction instead of retry with backoff ([#1038](https://github.com/badlogic/pi-mono/issues/1038))
|
||||
- Fixed Anthropic provider to handle `sensitive` stop_reason returned by API ([#978](https://github.com/badlogic/pi-mono/issues/978))
|
||||
- Fixed DeepSeek API compatibility by detecting `deepseek.com` URLs and disabling unsupported `developer` role ([#1048](https://github.com/badlogic/pi-mono/issues/1048))
|
||||
- Fixed Anthropic provider to preserve input token counts when proxies omit them in `message_delta` events ([#1045](https://github.com/badlogic/pi-mono/issues/1045))
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue