From f36299ad3a36384c909b867a8b27e74d0898ece0 Mon Sep 17 00:00:00 2001
From: Mario Zechner
Date: Sun, 10 Aug 2025 10:33:59 +0200
Subject: [PATCH] docs: Add todo for automatic context length detection

Document provider support for context length via models endpoint and
caching strategy for model metadata
---
 todos/todos.md | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/todos/todos.md b/todos/todos.md
index ae620448..9f44a507 100644
--- a/todos/todos.md
+++ b/todos/todos.md
@@ -3,6 +3,17 @@
 - agent: ultrathink to temporarily set reasoning_effort?
 - agent: need to figure out a models max context lenght
+  - Add automatic context length detection via models endpoint
+  - Cache per baseURL/model combination in $PI_CONFIG_DIR/models.json or ~/.pi/models.json
+  - Should be part of preflight check in agent (like reasoning support detection)
+  - Provider support status:
+    - vLLM: ✅ `/v1/models` → `max_model_len`
+    - Groq: ✅ `/openai/v1/models` → `context_window`
+    - OpenRouter: ✅ `/api/v1/models` → `context_length`
+    - Gemini: ✅ `/v1beta/models` (native API) → `inputTokenLimit`
+    - Anthropic: ❌ `/v1/models` (no context info)
+    - OpenAI: ❌ `/v1/models` (no context info)
+  - For Anthropic/OpenAI, may need hardcoded fallback values or separate lookup table
 - agent: compaction & micro compactionexi