Simplify model tiers: Basic=Ollama, Normal=Haiku, Complex=Sonnet

2026-02-05 10:36:52 -07:00
parent 30658a9cff
commit a029790483
1 changed files with 15 additions and 17 deletions
--- a/workspace/OPTIMIZATION.md
+++ b/workspace/OPTIMIZATION.md
@@ -14,36 +14,34 @@

 ---

-## MODEL SELECTION
+## MODEL SELECTION - Three Tiers

-### System Default: Haiku
-
-Haiku is your primary model. It decides routing internally:
-
-**Haiku routes to Ollama when:**
+### Basic → Ollama
 - File checking and organization
 - Simple templating/formatting
 - Log review and cleanup
 - Non-critical analysis
 - Routine status checks

-**Haiku handles directly when:**
- Most tasks fit within Haiku's capability
- Reasoning is needed but not deeply complex
+**Cost:** Free (local)
+
+### Normal → Haiku (Default)
+- Most tasks
 - Code review (non-production)
 - Documentation and writing
+- General problem solving
+- Straightforward reasoning

-**Haiku escalates to Sonnet when:**
+**Cost:** ~$0.30-1.50 per 1M tokens
+
+### Complex → Sonnet
 - Architecture decisions
- Production-like code review
+- Production code review
 - Security analysis
 - Complex debugging/reasoning
 - Strategic multi-project decisions

+**Cost:** ~$3-15 per 1M tokens
+
 ### Heartbeat: Ollama Only
-
-Heartbeats ALWAYS use Ollama. No escalation. If Ollama fails, the heartbeat fails.
-
-### Decision Rule
-
-Let Haiku decide. It's smart enough to route to Ollama when appropriate and escalate to Sonnet when needed. You only override when you know you need Sonnet upfront.
+Heartbeats ALWAYS use Ollama. No escalation. If it fails, it fails.