Jordan Ramos
911d362ba2
Optimize for Claude Agent SDK: Memory, context, and model selection
## Memory & Context Optimizations
### agent.py
- MAX_CONTEXT_MESSAGES: 10 → 20 (better conversation coherence)
- MEMORY_RESPONSE_PREVIEW_LENGTH: 200 → 500 (richer memory storage)
- MAX_CONVERSATION_HISTORY: 50 → 100 (longer session continuity)
- search_hybrid max_results: 2 → 5 (better memory recall)
- System prompt: Now mentions tool count and flat-rate subscription
- Memory format: Changed "User (username)/Agent" to "username/Garvis"
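
A minimal sketch of how the new agent.py limits might be applied, for illustration only. The three constant names and values come from the list above; `MEMORY_SEARCH_MAX_RESULTS` and the helper functions `trim_history`, `context_window`, and `format_memory_entry` are hypothetical names, and the exact memory layout is an assumption based on the "username/Garvis" format.

```python
# Values from this commit; the constant names match agent.py as listed above.
MAX_CONTEXT_MESSAGES = 20             # was 10
MEMORY_RESPONSE_PREVIEW_LENGTH = 500  # was 200
MAX_CONVERSATION_HISTORY = 100        # was 50
MEMORY_SEARCH_MAX_RESULTS = 5         # hypothetical name for search_hybrid max_results (was 2)


def trim_history(history):
    """Keep only the most recent MAX_CONVERSATION_HISTORY messages (hypothetical helper)."""
    return history[-MAX_CONVERSATION_HISTORY:]


def context_window(history):
    """Return the most recent MAX_CONTEXT_MESSAGES messages to send to the model (hypothetical helper)."""
    return history[-MAX_CONTEXT_MESSAGES:]


def format_memory_entry(username, user_message, agent_response):
    """Build a memory entry in the assumed "username/Garvis" layout, truncating the response preview."""
    preview = agent_response[:MEMORY_RESPONSE_PREVIEW_LENGTH]
    return f"{username}: {user_message}\nGarvis: {preview}"
```

With these values, a 150-message session would be trimmed to its last 100 messages, and only the last 20 of those would be passed as context.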
### llm_interface.py
- Added claude_agent_sdk model (Sonnet) to defaults
- Mode-based model selection:
  - Agent SDK → Sonnet (best quality, flat-rate)
  - Direct API → Haiku (cheapest, pay-per-token)
- Updated logging to show active model
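
The mode-based selection above could look roughly like the following sketch. Only the `claude_agent_sdk` key and the Sonnet/Haiku pairing come from this commit; the `direct_api` key, the `select_model` function, the `use_agent_sdk` flag, and the short `"sonnet"`/`"haiku"` strings are placeholders, not the real model identifiers or the actual llm_interface.py code.

```python
import logging

logger = logging.getLogger("llm_interface")

# Placeholder model names; the real defaults would hold full model identifiers.
DEFAULT_MODELS = {
    "claude_agent_sdk": "sonnet",  # flat-rate subscription: favor quality
    "direct_api": "haiku",         # pay-per-token: favor cost (hypothetical key)
}


def select_model(use_agent_sdk: bool) -> str:
    """Pick the default model for the active mode and log the choice."""
    mode = "claude_agent_sdk" if use_agent_sdk else "direct_api"
    model = DEFAULT_MODELS[mode]
    logger.info("Active model: %s (mode: %s)", model, mode)
    return model
```

Keeping the mapping in one dictionary means a later change of default for either mode touches a single line.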
## SOUL.md Rewrite
- Added Garvis identity (name, email, role)
- Listed all 17 tools (12 were previously missing)
- Added "Critical Behaviors" section
- Emphasized flat-rate subscription benefits
- Added clear instructions to always check user profiles
## Benefits
With flat-rate Agent SDK:
- ✅ Use Sonnet for better reasoning (was Haiku)
- ✅ 2x context messages (10 → 20)
- ✅ 2.5x memory results (2 → 5)
- ✅ 2.5x richer memory previews (200 → 500 chars)
- ✅ Bot knows its name and all capabilities
- ✅ Zero marginal cost for thoroughness
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>