Add performance features: caching, cost tracking, retry, compaction, classification, scrubbing

Inspired by zeroclaw's lightweight patterns for slow hardware:
- Response cache (SQLite + SHA-256 keyed) to skip redundant LLM calls
- History compaction — LLM-summarize old messages when history exceeds 50
- Query classifier routes simple/research queries to cheaper models
- Credential scrubbing removes secrets from tool output before sending to LLM
- Cost tracker with daily/monthly budget enforcement (SQLite)
- Resilient provider with retry + exponential backoff + fallback provider
- Approval engine gains session "always allow" and audit log

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

This commit is contained in:

Kaloyan Danchev

2026-02-19 09:20:52 +02:00

parent e24e3026b6

commit 872ed24f0c

10 changed files with 694 additions and 17 deletions

									
										1

pyproject.toml
									
												View File
												
				@@ -19,6 +19,7 @@ dependencies = [

				    "json-repair>=0.30.0",

				    "duckduckgo-search>=7.0.0",

				    "pypdf>=5.0.0",

				    "aiosqlite>=0.20.0",

				]

				[project.scripts]

Add performance features: caching, cost tracking, retry, compaction, classification, scrubbing

1 pyproject.toml Unescape Escape View File

1

pyproject.toml

View File