# Lab 11: Prompt Caching & Cost Benchmarks
Difficulty: Intermediate · Estimated time: ~2–3 hours
## Objective
Implement caching for an LLM feature (CLI or API) and report measured improvements.
## Requirements
- A cache key that includes: model, prompt_version, normalized input
- A cache invalidation strategy (version bump or TTL)
- Print metrics: cache hit rate, p50 latency, and token deltas
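A cache key covering all three required fields can be built by hashing a canonical serialization of them. This is a minimal sketch; the normalization rule (lowercase, collapse whitespace) is one reasonable choice, not a mandated one:

```python
import hashlib
import json

def make_cache_key(model: str, prompt_version: str, raw_input: str) -> str:
    """Deterministic key from model, prompt_version, and normalized input."""
    # Normalize: lowercase and collapse runs of whitespace, so trivially
    # different phrasings of the same request share a cache entry.
    normalized = " ".join(raw_input.lower().split())
    payload = json.dumps(
        {"model": model, "prompt_version": prompt_version, "input": normalized},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()
```

Because `prompt_version` is part of the key, bumping the version is itself an invalidation strategy: old entries simply stop matching.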
## Suggested build
Create a CLI:
```bash
tds-ask "Explain hybrid search"
```
- first run: cache miss, hits the model
- second run: cache hit, returns from the local cache with no model call
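The miss-then-hit flow above can be sketched with a SQLite-backed lookup. Everything here is an assumption about your design: the table schema, the `get_or_call` name, and `call_model_stub` (a stand-in for a real LLM request) are all hypothetical:

```python
import sqlite3
import time

db = sqlite3.connect(":memory:")  # use a file path in the real CLI
db.execute(
    "CREATE TABLE IF NOT EXISTS cache (key TEXT PRIMARY KEY, answer TEXT, created_at REAL)"
)

def call_model_stub() -> str:
    # Stand-in for the real model request made on a cache miss.
    return "Hybrid search combines lexical and vector retrieval."

def get_or_call(db: sqlite3.Connection, key: str, call_model) -> tuple[str, bool]:
    """Return (answer, was_hit). On a miss, call the model and store the result."""
    row = db.execute("SELECT answer FROM cache WHERE key = ?", (key,)).fetchone()
    if row is not None:
        return row[0], True  # cache hit: no model call
    answer = call_model()    # cache miss: hits the model
    db.execute(
        "INSERT INTO cache (key, answer, created_at) VALUES (?, ?, ?)",
        (key, answer, time.time()),
    )
    db.commit()
    return answer, False

answer, hit = get_or_call(db, "demo-key", call_model_stub)  # first run: miss
answer, hit = get_or_call(db, "demo-key", call_model_stub)  # second run: hit
```

Storing `created_at` also leaves room for the TTL-based invalidation option: a lookup can treat rows older than the TTL as misses.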
## Deliverables
- `cache.py` (SQLite or similar)
- `ask.py` (the CLI)
- `REPORT.md` with:
  - methodology
  - cache hit rate
  - latency numbers
  - what you chose not to cache (and why)
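For the REPORT.md numbers, hit rate and p50 latency fall out of simple aggregation over the recorded runs. One way to compute them (function name and return shape are illustrative, not prescribed):

```python
import statistics

def summarize(latencies_ms: list[float], hits: int, total: int) -> dict:
    """Aggregate per-run measurements into the metrics REPORT.md asks for."""
    return {
        "hit_rate": hits / total,                      # fraction of runs served from cache
        "p50_latency_ms": statistics.median(latencies_ms),
    }
```

Token deltas would be computed the same way, by summing the tokens each hit avoided sending to the model, if your API client exposes token counts per call.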