Lab 15: Polars + DuckDB Local Warehouse
Difficulty: Intermediate · Estimated time: ~3–5 hours
Objective
Build a local warehouse workflow:
- ingest raw CSV/JSON logs
- transform with Polars
- write Parquet
- query features with DuckDB
Requirements
- A reproducible pipeline script (make_dataset.py)
- A DuckDB query file (features.sql)
- Output: features.parquet
Deliverables
- make_dataset.py
- features.sql
- features.parquet
- REPORT.md with performance notes
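For the performance notes, one simple approach is to time each pipeline stage and paste the results into REPORT.md. The sketch below uses only the standard library; the stage names and the helpers `timed` and `format_report` are hypothetical, and wall-clock timing is just one of several reasonable measurements.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(stage: str, sink: dict):
    """Record the wall-clock duration of the enclosed block under sink[stage]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        sink[stage] = time.perf_counter() - start


def format_report(timings: dict) -> str:
    """Render the timings as a Markdown bullet list for REPORT.md."""
    return "\n".join(
        f"- {stage}: {seconds:.3f} s" for stage, seconds in timings.items()
    )


if __name__ == "__main__":
    timings: dict = {}
    with timed("ingest", timings):
        time.sleep(0.01)  # placeholder for the real ingest step
    with timed("query", timings):
        time.sleep(0.01)  # placeholder for the real DuckDB query
    print(format_report(timings))
```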