Lab 15: Polars + DuckDB Local Warehouse

Difficulty: Intermediate · Estimated time: ~3–5 hours

Objective

Build a local data warehouse workflow with four stages:

  • ingest raw CSV/JSON logs
  • transform with Polars
  • write Parquet
  • query features with DuckDB

Requirements

  • A reproducible pipeline script (make_dataset.py)
  • A DuckDB query file (features.sql)
  • An output feature file (features.parquet)

Deliverables

  • make_dataset.py
  • features.sql
  • features.parquet
  • REPORT.md with performance notes
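
For the performance notes, one simple approach is to time each pipeline stage and paste the result into REPORT.md. The sketch below uses only the standard library; the helper names timed and render_report, the stage labels, and the table layout are assumptions for illustration, not a required format.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(stage: str, timings: dict[str, float]):
    # Record the wall-clock duration of the enclosed block under `stage`.
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[stage] = time.perf_counter() - start


def render_report(timings: dict[str, float]) -> str:
    # Render the timings as a Markdown table suitable for REPORT.md.
    lines = ["| Stage | Seconds |", "| --- | ---: |"]
    for stage, seconds in timings.items():
        lines.append(f"| {stage} | {seconds:.3f} |")
    return "\n".join(lines)


if __name__ == "__main__":
    timings: dict[str, float] = {}
    with timed("ingest", timings):
        sum(range(1_000_000))  # placeholder for the real ingest step
    with timed("query", timings):
        sorted(range(100_000), reverse=True)  # placeholder for the real query
    print(render_report(timings))
```

Wall-clock timings from a single run are noisy, so repeating each stage a few times and noting the dataset size alongside the numbers makes the report easier to interpret.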