Medallion Architecture
Last updated
Last updated
Typically just a raw copy of ingested data.
Replaces traditional data lake.
Provides efficient storage and querying of unprocessed history of data.
Reduces data storage complexity, latency, and redundancy.
Optimizes ETL throughput and analytic query performance.
Preserves grain of original data.
Eliminates Duplicate records.
Production schema is enforced.
Data quality checks and corrupt data are quarantined.
Powers ML applications, reporting, dashboards, and ad-hoc analytics.
Refined views of data, typically with aggregations.
Optimizes query performance for business-critical data.
Data Engineer
Data Analysts Data Scientists