Enterprise Data Platform Architecture
End-to-end data platform showing sources, ingestion, data lake layers, data warehouse layers, and consumption
AWS Cloud
Data Sources
COS / Files
RDBMS
APIs
NoSQL
Kafka / Events
Data Lake (S3)
EMR / Spark
PySpark
Landing Zone
Raw / As-Is data
Parquet · Hudi
Bronze Layer (As-Is)
Silver Layer (Cleansed)
Gold Layer (Latest Version)
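
The lake layers named in the diagram (Bronze as-is, Silver cleansed, Gold latest version) could be read roughly as the following flow. This is a minimal sketch in plain Python, not the platform's actual implementation: the diagram indicates PySpark on EMR with Parquet and Hudi, and the record fields (`id`, `name`, `updated_at`), the cleansing rules, and the function names here are all hypothetical and chosen only for illustration.

```python
from datetime import datetime

# Hypothetical records as they might arrive in the landing zone (assumed shape).
landing = [
    {"id": 1, "name": " Alice ", "updated_at": "2024-01-01T10:00:00"},
    {"id": 1, "name": "Alice B", "updated_at": "2024-02-01T09:30:00"},
    {"id": 2, "name": None, "updated_at": "2024-01-15T12:00:00"},
    {"id": 3, "name": "Carol", "updated_at": "2024-01-20T08:00:00"},
]


def to_bronze(records):
    """Bronze (As-Is): copy the landed records without changing them."""
    return [dict(r) for r in records]


def to_silver(bronze):
    """Silver (Cleansed): assumed rules are to drop rows with no name,
    trim whitespace, and parse the timestamp string."""
    silver = []
    for r in bronze:
        if r["name"] is None:
            continue
        silver.append({
            "id": r["id"],
            "name": r["name"].strip(),
            "updated_at": datetime.fromisoformat(r["updated_at"]),
        })
    return silver


def to_gold(silver):
    """Gold (Latest Version): keep only the most recent row per id."""
    latest = {}
    for r in silver:
        current = latest.get(r["id"])
        if current is None or r["updated_at"] > current["updated_at"]:
            latest[r["id"]] = r
    return sorted(latest.values(), key=lambda r: r["id"])


bronze = to_bronze(landing)
silver = to_silver(bronze)
gold = to_gold(silver)
```

In a table format such as Hudi, the "latest version" step would more likely be handled by upserts on a record key than by an explicit loop; the loop only makes the intended result visible.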
dbt
Snowflake
Persistent Staging Layer
Curated Layer
Aggregate Layer
Subset of data
Data Science Datasets
BI & Analytics
Platform Services
Orchestration
Secrets Mgr
KMS
EKS
ECR
OpenSearch
IAM
GuardDuty
Config / Hub
Inspector
* All services on AWS unless noted
Enterprise Data Platform — Reference Architecture