Data Systems

CLASSIFICATION

12 tracks / 333 lessons

TRACKS

[HIDDEN]

Database Engine Internals and Implementation (Legacy Umbrella)

Oversized legacy source track, retained for migration only. Prefer the focused review tracks for storage engines, query execution, transactions, backend database operations, and PostgreSQL operations.

Deep Dive / 47 lessons

Not published

[HIDDEN]

Data Architecture and Platforms

End-to-end data system architecture, platform contracts, and data-intensive operating models.

Deep Dive / 46 lessons

Not published

[DRAFT]

Analytical Query Engines and Warehouses

Columnar storage, execution engines, warehouse architecture, vectorization, and the internals of large-scale analytical query systems.

Deep Dive / 32 lessons

Not published

[DRAFT]

Data Integration, CDC, and Pipelines

Change capture, ingestion contracts, backfills, schema drift, and the operational trade-offs of moving data through modern pipelines.

Specialization / 24 lessons

Not published

[DRAFT]

Data Systems Foundations

Data models, storage trade-offs, batch versus streaming, analytical versus transactional systems, and the basic mental models for modern data stacks.

Foundation / 16 lessons

Not published

[DRAFT]

Data Lakehouse and Storage Formats

Columnar formats, table metadata layers, schema evolution, compaction, and lakehouse architecture.

Specialization / 24 lessons

Not published

[DRAFT]

Metadata, Lineage, and Catalog Systems

Schemas, ownership, lineage graphs, discovery surfaces, and the metadata infrastructure that makes data platforms governable.

Specialization / 24 lessons

Not published

[DRAFT]

Streaming Data Infrastructure

Streaming ingestion, stateful processors, watermarks, checkpoints, exactly-once claims, backpressure, replay, and the platform patterns behind low-latency data movement.

Specialization / 24 lessons

Not published

[DRAFT]

In-Memory Data Systems and Redis

In-memory system design through Redis as the concrete case study: event loops, data structures, persistence, replication, clustering, caching, queues, locks, and operations.

Specialization / 24 lessons

Not published

[DRAFT]

NoSQL and Distributed Data Stores

Key-value, document, wide-column, graph, and search-oriented data stores with partitioning, replication, consistency, compaction, indexing, and operations.

Specialization / 24 lessons

Not published

[DRAFT]

Backend Database Operations and Query Performance

Operational database depth for backend engineers: connection pools, isolation, query planning, index health, sharding, replicas, failover, and split-brain prevention.

Deep Dive / 24 lessons

Not published

[DRAFT]

PostgreSQL Internals and Operations

PostgreSQL-specific depth for production systems: MVCC, WAL, locks, planner evidence, indexes, vacuum, replication, pooling, migrations, security, and operational debugging.

Deep Dive / 24 lessons

Not published