Section 2: Systems Basics

You know SQL and the architectural tradeoffs. Now, let's dissect how a database actually executes those operations.


The Key Problems We Solve

1. The I/O Bottleneck

Hardware dictates software behavior.

  • The Problem: Why does one operation fly while another crawls? Why does a query suddenly choke the server?

  • The Reality: We build an I/O Cost Model based on the latency gap between RAM and Disk. You'll see how databases use paging to sidestep costly disk access.

2. The Search Problem

Scanning a billion rows? That's a fool's errand.

  • The Problem: How do systems pinpoint a record, or confirm its absence, without combing through everything?

  • The Reality: We'll dive into Basic Hashing and Bloom Filters. Learn how databases use mathematical shortcuts to dodge unnecessary disk reads.

3. The Physical Constraints Problem

Data centers aren't infinite.

  • The Problem: How do you store petabytes without ballooning costs or delays?

  • The Reality: We explore Compression techniques, showing how databases squeeze more into fewer bytes by trading CPU cycles for savings on pricey disk operations.


Let's peek under the hood.