This PSET covers Module 2: IO cost model (RAM, SSD, HDD), hashing, Bloom filters, and compression (RLE, dictionary encoding, columnar storage). Before starting, you should have read the Module 2 content, attended the lectures, and completed the Systems Primer and Data Systems Intro Colab walkthroughs in your CA section.
IO Model: Unless stated otherwise, page size = 64 MB. RAM: 100ns access, 100 GB/s scan. SSD: 10μs access, 5 GB/s scan. HDD: 10ms access, 100 MB/s scan. Cw = 1.