Visualizing Back-of-the-Envelope Numbers
December 1, 2024
System Design
It can be difficult developing a intuition for the numbers thrown around in system design interviews. For my own sake, I thought I'd try to visualize them.
Data Type Sizes
First, a table of data type sizes. These are Postgres data types, but they should be similar for other systems. The squares are scaled in size logarithmically, which means that the differences are really much bigger than they seem here.
Type | Postgres Data Types | Size | Notes |
---|---|---|---|
Integer |
| 2 bytes 4 bytes 8 bytes | Used for numeric fields, IDs, and counters. Larger sizes support larger ranges. |
Floating Point |
| 4 bytes 8 bytes | Used for scientific calculations or approximate values. |
Timestamp |
| 8 bytes | Used for date and time fields, with or without timezone information. |
UUID |
| 16 bytes | Universally unique identifiers for distributed systems. |
Enum |
| 1 byte 4 bytes |
|
String |
| ~100 bytes ~1 KB ~10 KB |
|
JSONB |
| ~100 bytes ~1 KB ~10 KB |
|
BYTEA |
| ~1 KB ~100 KB ~1 MB |
|
Legend: 1 byte 100 bytes 1 KB (1,024 bytes) 10 KB 100 KB 1 MB (1,048,576 bytes) |
Time Scales
Second, this is a table of time scales. Some quick comparisons: it's 160x faster to read from RAM than do a random read from SSD. It's 4x faster to read 1MB sequentially from RAM than to do the same from SSD. Squares are scaled in size logarithmically.
Operation | Time | Visualization |
---|---|---|
L1 Cache Reference Fetching a hot variable in a running JavaScript function. | 0.5 ns | |
L2 Cache Reference Accessing slightly less frequent in-memory data. | 7 ns | |
L3 Cache Reference Resolving reused middleware configurations in Node.js. | 20-30 ns | |
Main Memory Reference Fetching data from a Redis cache stored in RAM. | 100 ns | |
Compress 1KB of Data in Memory Gzip compressing an HTTP response in Express.js. | 3,000 ns | |
Send 1KB over 1 Gbps Network Sending a small JSON response over the network. | 10,000 ns | |
SSD Random Read Querying a session ID from persistent Redis storage. | 16,000 ns | |
Read 1MB Sequentially from Memory Iterating through a large in-memory buffer of user data. | 250,000 ns | |
Read 1MB Sequentially from SSD Retrieving a 1MB file stored on disk for a static site. | 1,000,000 ns | |
Disk Seek Accessing a log file stored on a spinning disk. | 10,000,000 ns | |
Send Packet US to Europe and Back Calling an API hosted in a European data center. | 150,000,000 ns | |
Legend: 0.5 ns 1 ns 100 ns 1,000 ns 10,000 ns 1,000,000 ns 10,000,000 ns 150,000,000 ns |