Day 237: Async IO Fundamentals - Concurrency While Waiting
Threads are one way to get concurrency, but they are not the only way. Async I/O becomes attractive when the real bottleneck is not CPU work, but how much time the program spends waiting for sockets, files, or timers to become ready.
Today's "Aha!" Moment
After processes, threads, locks, lock-free structures, and memory ordering, it is easy to assume that concurrency always means:
- more execution paths
- more scheduling
- more synchronization between them
Async I/O changes the center of gravity.
It starts from a different observation:
- many servers spend large amounts of time not computing, but waiting
Waiting for:
- the next bytes on a socket
- the kernel to finish a read
- a timeout to expire
- a remote peer to respond
The aha is:
- async I/O does not make waiting disappear
- it makes waiting non-blocking, so one execution context can keep making progress on other work while the kernel watches the slow thing
That is why an event loop can handle many in-flight operations without needing one blocked thread per connection.
So async I/O is best understood as:
- concurrency for I/O-bound workloads by multiplexing waiting, not by multiplying blocked threads
Why This Matters
Imagine a chat server handling tens of thousands of mostly idle connections.
At any given moment, most clients are doing almost nothing:
- waiting for a message
- waiting for a heartbeat
- waiting for the next small burst of traffic
If we dedicate one blocked thread per connection, the system pays substantial overhead for:
- thread stacks
- scheduler bookkeeping
- context switches
- synchronization around shared resources
Yet most of those threads are simply sleeping in read() or recv().
Async I/O changes the model:
- register interest in events
- let the kernel tell us which sockets are ready
- run a small piece of handler logic for the ready ones
- go back to the event loop
Now the program is not saying:
- "one waiting thread per connection"
It is saying:
- "one loop that reacts to whichever connections are ready right now"
This matters because it changes the cost structure dramatically for I/O-heavy systems.
The point is not that async is "modern" or always better. The point is that blocking threads are often a bad fit when waiting dominates and per-connection work is tiny.
Learning Objectives
By the end of this session, you will be able to:
- Explain why async I/O exists - Describe the difference between CPU-bound concurrency and I/O-bound waiting, and why the latter motivates an event-driven model.
- Trace the event loop model - Show how readiness notification and callback/coroutine resumption let one runtime manage many in-flight operations.
- Evaluate the trade-off - Recognize when async I/O reduces thread overhead and when it instead complicates control flow or fails to help with CPU-bound work.
Core Concepts Explained
Concept 1: Blocking I/O Ties Up an Execution Context While Nothing Useful Happens
Suppose a thread handles a client connection like this:
read request
wait for bytes
parse
call database
wait for response
format output
write response
wait for socket buffer
Large parts of that lifecycle are waiting, not computing.
With blocking I/O, the thread cannot do other useful work while stuck in those waits.
That is acceptable if:
- concurrency is low
- the number of connections is small
- the simplicity of one-thread-per-task is worth the cost
But it becomes expensive when thousands of tasks are mostly sleeping.
So the key insight is:
- blocking I/O wastes execution contexts on waiting
That waste is exactly what async I/O tries to remove.
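The lifecycle above can be made concrete with a small, self-contained sketch of the thread-per-connection model. This is illustrative, not a real server: a `socketpair` stands in for a client connection, and the handler thread spends almost all of its lifetime blocked in `recv()` rather than computing.

```python
import socket
import threading
import time

# Thread-per-connection sketch: the handler thread blocks in recv()
# until its client sends data, doing no useful work in the meantime.
def handle(conn):
    data = conn.recv(1024)      # thread sleeps here until bytes arrive
    conn.sendall(data.upper())  # tiny burst of actual CPU work
    conn.close()

server_end, client_end = socket.socketpair()
t = threading.Thread(target=handle, args=(server_end,))
t.start()

time.sleep(0.1)              # the handler thread is blocked this whole time
client_end.sendall(b"ping")  # only now can the handler make progress
reply = client_end.recv(1024)
t.join()
client_end.close()
print(reply)  # b'PING'
```

Multiply that one mostly-sleeping thread by tens of thousands of connections and the stack, scheduling, and context-switch costs dominate, even though almost no computation is happening.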
Concept 2: Async I/O Uses Readiness or Completion Notification to Reuse the Same Execution Context
The event-driven model usually looks like this:
- tell the kernel which sockets or file descriptors we care about
- wait for the kernel to report readiness or completion
- run the handler for whichever operation is now ready
- suspend again when that handler reaches another wait point
ASCII sketch:
event loop
|
+--> register interest in fd1, fd2, fd3, ...
|
+--> kernel says: fd2 ready, timer expired, fd9 writable
|
+--> run tiny units of work for ready events
|
+--> go back to waiting for the next set
This is the reason async can scale well for many mostly idle connections.
Instead of:
- many blocked threads each waiting separately
we have:
- one or a few loops resuming only the work that is actually unblocked
Modern async syntax with await makes this easier to read, but the underlying idea is the same:
- suspend when progress depends on external I/O
- resume when the awaited event is ready
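The register/wait/handle cycle above can be sketched with Python's `selectors` module, which wraps epoll/kqueue/select behind one API. This is a minimal readiness-based loop under simplified assumptions: two in-memory socket pairs stand in for client connections, and only one of them becomes ready.

```python
import selectors
import socket

# Minimal readiness-based event loop body:
# 1) register interest, 2) wait for the kernel, 3) handle ready fds.
sel = selectors.DefaultSelector()

# Two socket pairs stand in for two client connections.
pairs = [socket.socketpair() for _ in range(2)]
for i, (server_end, _) in enumerate(pairs):
    server_end.setblocking(False)
    sel.register(server_end, selectors.EVENT_READ, data=f"conn{i}")

# Only conn1's peer sends anything, so only conn1 becomes ready.
pairs[1][1].sendall(b"hello")

handled = []
for key, _events in sel.select(timeout=1):
    msg = key.fileobj.recv(1024)  # will not block: the kernel said it is ready
    handled.append((key.data, msg))

for server_end, client_end in pairs:
    server_end.close()
    client_end.close()

print(handled)  # [('conn1', b'hello')]
```

Note that the loop never touches `conn0`: work runs only for connections that are actually unblocked, which is exactly why idle connections cost almost nothing here.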
Concept 3: Async I/O Is Great for I/O-Bound Workloads, but It Does Not Magically Fix CPU-Bound Work
Async often gets overgeneralized.
It is excellent when:
- there are many concurrent waits
- each ready event does a relatively small amount of CPU work
- thread-per-connection overhead would dominate
It does not make CPU-heavy handlers faster.
If a handler spends a long time compressing data, parsing a giant document, or running expensive business logic, then the event loop itself can become blocked.
That leads to the core trade-off:
- async I/O reduces blocked-thread overhead
- but it demands that handlers stay short, cooperative, and aware of where they may yield
This is why async systems often need:
- worker pools for CPU-bound tasks
- explicit backpressure
- careful avoidance of accidental blocking calls inside the event loop
So async I/O is not a universal concurrency upgrade. It is a very good fit for one specific shape of workload:
- high concurrency
- lots of waiting
- small bursts of actual work between waits
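One standard escape hatch for the worker-pool point above is to push CPU-heavy steps off the event loop. A hedged sketch using `asyncio.to_thread` (Python 3.9+); `cpu_heavy` and `heartbeat` are illustrative names, and the loop-counting heartbeat is just a stand-in for "other coroutines that must keep running":

```python
import asyncio

# Stand-in for compression / parsing / heavy business logic.
def cpu_heavy():
    total = 0
    for i in range(200_000):
        total += i * i
    return total

# Other work the loop must keep servicing while the CPU task runs.
async def heartbeat(log):
    for _ in range(3):
        await asyncio.sleep(0.01)
        log.append("tick")

async def main():
    log = []
    result, _ = await asyncio.gather(
        asyncio.to_thread(cpu_heavy),  # CPU work runs off the event loop
        heartbeat(log),
    )
    return result, log

result, log = asyncio.run(main())
print(len(log))  # 3
```

Had `cpu_heavy()` been called directly inside a coroutine, the loop would have stalled until it finished; handing it to a worker keeps the loop free to resume the heartbeat at each await point.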
Troubleshooting
Issue: "Async means the program runs everything in parallel."
Why it happens / is confusing: The word "concurrent" is often mentally collapsed into "parallel."
Clarification / Fix: Async mainly helps one runtime manage many in-flight waits. It is about efficient multiplexing of blocked I/O, not automatic CPU parallelism.
Issue: "If we move to async, all scalability problems go away."
Why it happens / is confusing: Thread overhead is visible, so removing it feels like the whole problem.
Clarification / Fix: Async reduces one class of cost. It does not remove slow handlers, overloaded databases, bad backpressure, or CPU-heavy application code.
Issue: "If the code uses await, it cannot block the event loop."
Why it happens / is confusing: Developers equate async syntax with non-blocking behavior.
Clarification / Fix: A function can still call blocking APIs or do too much CPU work before yielding. Async correctness depends on what the code actually does, not on the presence of async keywords.
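This failure mode is easy to demonstrate. In the sketch below, an `async def` function calls the blocking `time.sleep` instead of `await asyncio.sleep`; because it never yields, the whole loop freezes and the "polite" coroutine cannot run until the blocking one finishes, even though it would otherwise finish first.

```python
import asyncio
import time

# An async function can still block the loop: time.sleep holds the
# thread, so no other coroutine runs until it returns.
async def blocking_task(order):
    time.sleep(0.05)           # blocking call: freezes the entire loop
    order.append("blocking done")

async def polite_task(order):
    await asyncio.sleep(0.01)  # yields to the loop while waiting
    order.append("polite done")

async def main():
    order = []
    await asyncio.gather(blocking_task(order), polite_task(order))
    return order

order = asyncio.run(main())
print(order)  # ['blocking done', 'polite done']
```

The 10 ms sleeper finishes after the 50 ms one, purely because the blocking call never gave the loop a chance to resume it.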
Advanced Connections
Connection 1: Async IO Fundamentals <-> Threads & Scheduling
The parallel: Threads create concurrency by giving the scheduler many execution paths. Async I/O creates concurrency by letting a smaller number of execution contexts manage many waits cooperatively.
Connection 2: Async IO Fundamentals <-> io_uring
The parallel: This lesson introduces the event-driven model at a high level. io_uring is one modern Linux mechanism that implements an efficient completion-based path for that style of I/O.
Resources
- [DOC] epoll(7)
- [DOC] select(2)
- [DOC] The Node.js Event Loop
- [BOOK] Operating Systems: Three Easy Pieces
Key Insights
- Async I/O is about concurrency while waiting - It helps when many operations are blocked on external I/O, not when the core problem is CPU saturation.
- The event loop reuses execution context across many in-flight operations - The kernel reports readiness or completion, and the runtime resumes only the work that can move forward.
- Async changes the cost model, not the laws of performance - It reduces blocked-thread overhead but still requires short handlers, backpressure, and care around accidental blocking work.
Knowledge Check
1. What problem is async I/O primarily trying to solve?
- A) Faster arithmetic on CPU-bound workloads
- B) Efficient handling of many concurrent waits without one blocked thread per operation
- C) Removal of all kernel scheduling
2. What does an event loop fundamentally do?
- A) It runs every handler continuously whether or not I/O is ready
- B) It waits for readiness/completion events and resumes the work that can now make progress
- C) It converts network traffic into disk sectors
3. When is async I/O a poor fit by itself?
- A) When handlers spend long periods on CPU-heavy work without yielding
- B) When the application uses sockets
- C) When the kernel supports readiness notification
Answers
1. B: Async I/O is primarily about handling many concurrent waits efficiently instead of dedicating one blocked thread to each one.
2. B: The event loop waits for the kernel to report which operations are ready and resumes only those paths.
3. A: Async I/O does not solve CPU saturation; CPU-heavy work can still block the loop and needs another strategy.