A distribution is the trace a process leaves behind

There’s a shopping cart full of bouncy balls in the nosebleed seats of Levi Stadium. Visualize what happens when you knock it over.

The balls start in one place. Then the collisions begin. One clips a seatback and kicks left. Another catches the lip of a stair and hops right. Another glances off a railing, then another ball, then another stair edge. Any single bounce changes very little. A long sequence of bounces creates spread.

The process

By the time the balls reach the bottom, most end up somewhere near the center. Their left and right deflections mostly canceled out. Far fewer drift all the way to one side, because long runs in the same direction are rarer. The final pile has a dense middle, thinning sides, and only a few extremes.

The distribution

0 / 250μ 0.00·σ 0.00expected 10.00 · 2.24

That is the logic behind a normal distribution. Repeated, balanced variation produces many middling outcomes and only a handful far from the center.

A distribution carries clues about the process that made it. Tight clustering suggests little room for outcomes to diverge. Wide spread suggests many chances for small differences to accumulate. Bias the bounces, change the geometry, or introduce a few larger shocks, and the shape changes with the process.

But the shape is only a clue. Different processes can leave behind similar-looking piles. The real question is always the same: what kind of process, repeated many times, could have made the pattern you see?