School usually presents a(b + c) = ab + ac as a rule to memorise, a line in a table of "algebraic identities." It is not that. It is a statement about rectangles — about the obvious fact that cutting a rectangle with a vertical line doesn't change its total area. Once you see this, distributivity stops being a formula and becomes a picture. And the picture does the job for factoring (ab + ac \to a(b + c)) every bit as well as it does for expanding (a(b + c) \to ab + ac).

The picture

Draw a rectangle of height a and width b + c. Its area is

\text{big rectangle} = a \cdot (b + c).

Now cut it vertically at width b. You get two smaller rectangles: a left piece of height a and width b, and a right piece of height a and width c. Their areas are

\text{left piece} = a \cdot b \qquad \text{right piece} = a \cdot c.

The total area hasn't changed — you only drew a line. So

a \cdot (b + c) = a \cdot b + a \cdot c.

That equation isn't an identity to believe on faith. It is the number \text{(area of the rectangle)} computed two ways: once as a single rectangle, and once as the sum of two rectangles. Both computations must give the same number because they describe the same shape.

Drag the cut

The figure below shows a rectangle of height a = 3 and total width 6. You can drag the vertical cut left or right to change the split between b and c (their sum always stays 6). The readouts compute the two areas and their sum — and confirm that the sum equals a \cdot (b + c) = 3 \cdot 6 = 18, no matter where you put the cut.

Drag the slider (or grab the blue handle on the cut). The left piece's width ($b$) and the right piece's width ($c$) change, and so do their areas ($ab$ and $ac$). Their sum stays locked at $18$ — the outer rectangle never changed.

The readout values, at a few snapshots:

b c ab ac ab + ac a(b+c)
1 5 3 15 18 18
2 4 6 12 18 18
3 3 9 9 18 18
4.5 1.5 13.5 4.5 18 18

Every row: ab + ac = a(b + c). Every row: the final column is 3 \times 6 = 18. The cut is a story we tell about the rectangle; the rectangle itself is unchanged.

Why the area is invariant: cutting a shape into pieces partitions its area. Sum-of-parts equals whole. Nothing mysterious. The distributive law is this partition rule applied to a rectangle with one side of length a and the other side split into b + c.

Running it backwards: the same picture factors

The distributive law reads left-to-right when you expand brackets and right-to-left when you factor. The rectangle does both.

  • Expand: given a(b + c), you see the big rectangle and want the sum ab + ac. Cut vertically.
  • Factor: given ab + ac, you see two rectangles with a common height a. Slide them together and the combined rectangle has width b + c, so the total area is a(b + c).

Both readings are the same picture, done in opposite directions. The factor move (ab + ac \to a(b + c)) is the single most common first step in school algebra — "take out the common factor" — and the rectangle makes it inevitable. The height doesn't care whether you are looking at two pieces or one; all that matters is that both pieces share it.

The common trap: addition does not distribute over multiplication

Students often write

a + (b \times c) \stackrel{?}{=} (a + b) \times (a + c). \quad (\text{WRONG})

No. The distributive law goes one direction only: multiplication distributes over addition, but addition does not distribute over multiplication. The rectangle picture explains why. If you try to cut a rectangle of height a + b and width a + c, you do not get an area equal to a + bc. You get

(a + b)(a + c) = a^2 + ab + ac + bc,

which is a whole different story — four rectangles, not two. Why the asymmetry: multiplication is what the area computation is. Cutting the width (distributing multiplication over addition of widths) preserves area because the height stays constant. There is no analogous cut-by-adding move.

From picture to identity

Once you trust the picture, all the "distributive" identities you learn later are just shorter or taller or repeated versions of it.

  • 3(x + 7) = 3x + 21 — rectangle of height 3, width x + 7, cut at x.
  • a(b + c + d) = ab + ac + ad — rectangle with a width split into three pieces, so two cuts; three smaller rectangles on the right.
  • (a + b)(c + d) = ac + ad + bc + bd — rectangle cut both horizontally and vertically, producing a 2 \times 2 grid of four smaller rectangles. This is also where the (a + b)^2 = a^2 + 2ab + b^2 identity comes from: it is the square case where c = a and d = b.

The rectangle is the universal mnemonic. If you ever forget which side distributes over which, draw the rectangle — the geometry forces the right answer.

FOIL is just a 2-by-2 grid of rectangles

Expand (x + 3)(x + 5).

Draw a rectangle of height x + 3 and width x + 5. Cut it horizontally at height x and vertically at width x. You now have four smaller rectangles:

  • top-left: area x \cdot x = x^2
  • top-right: area x \cdot 5 = 5x
  • bottom-left: area 3 \cdot x = 3x
  • bottom-right: area 3 \cdot 5 = 15

Sum: x^2 + 5x + 3x + 15 = x^2 + 8x + 15.

Why: the full rectangle has area (x + 3)(x + 5). The four pieces have areas x^2, 5x, 3x, and 15. Setting sum-of-pieces equal to whole gives the expansion. The FOIL acronym (First, Outer, Inner, Last) is just a mnemonic for visiting the four rectangles in a fixed order.

The reflex, in one line

When you see a(b + c) or ab + ac, picture a rectangle. Expanding a bracket is cutting the rectangle. Factoring a common term is merging two rectangles that share a height. The rectangle is the distributive law made visible.

Related: Operations and Properties · Algebraic Expressions · Algebraic Identities · Expression Trees: Watching Precedence Collapse