In short

The addition theorem says P(A \cup B) = P(A) + P(B) - P(A \cap B). The subtraction corrects for the overlap being counted twice. For three events, a second correction layer is needed. For n events, the pattern generalises into the inclusion-exclusion principle — add singles, subtract pairs, add triples, subtract quadruples, and so on.

A standard deck has 52 cards. You draw one at random. What is the probability that the card is a heart or a king?

Your first instinct might be: there are 13 hearts and 4 kings, so P = 17/52. But that is wrong. The king of hearts is both a heart and a king, and you just counted it twice — once among the 13 hearts and once among the 4 kings.

The correct count: 13 + 4 - 1 = 16 favourable cards (subtracting the one card that was double-counted), so P = 16/52 = 4/13.

That subtraction — removing the overlap to fix the double-count — is the entire content of the addition theorem. The formula works for every probability problem where you ask "what is the chance that this or that happens?" and the two events might overlap.

The formula and its proof

Here is the theorem, stated precisely.

Addition Theorem (two events)

For any two events A and B,

P(A \cup B) = P(A) + P(B) - P(A \cap B)

The proof works by splitting the union into disjoint pieces, then reassembling.

Proof. The union A \cup B can be split into three mutually exclusive parts:

A \cup B = (A \cap B') \;\cup\; (A \cap B) \;\cup\; (A' \cap B)

These three sets are pairwise disjoint — no outcome belongs to more than one of them. By Axiom 3 (countable additivity):

P(A \cup B) = P(A \cap B') + P(A \cap B) + P(A' \cap B) \tag{1}

Now, A itself is the disjoint union A = (A \cap B') \cup (A \cap B), so:

P(A) = P(A \cap B') + P(A \cap B)
P(A \cap B') = P(A) - P(A \cap B) \tag{2}

Similarly, B = (A' \cap B) \cup (A \cap B), so:

P(A' \cap B) = P(B) - P(A \cap B) \tag{3}

Substitute (2) and (3) into (1):

P(A \cup B) = [P(A) - P(A \cap B)] + P(A \cap B) + [P(B) - P(A \cap B)]
P(A \cup B) = P(A) + P(B) - P(A \cap B)

That's the proof — clean and complete. The subtraction -P(A \cap B) at the end is doing exactly one job: it removes the double-count of the overlap.

Venn diagram showing the addition theorem visuallyTwo overlapping circles A and B inside a rectangle S. The left-only region is labelled P(A) minus P(A intersect B). The overlap is labelled P(A intersect B) with the text 'counted twice, subtracted once'. The right-only region is labelled P(B) minus P(A intersect B). S only A A ∩ B counted twice subtracted once only B A B
When you add $P(A)$ and $P(B)$, the middle region — $A \cap B$ — gets counted twice. The addition theorem subtracts it once to correct the count.

The special case: mutually exclusive events

When A and B are mutually exclusive (A \cap B = \varnothing), the overlap is empty, so P(A \cap B) = 0. The formula simplifies to:

P(A \cup B) = P(A) + P(B)

This is just Axiom 3 for two events. The addition theorem generalises the axiom to events that are not mutually exclusive.

Three events: a second layer of correction

What if you have three events — A, B, and C — and you want P(A \cup B \cup C)?

The same double-counting problem occurs, but now it has more layers.

Addition Theorem (three events)

For any three events A, B, and C,

P(A \cup B \cup C) = P(A) + P(B) + P(C) - P(A \cap B) - P(A \cap C) - P(B \cap C) + P(A \cap B \cap C)

Proof. Write A \cup B \cup C = (A \cup B) \cup C and apply the two-event formula with the events D = A \cup B and C:

P(D \cup C) = P(D) + P(C) - P(D \cap C)

Now expand each piece.

For P(D) = P(A \cup B), use the two-event formula:

P(A \cup B) = P(A) + P(B) - P(A \cap B) \tag{4}

For P(D \cap C) = P((A \cup B) \cap C), distribute the intersection over the union:

(A \cup B) \cap C = (A \cap C) \cup (B \cap C)

Apply the two-event formula again:

P((A \cap C) \cup (B \cap C)) = P(A \cap C) + P(B \cap C) - P(A \cap B \cap C) \tag{5}

Substitute (4) and (5) back:

P(A \cup B \cup C) = [P(A) + P(B) - P(A \cap B)] + P(C) - [P(A \cap C) + P(B \cap C) - P(A \cap B \cap C)]
= P(A) + P(B) + P(C) - P(A \cap B) - P(A \cap C) - P(B \cap C) + P(A \cap B \cap C)

Notice the pattern: add the singles, subtract the pairs, add back the triple. Each layer corrects for the overcounting introduced by the previous layer.

Three-circle Venn diagram showing inclusion-exclusionThree overlapping circles A, B, and C forming seven distinct regions. The triple overlap in the centre is highlighted in red. The three pairwise overlaps and the three single-only regions are all visible. S A B C A∩B A∩C B∩C A∩B∩C
Three overlapping events partition the union into seven regions. Adding the three circle areas double-counts each pairwise overlap and triple-counts the centre. Subtracting the three pairwise overlaps removes the double-counts but also removes the centre entirely. Adding back the triple overlap restores it exactly once.

Why the triple overlap gets added

Here is a careful count of how many times the central region A \cap B \cap C appears in each step:

Step Contribution Count of A \cap B \cap C
+P(A) included in A +1
+P(B) included in B +1
+P(C) included in C +1
-P(A \cap B) included in A \cap B -1
-P(A \cap C) included in A \cap C -1
-P(B \cap C) included in B \cap C -1
Running total 0

After adding singles and subtracting pairs, the centre has been counted 3 - 3 = 0 times. It has been subtracted away entirely. The final +P(A \cap B \cap C) puts it back exactly once.

The general principle: inclusion-exclusion

The pattern continues. For n events A_1, A_2, \ldots, A_n:

Inclusion-Exclusion Principle

P\!\left(\bigcup_{i=1}^{n} A_i\right) = \sum_{i} P(A_i) - \sum_{i<j} P(A_i \cap A_j) + \sum_{i<j<k} P(A_i \cap A_j \cap A_k) - \cdots + (-1)^{n+1} P(A_1 \cap A_2 \cap \cdots \cap A_n)

The signs alternate: add singles, subtract pairs, add triples, subtract quadruples, and so on. At each level, you are correcting for the overcounting (or undercounting) introduced by the previous level.

For n = 2: one positive term, one negative term. For n = 3: three positive, three negative, one positive. For n = 4: four positive, six negative, four positive, one negative — the coefficients follow the binomial pattern \binom{n}{1}, \binom{n}{2}, \binom{n}{3}, \ldots

The proof is by induction on n. The base case n = 2 is the two-event addition theorem, already proved. The inductive step uses the same "group and apply the two-event formula" strategy used in the three-event proof — writing A_1 \cup \cdots \cup A_n = (A_1 \cup \cdots \cup A_{n-1}) \cup A_n, expanding, and distributing intersections over unions.

Seeing it with numbers

Example 1: Hearts or kings from a deck

A standard 52-card deck. One card is drawn. Let A = "the card is a heart" and B = "the card is a king." Find P(A \cup B).

Step 1. Find P(A). There are 13 hearts in a 52-card deck:

P(A) = \frac{13}{52} = \frac{1}{4}

Why: one of the four suits is hearts, so exactly a quarter of the deck.

Step 2. Find P(B). There are 4 kings:

P(B) = \frac{4}{52} = \frac{1}{13}

Why: one king per suit, four suits.

Step 3. Find P(A \cap B). The overlap is the king of hearts — one card:

P(A \cap B) = \frac{1}{52}

Why: there is exactly one card that is both a heart and a king.

Step 4. Apply the addition theorem:

P(A \cup B) = \frac{13}{52} + \frac{4}{52} - \frac{1}{52} = \frac{16}{52} = \frac{4}{13}

Why: without the subtraction, you would get 17/52, which overcounts by one — the king of hearts.

Result: P(\text{heart or king}) = \dfrac{4}{13} \approx 0.308.

Venn diagram for hearts and kings in a deckTwo overlapping circles inside a rectangle labelled 52 cards. The left circle is labelled Hearts with 12 cards unique to it. The right circle is labelled Kings with 3 cards unique to it. The overlap contains 1 card: the king of hearts. Together they make 16 cards. 52 cards Hearts 12 cards (non-king hearts) K♥ 1 card Kings 3 cards (non-heart kings) Total in union: 12 + 1 + 3 = 16
The 52-card deck partitioned by the events "heart" and "king." The overlap — the king of hearts — contains exactly one card. Adding $13 + 4$ would count it twice; the addition theorem subtracts it once, giving $16$ favourable cards.

The diagram makes the double-counting visible: the single card sitting in the overlap gets pulled into both circles. The subtraction in the formula is just the diagram talking.

Example 2: Three overlapping events with a student survey

In a class of 60 students: 35 play cricket (A), 25 play football (B), 20 play basketball (C). Also, 12 play both cricket and football, 8 play both cricket and basketball, 10 play both football and basketball, and 5 play all three. Find the probability that a randomly chosen student plays at least one sport.

Step 1. Write down the given information as probabilities (each divided by 60):

P(A) = \frac{35}{60}, \quad P(B) = \frac{25}{60}, \quad P(C) = \frac{20}{60}

Why: each count divided by 60 (the total) gives the probability under equally likely selection.

Step 2. Write the pairwise and triple overlaps:

P(A \cap B) = \frac{12}{60}, \quad P(A \cap C) = \frac{8}{60}, \quad P(B \cap C) = \frac{10}{60}, \quad P(A \cap B \cap C) = \frac{5}{60}

Why: the problem gives these directly. These are the correction terms in the three-event formula.

Step 3. Apply the three-event addition theorem:

P(A \cup B \cup C) = \frac{35 + 25 + 20}{60} - \frac{12 + 8 + 10}{60} + \frac{5}{60}
= \frac{80}{60} - \frac{30}{60} + \frac{5}{60} = \frac{55}{60} = \frac{11}{12}

Why: add singles (80), subtract pairs (30), add triple (5). Each layer corrects the over- or under-counting of the previous layer.

Step 4. Interpret. Out of 60 students, 55 play at least one sport. Only 5 play none.

Result: P(\text{at least one sport}) = \dfrac{11}{12} \approx 0.917.

Three-circle Venn diagram for the sports surveyThree overlapping circles for Cricket (35), Football (25), and Basketball (20). The numbers in each of the seven regions are computed using inclusion-exclusion. The triple overlap has 5 students. Outside all three circles, 5 students play no sport. 60 students Cricket Football Basketball 20 12 7 7 3 5 5 None: 5
The seven regions of the Venn diagram. The numbers inside each region are computed by working inward from the triple overlap ($5$), then the pairwise overlaps, then the single-only counts. They add to $55$, leaving $5$ students outside all three circles. The inclusion-exclusion formula gave us $55/60$ without needing to fill in each region individually.

Notice that you did not need to compute the individual region counts (the numbers inside each zone of the Venn diagram) to get the final answer. The inclusion-exclusion formula gave you 55 directly. The region counts are useful for more detailed questions ("how many play only cricket?"), but for "at least one sport" the formula does all the work.

Common confusions

Going deeper

If you came here to learn the addition theorem for two and three events, you have it — you can stop here. The rest is for readers who want to see inclusion-exclusion proved in general and applied to a classic counting problem.

The Bonferroni inequalities

Inclusion-exclusion gives the exact value of P(A_1 \cup \cdots \cup A_n). But the formula has 2^n - 1 terms, which grows fast. The Bonferroni inequalities say that if you stop the alternating sum at any stage, you get a bound:

In general, stopping after an odd-indexed layer gives an upper bound, and stopping after an even-indexed layer gives a lower bound. These bounds are useful when computing all 2^n - 1 terms is impractical but you still want an estimate.

A classic application: the matching problem (derangements)

How many permutations of \{1, 2, \ldots, n\} have no fixed point — that is, \sigma(i) \neq i for every i? These are called derangements, and inclusion-exclusion gives the answer elegantly.

Let A_i be the event that position i is a fixed point. Then the number of permutations with at least one fixed point is |A_1 \cup \cdots \cup A_n|, computed by inclusion-exclusion:

|A_1 \cup \cdots \cup A_n| = \binom{n}{1}(n-1)! - \binom{n}{2}(n-2)! + \binom{n}{3}(n-3)! - \cdots

Each term \binom{n}{k}(n-k)! counts the permutations that fix a specific set of k positions (the remaining n - k elements can go anywhere). The number of derangements is:

D_n = n! - \binom{n}{1}(n-1)! + \binom{n}{2}(n-2)! - \cdots + (-1)^n \cdot 1 = n!\sum_{k=0}^{n}\frac{(-1)^k}{k!}

As n grows, D_n/n! \to 1/e \approx 0.3679. About 36.8\% of all permutations are derangements — a fact that surprises most people and is a direct consequence of inclusion-exclusion.

Where this leads next

The addition theorem is the first major result built on the axioms. The next articles extend the framework in new directions.