A map of European countries needs colours to distinguish neighbouring states. Looking at it, you might guess you’d need many — there are 50 or so countries, some with complicated borders. Try it. You’ll find that four colours always suffice, no matter how baroque the boundaries. France, Germany, Italy, Spain, and the rest can always be coloured with red, green, blue, and yellow such that no two adjacent countries share a colour.

This observation, made informally by London cartographer Francis Guthrie in 1852, became one of the most famous open problems in mathematics. For 124 years, mathematicians proved partial results, found gaps in claimed proofs, and tried — without success — to demonstrate that four colours were always enough. The problem was finally settled in 1976 by Kenneth Appel and Wolfgang Haken at the University of Illinois. Their proof was unlike any in the previous history of mathematics. It involved 1,200 hours of computer time analysing 1,936 specific configurations of regions, with no possibility of human verification of every step.

The Four-Color Theorem was the first major mathematical result whose proof depended fundamentally on machine computation. It changed what mathematicians were willing to accept as a proof, and the debate it provoked is still active.

The statement

A “map” in the precise mathematical sense is a partition of a region (say, the plane) into countries — connected pieces with well-defined boundaries. Two countries are “adjacent” if they share a border of positive length. Sharing only a point doesn’t count.

The Four-Color Theorem states: every such map can be coloured with at most four colours so that no two adjacent countries share a colour.

The constraint that countries have to be connected matters. If you allow countries with disconnected pieces (think of the United States with Alaska and Hawaii), you can construct examples requiring more colours. Real maps, by convention, treat enclaves as the same country, but mathematically the theorem is about connected regions.

The theorem is also specifically about planar maps — maps drawn on a flat surface. On a torus (donut-shaped surface) the answer is seven, not four. On the sphere, four (which is equivalent to planar by stereographic projection).

Why three is not enough

The lower bound is easy. Consider four countries arranged so that each one borders all three of the others — for example, three countries surrounding a fourth in the middle. You cannot colour this with fewer than four colours: each of the four mutually-adjacent countries needs its own colour.

Such configurations exist on real maps. In Africa, Burkina Faso, Mali, Niger, and Algeria-and-friends can be arranged in such a way. So three colours are demonstrably insufficient.

The hard direction is showing that four are always enough.

A century of partial results

The first published claim of a proof came in 1879 from Alfred Kempe, who introduced an idea — now called Kempe chains — that turned out to be the right framework but with a flaw. Kempe’s argument was widely accepted for eleven years until in 1890 Percy Heawood found a counterexample to the central step. Kempe’s method showed that five colours always suffice — a result that has remained a clean elementary theorem in graph theory ever since — but the four-colour case remained open.

For the next 86 years, mathematicians refined Kempe’s approach. The key insight was a translation: a planar map can be replaced by its dual graph, in which countries become vertices and adjacencies become edges. The four-colour problem becomes a graph-theory question: can every planar graph be vertex-coloured with four colours? This is the modern formulation.

Progress came in two forms:

  • Reducibility. Show that certain configurations of vertices in a planar graph can be replaced by smaller configurations without changing colorability. If a graph contains a “reducible” configuration, you can simplify it and reduce the colorability question to a smaller graph.
  • Unavoidability. Show that any planar graph (with high enough connectivity) must contain at least one configuration from a finite catalogue. If the catalogue is “unavoidable,” every counterexample must contain at least one item.

The proof strategy was clear in principle: find a finite catalogue of configurations that is both unavoidable (every counterexample contains one) and reducible (each one can be simplified). If you can find such a catalogue, no minimum counterexample can exist, so no counterexample exists at all.

The catalogue had to contain configurations that were genuinely tractable. By 1969 Heinrich Heesch had identified roughly 9,000 configurations that he believed were both unavoidable and reducible. Verifying reducibility for each one required laborious case analysis.

The Appel-Haken proof

Kenneth Appel and Wolfgang Haken refined Heesch’s approach in the early 1970s. They reduced the catalogue to 1,936 configurations and developed an algorithm to verify reducibility for each one mechanically. In 1976, after about 1,200 hours of IBM 360 computer time, the algorithm reported success: every configuration in the catalogue was reducible, and the catalogue was unavoidable.

The Four-Color Theorem was proved.

The mathematical community’s response was mixed. The proof was over 700 pages of human-readable analysis plus computer outputs verifying the 1,936 configurations. No human could ever check the computer’s work directly. For some mathematicians this was fine — the algorithm itself was clear, the configurations explicit, and the logic of the proof unimpeachable. For others, this was uncomfortable. A proof that no human could verify in their lifetime felt qualitatively different from a Euclidean proof you could read on a single page.

The philosopher Thomas Tymoczko, in a 1979 paper, argued that the Appel-Haken proof represented a genuine epistemological shift. Mathematics had traditionally rested on proofs that could in principle be checked by any sufficiently skilled human. Computer-assisted proofs introduced a new category: results that had to be trusted on the basis of the correctness of the software and hardware, not just the correctness of the argument. This was, Tymoczko argued, no longer pure mathematics in the classical sense.

Most mathematicians have come to accept the proof, but the debate it sparked has not gone away.

Verification and re-verification

In 1996, Neil Robertson, Daniel Sanders, Paul Seymour, and Robin Thomas produced a simpler computer-assisted proof. They reduced the unavoidable set to 633 configurations and significantly simplified the reducibility verification. Their proof was easier to check — though still beyond what any human could complete by hand.

In 2005, Georges Gonthier completed a fully formal verification of the four-color theorem using the Coq proof assistant. Coq is a software system that mechanically verifies mathematical proofs from the axioms of set theory upward, using a small “kernel” that is itself verifiable by humans. Gonthier translated the Robertson-Sanders-Seymour-Thomas proof into Coq, which then checked every step.

This verification removed the last serious objection to the proof. The Four-Color Theorem is now as rigorously established as any theorem in mathematics — arguably more so, since most published proofs are checked only by human peer review and contain undetected errors with measurable frequency. Gonthier’s verification produces a certificate that is as good as any mathematical certificate gets.

Why no human-readable proof exists

Despite decades of effort, no proof of the Four-Color Theorem has been found that does not require enormous case analysis. There are reasons to believe this is fundamental.

The theorem is, in some sense, a result about a vast collection of finite combinatorial objects (planar graphs). A clean conceptual proof would require finding a single conceptual structure shared by all such objects. None has been found, and several theoretical results suggest the search is unlikely to succeed: the four-color theorem is “Ramsey-like” in flavour, and Ramsey-type results often resist short proofs.

The contrast with adjacent results is striking. The Five-Color Theorem has a clean human-readable proof: about a page of clever induction using Kempe chains. The Six-Color Theorem (which holds on the sphere) has an even shorter proof using Euler’s polyhedron formula. Going from five to four costs you, apparently, an unbridgeable jump in proof complexity. Why this should be is itself a subject of speculation.

What the proof teaches

The Four-Color Theorem is now part of the standard graph-theory curriculum. The Appel-Haken proof and its successors are part of the standard history. Several lessons have been absorbed.

Mathematics has admitted machine assistance. The mathematical community now accepts proofs that depend on machine computation, provided the computational steps are well-specified and reproducible. This has opened the door to proofs of other results — Kepler’s sphere-packing conjecture (proved by Hales in 1998–2014, with Coq verification completed in 2014), the Robbins conjecture in algebra (proved in 1996 with the EQP theorem prover), and various combinatorial-geometry results — that no human could have completed unaided.

Proof certainty has been re-evaluated. Formal verification systems like Coq, Isabelle, Lean, and Agda allow proofs to be checked at a level of rigour that human review cannot match. Several traditional human-verified proofs, when translated into formal verifiers, have been found to contain gaps that survived decades of peer review. The Four-Color Theorem may now be the most rigorously verified theorem in mathematics, despite the original proof being controversial.

The aesthetics question remains open. Many mathematicians still feel the lack of a “beautiful” proof of the Four-Color Theorem. The problem itself is so simple to state that the absence of a conceptual proof feels like a cosmic glitch. Why should four colours suffice for an arbitrary map? The proof shows the answer is yes; it does not show why. That dissatisfaction is, perhaps, the price of having mechanically verifiable proofs of statements that resist conceptual unification.

A footnote

A well-known piece of mathematical folklore: a famous topologist, told that the Four-Color Theorem had been proved by computer, replied: “Wonderful — now we have computer-assisted proofs that there is no human-comprehensible proof of the four-color theorem.”

That’s not quite what the Appel-Haken result proves. It is, however, what many mathematicians felt the result had effectively shown. After half a century of further work, no proof has emerged that contradicts the suspicion. The Four-Color Theorem is true, the theorem is rigorously established, and the underlying reason remains genuinely opaque to human reasoning.

That, in itself, is one of the most interesting facts in modern mathematics.

Frequently asked

Has the proof been verified?

Yes. In 2005, Georges Gonthier produced a fully formal verification using the Coq proof assistant, mechanically checking every step from the axioms of set theory upward. So the proof is now as rigorous as any in mathematics — arguably more so, since human-checked proofs can contain errors that survive peer review for decades.

Why does it require four colors and not three?

Three colors are not enough — the simplest counterexample is four regions arranged so that each one borders the other three. Such a configuration cannot be coloured with three colors. Four colors always suffice; this is the content of the theorem.

Does the theorem apply to maps on a sphere or torus?

On the sphere, yes — the four-color theorem applies, and the proof goes through. On the torus, the answer is seven colors. The maximum number of colors needed depends on the surface's Euler characteristic. The torus version was proved before the planar version, in 1890.