Three-Part Invention & Topic I — The MU-puzzle

5 min read

Core idea

The MIU-system is the smallest interesting formal system in the book. Its alphabet is three letters: M, I, U. Its single axiom is the string MI. Four rules let you transform one string into another. The puzzle: starting from MI, can you derive the string MU?

The puzzle is engineered so that anyone willing to try will, after a few hours, develop a strong suspicion that the answer is no — but no amount of playing the rules will prove that. The proof requires stepping outside the system and noticing an invariant — a property the rules cannot disturb — that MI has and MU does not. The lesson is the lesson of the rest of the book in miniature: inside any formal system there are questions you cannot answer using only the system; you need a higher level that talks about the system.

Hofstadter's argument: A formal system is a closed game of symbol-shoving. Statements about what the game can or cannot produce are not moves in the game — they live on a meta-level. The distinction between inside and outside the system is the most important distinction in the book.

Why it matters

The MIU-system in full

The system has one axiom, MI, and four rules of inference. Rule I: if a string ends in I, you may append U (e.g. MI becomes MIU). Rule II: if you have Mx (where x is any string of Is and Us), you may form Mxx (doubling everything after the M). Rule III: any III inside a string may be replaced by U. Rule IV: any UU inside a string may be deleted. That is the whole system — there is no fifth rule, no exception, no special case for the goal string.

Starting from MI the rules generate MII, MIIII, MIIIIIIII, MUI, MIU, MUIIU, and a vast tree of further strings. Every string contains exactly one M, at the front; the body after the M is a sequence of Is and Us. The question is whether MU — a string with zero Is in the body — is reachable.

The trap: trying to solve it from inside

Hofstadter spends pages walking the reader through false starts. You try Rule II to double MI into MII. Then again into MIIII. You apply Rule III to convert three Is into a U. You apply Rule I to add Us. You manage to eliminate Is in pairs of three. But the count of Is keeps shifting in ways you can't quite control, and every attempt to land on MU seems to leave at least one I stuck somewhere.

The mistake is structural. You are looking for a sequence of moves — a derivation — but the question "is MU derivable?" is not itself a derivation. It is a question about what derivations exist. No application of the four rules can ever produce the answer "no derivation exists." The system is opaque to its own meta-questions.

Jumping out: the invariant on I-count modulo 3

The way out is to notice a hidden conserved quantity. Count the number of Is in any reachable string and look at it modulo 3 (the remainder when divided by 3). The axiom MI has one I; one mod three is one. Watch what each rule does to the I-count modulo 3:

  • Rule I adds a U, so the I-count is unchanged. (1 mod 3 is still 1.)
  • Rule II doubles the body, so the I-count doubles. If you start with n Is mod 3, you end with 2n mod 3. (1 doubles to 2; 2 doubles to 4, which is 1; never 0.)
  • Rule III turns three Is into a U, so the I-count drops by exactly 3. (Mod 3, no change.)
  • Rule IV deletes UU, leaving the I-count alone.

So the I-count mod 3 starts at 1 and can only become 2 (via Rule II from 1) or 1 (via Rule II from 2). It can never become 0. But MU has zero Is, and zero mod 3 is zero. Therefore MU is unreachable. The argument never produces a single MIU-derivation. It runs entirely outside the system, in arithmetic.

Inside vs outside as a permanent distinction

Once you see this argument you can never un-see it. The system as a game could not give you the answer; the system as an object of arithmetic gave you the answer in three lines. The reader now has a felt example of the difference between theorem (a string the rules can produce) and truth about the system (a fact established by reasoning outside the rules). The Three-Part Invention dialogue that opens this section is itself a tiny example of the same move: Zeno's tortoise generates the regress, and Achilles must reason about the regress rather than continue it.

Key takeaways

Mental model

Mental model

Practical application

The MIU-puzzle is a portable diagnostic for any closed rule-based system you encounter. The pattern transfers cleanly.

1. What are the actual rules? Most people argue inside systems whose rules they have never enumerated. Write them down. The MIU-system has four; until you list them you cannot reason about what they preserve.

2. What property does every starting position have? Whatever it is, the rules either preserve it or they do not. If they preserve it, every reachable position has the same property; positions lacking it are unreachable.

3. Is the question I'm asking expressible as a rule-move, or is it a question about the rules? This is the meta-move. "Can I win this game?" is sometimes a move; "Is this game winnable?" is always a meta-question. Trying to settle a meta-question with object-level moves is the most common kind of wasted effort in argument.

Example

Consider Sudoku. The "rules" are: each row, column, and 3×3 box must contain the digits 1–9 exactly once. A solution is a derivation. Most players, when stuck, try harder moves — more elimination, more guess-and-backtrack — staying inside the rule system.

But suppose someone hands you a partly filled grid and asks: "Is this puzzle solvable?" That is the MU question. You can play the rules forever without resolving it. The fast answer comes from an invariant: count the candidates in each unit. If any unit has 9 cells but only 8 distinct possible values across all of them, no Sudoku rule can ever land a 9th digit; the puzzle is unsolvable. The argument never used a sudoku-move. It used the meta-property "each unit needs 9 distinct values."

Once you see this you can apply it everywhere. Why can't you build a perpetual-motion machine using clever gears? Energy conservation — an invariant the gears cannot disturb. Why can't a deterministic program reliably tell you whether another program halts? Cantor's diagonal — an invariant the programs cannot disturb. The MIU-puzzle is the same shape: an invariant the rules cannot disturb, plus a target that violates it.

Continue exploring

Tags