The Probability of a Name: When Math Meets Meaning

2026-05-09 · 5 min read

How rare is your name, mathematically? Not "how unusual does it sound" but: if you generated random strings of letters all day, how long would it take to land on yours by accident? The answer is unintuitive, and it also happens to be exactly the question the name-fortune game on this site is computing.

The simple version

Pick a random letter from the English alphabet. The chance that you picked any specific letter — say "I" — is one in twenty-six. Pick two random letters in a row. The chance you picked "IG" is one in 26 × 26, which is one in 676. Pick four letters and the chance you spelled "IGOR" is one in 26⁴, which is one in 456,976. Each additional letter multiplies the denominator by twenty-six.
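
To see the multiplication in action, here is a minimal sketch that draws random two-letter strings until one spells "IG" and averages the wait over many runs. The function name `tries_until_match`, the seed, and the run count are illustrative choices, not the code the game on this site uses.

```python
import random
import string

def tries_until_match(name, rng):
    """Draw random uppercase strings of len(name) until one equals name."""
    alphabet = string.ascii_uppercase  # the 26 Latin letters
    tries = 0
    while True:
        tries += 1
        candidate = "".join(rng.choice(alphabet) for _ in name)
        if candidate == name:
            return tries

rng = random.Random(42)  # fixed seed so the experiment is repeatable
runs = [tries_until_match("IG", rng) for _ in range(1000)]
average = sum(runs) / len(runs)
print(f"average tries for 'IG': {average:.0f} (theory: {26 ** 2})")
```

The average should land near 676, though individual runs vary a lot. Brute force like this is only practical for very short names, which is the point of the next sections.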

In symbols: for an alphabet of size A and a name of length n, the probability of one random string spelling the name is p = (1/A)ⁿ. For Cyrillic, A is 33; for Latin, 26.
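
The formula is short enough to check by hand, but a small sketch makes it easy to try other names and alphabets. The helper names `match_probability` and `expected_tries` are made up for this illustration.

```python
def match_probability(name_length, alphabet_size=26):
    """Probability that one uniformly random string spells the name: (1/A)**n."""
    return (1 / alphabet_size) ** name_length

def expected_tries(name_length, alphabet_size=26):
    """Reciprocal of that probability, A**n, kept as an exact integer."""
    return alphabet_size ** name_length

for name in ["IG", "IGOR", "EMILY"]:
    n = len(name)
    print(f"{name}: p = {match_probability(n):.3e}, 1/p = {expected_tries(n):,}")

# Same length, larger alphabet: 33 letters, the Cyrillic figure used above.
print(f"4 letters, A = 33: 1/p = {expected_tries(4, alphabet_size=33):,}")
```

A four-letter name in a 33-letter alphabet comes out at 1,185,921 possible strings, against 456,976 for the 26-letter case, so a larger alphabet raises the count at every length.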

Where the intuition breaks

Names of length 5 still feel small. Five letters, surely the universe can find them quickly. Run the numbers: 26⁵ is 11,881,376. The expected number of random Latin five-letter strings before one of them spells "EMILY" is about twelve million. That is roughly the population of Belgium, generated one at a time. A six-letter name pushes the expected wait to over three hundred million. A seven-letter name takes it past eight billion (26⁷ is 8,031,810,176), roughly one string for every human alive.

The lesson is that a name is more compressed information than it looks. Each letter is a choice from a fairly large set, and the choices multiply. Four letters' worth of difference between "AVA" and "DOROTHY" is the difference between roughly eighteen thousand and roughly eight billion expected attempts.

What "expected number of tries" actually means

When the game tells you the expected number of attempts is, say, twelve million, that does not mean it will take twelve million tries. It means the average over many runs would be twelve million. Any particular run could land in the first thousand attempts or take fifty million. This is the geometric distribution at work: the wait time for a single success in a sequence of independent trials with success probability p has mean 1/p, but a wide spread around that mean.

In theory, about half of all runs will finish in fewer than 0.69 × (1/p) attempts, because for small p the median of the geometric distribution is close to ln 2 ≈ 0.69 times its mean. The other half stretch into the long tail. If you watch the same name for ten runs, you will sometimes see a fast result and sometimes see one that drags on. That is the spread, not a malfunction.
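
The gap between mean and median is easy to check numerically. The sketch below samples geometric waiting times directly by inverse transform rather than generating strings one at a time, which would be far too slow for a five-letter name. The function name `sample_wait`, the seed, and the sample size are assumptions made for illustration.

```python
import math
import random
import statistics

def sample_wait(p, rng):
    """One geometric waiting time (tries up to and including the first success)."""
    u = rng.random()  # uniform in [0, 1)
    # Inverse transform: smallest k with 1 - (1 - p)**k >= u.
    # log1p keeps precision when p is tiny.
    return max(1, math.ceil(math.log1p(-u) / math.log1p(-p)))

p = 1 / 26 ** 5            # a five-letter Latin name such as "EMILY"
rng = random.Random(7)     # fixed seed so the run is repeatable
waits = [sample_wait(p, rng) for _ in range(20_000)]

mean_wait = statistics.mean(waits)
median_wait = statistics.median(waits)
print(f"mean   / (1/p) = {mean_wait * p:.3f}")    # should be close to 1
print(f"median / (1/p) = {median_wait * p:.3f}")  # should be close to ln 2
```

The first ratio should come out near 1 and the second near 0.69: the typical run is noticeably shorter than the average one, and a minority of very long runs pulls the mean up.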

Why the empirical probability rarely matches the theoretical

After one match, the empirical probability is 1 / tries. If theory predicted 1 / 12,000,000 and you matched on attempt 9,000,000, your empirical probability is 1 / 9,000,000 — about 33% higher than theoretical. This is normal. Single-success estimates are noisy. The game also reports a relative deviation; do not be alarmed when it is large. With one event you cannot estimate p tightly.
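
The arithmetic in that example can be written out as a short sketch. The game's exact definition of relative deviation is not spelled out here, so the formula below, (empirical − theoretical) / theoretical, is an assumption, and the function names are hypothetical.

```python
def empirical_probability(tries):
    """Single-success estimate of p: one match in `tries` attempts."""
    return 1 / tries

def relative_deviation(tries, p_theory):
    """Assumed definition: (empirical - theoretical) / theoretical."""
    return (empirical_probability(tries) - p_theory) / p_theory

# The example from the text: theory says 1/12,000,000, match on attempt 9,000,000.
dev = relative_deviation(9_000_000, 1 / 12_000_000)
print(f"{dev:+.1%}")  # +33.3%
```

Under this definition a run that finishes early gives a positive deviation and a run that drags on gives a negative one, which is why the reported figure swings so widely from run to run.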

Names in the real world

Real-world name distributions are not uniform: not every five-letter string is equally likely to be a name. There are roughly half a million unique given names in current use globally. Among Latin five-letter strings, only a fraction read as plausible names. The math here treats every string of letters as equally probable: in the model, "EMILY" and "ZQXVK" both have probability 1/11,881,376, but only one is a name a person actually has. The model deliberately ignores how letters combine in real names, drawing each one independently and uniformly; it tests intuition about combinatorics, not phonetics.

Bringing it back to numerology

Numerology compresses your name to a single digit. The probability framing here decompresses your name to its full combinatorial size. Both are useful angles. Numerology gives you one number with a story attached. The probability view gives you a denominator with a sense of scale: the universe would have to throw a lot of dice to land on you by chance. Even if you don't believe in fate, that is a satisfying number to know.