It's quite easy to extend this approach to multiple
alleles.5We're interested in seeing how far back in time we have to go before
all alleles are descended from a single common ancestor. We'll assume
that we have
alleles in our sample. The first thing we have to
calculate is the probability that any two of the alleles in our sample
are identical by descent from the immediately preceding generation. To
make the calculation easier, we assume that the effective size of the
population is large enough that the probability of two coalescent
events in a single generation is vanishingly small. We already know
that the probability of a coalescence in the immediately preceding
generation for two randomly chosen alleles is
. But there are
different pairs of alleles in our sample. So the
probability that one pair of these alleles is involved in a coalescent
event in the immediately preceding generation is
But this is, of course, only the first coalescent event. We were
interested in how long we have to wait until all alleles are
descended from a single common ancestor. Now is where Kingman's sneaky
trick comes in. After the first coalescent event, we now have
alleles in our sample, instead of
. So the whole process starts
over again with
alleles instead of
. Since the time to the
first coalescence depends only on the number of alleles in the sample
and not on how long the first coalescence event took, we can calculate
the average time until all coalescences have happened as
