I've just pointed out the distinction between myoglobin and
hemoglobin. You may also remember that hemoglobin is a multimeric
protein consisting of four subunits, 2
subunits and 2
subunits. What you may not know is that in humans there are actually
two types of
hemoglobin and four types of
hemoglobin,
each coded by a different genetic locus (see
Table 1). The five
-globin loci (
,
,
, and two non-functional pseudogenes) are found in a
cluster on chromosome 16. The six
-globin loci (
,
,
,
,
, and a pseudogene) are found
in a cluster on chromosome 11. The myoglobin locus is on chromosome
22.
|
Not only do we have all of these different types of globin genes in our bodies, they're all related to one another. Comparative sequence analysis has shown that vertebrate myoglobin and hemoglobins diverged from one another about 450 million years ago. Figure 1 shows a phylogenetic analysis of globin genes from humans, mice, and a variety of Archaea. Focus your attention on the part of the tree that has human and mouse sequences. You'll notice two interesting things:
This pattern is exactly what we expect as a result of duplication and divergence. Up to the time that a gene becomes duplicated, its evolutionary history matches the evolutionary history of the organisms containing it. Once there are duplicate copies, each follows an independent evolutionary history. Each traces the history of speciation and divergence. And over long periods duplicate copies of the same gene share more recent common ancestry with copies of the same gene in a different species than they do with duplicate genes in the same genome.
A history of duplication an divergence in multigene families makes it
important to distinguish between two classes of related loci: those
that represent the same locus in different species and between which
divergence is a result of species divergence are orthologs. Those that represent different loci and between which
divergence occurred after duplication of an ancestral gene are paralogs. The
-globin loci of humans and chickens are
orthologous. The
- and
-globin loci of any pair of taxa
are paralogous.
As multigene families go, the globin family is relatively simple and
easy to understand. There are only about a dozen loci involved, one
isolated locus (myoglobin) and two clusters of loci (
- and
-globins). You'll find a diagram of the
-globin cluster
in Figure 2. As you can see the
-globins
are not only evolutionarily related to one another they occur
relatively close to one another on chromosome 11 in humans.
|
Other families are far more complex. Class I and class II MHC loci, for example are part of the same multigene family. Moreover, immunoglobulins, T-cell receptors, and, and MHC loci are part of a larger superfamily of genes, i.e., all are ultimately derived from a common ancestral gene by duplication and divergence. Table 2 lists a few examples of multigene families and superfamilies in the human genome and the number of proteins produced.
|