next up previous
Next: Bibliography Up: Detecting selection on nucleotide Previous: Kreitman and Hudson

Detecting selection in the human genome

I've already mentioned the HapMap project [], a collection of genotype data at roughly 3.2M SNPs in the human genome. The data were collected from four populations:

We expect genetic drift to result in allele frequency differences among populations, and we can summarize the extent of that differentiation at each locus with $F_{ST}$. If all HapMap SNPs are selectively neutral,3 then all loci show have the same $F_{ST}$ within the bounds of statistical sampling error and the evolutionary sampling due to genetic drift. A scan of human chromosome 7 reveals both a lot of variation in individual-locus estimates of $F_{ST}$ and a number of loci where there is substantially more differentiation among populations than is expected by chance (Figure [*]). At very fine genomic scales we can detect even more outliers (Figure [*], suggesting that human populations have been subject to divergent selection pressures at many different loci.

Figure: Single-locus estimates of $F_{ST}$ along chromosome 7 in the HapMap data set. Blue dots denote outliers. Adjacent SNPs in this sample are separated, on average, by about 52kb.
\resizebox{\textwidth}{!}{\includegraphics{outlier.eps}}

Figure: Single-locus estimates of $F_{ST}$ along a portion of chromosome 7 in the HapMap data set. Black dots denote outliers. Solid bars refert to previously identified genes. Adjacent SNPs in this sample are separated, on average, by about 1kb.
\resizebox{\textwidth}{!}{\includegraphics[angle=270]{outlier-high.eps}}


next up previous
Next: Bibliography Up: Detecting selection on nucleotide Previous: Kreitman and Hudson
Kent Holsinger 2008-09-04