next up previous
Next: Bibliography Up: Detecting selection on nucleotide Previous: Kreitman and Hudson

Detecting selection in the human genome

I've already mentioned the HapMap project [1], a collection of genotype data at roughly 3.2M SNPs in the human genome. The data in phase II of the project were collected from four populations:

We expect genetic drift to result in allele frequency differences among populations, and we can summarize the extent of that differentiation at each locus with $F_{ST}$. If all HapMap SNPs are selectively neutral,4 then all loci should have the same $F_{ST}$ within the bounds of statistical sampling error and the evolutionary sampling due to genetic drift. A scan of human chromosome 7 reveals both a lot of variation in individual-locus estimates of $F_{ST}$ and a number of loci where there is substantially more differentiation among populations than is expected by chance (Figure 2). At very fine genomic scales we can detect even more outliers (Figure 3), suggesting that human populations have been subject to divergent selection pressures at many different loci.

Figure 2: Single-locus estimates of $F_{ST}$ along chromosome 7 in the HapMap data set. Blue dots denote outliers. Adjacent SNPs in this sample are separated, on average, by about 52kb.
\resizebox{\textwidth}{!}{\includegraphics{outlier.eps}}

Figure 3: Single-locus estimates of $F_{ST}$ along a portion of chromosome 7 in the HapMap data set. Black dots denote outliers. Solid bars refert to previously identified genes. Adjacent SNPs in this sample are separated, on average, by about 1kb.
8#1


next up previous
Next: Bibliography Up: Detecting selection on nucleotide Previous: Kreitman and Hudson
Kent Holsinger 2010-12-13