Allelic association with SNPs: metrics, populations, and the linkage disequilibrium map (original) (raw)

Linkage disequilibrium patterns vary substantially among populations

European Journal of Human Genetics, 2005

A major initiative to create a global human haplotype map has recently been launched as a tool to improve the efficiency of disease gene mapping. The 'HapMap' project will study common variants in depth in four (and to a lesser degree in up to 12) populations to catalogue haplotypes that are expected to be common to all populations. A hope of the 'HapMap' project is that much of the genome occurs in regions of limited diversity such that only a few of the SNPs in each region will capture the diversity and be relevant around the world. In order to explore the implications of studying only a limited number of populations, we have analyzed linkage disequilibrium (LD) patterns of three 175-320 kb genomic regions in 16 diverse populations with an emphasis on African and European populations. Analyses of these three genomic regions provide empiric demonstration of marked differences in frequencies of the same few haplotypes, resulting in differences in the amount of LD and very different sets of haplotype frequencies. These results highlight the distinction between the statistical concept of LD and the biological reality of haplotypes and their frequencies. The significant quantitative and qualitative variation in LD among populations, even for populations within a geographic region, emphasizes the importance of studying diverse populations in the HapMap project to assure broad applicability of the results.

Major factors influencing linkage disequilibrium by analysis of different chromosome regions in distinct populations: demography, chromosome recombination frequency and selection

Human Molecular Genetics, 2000

of the X chromosome. A dramatic increase of LD was found in the subisolate. Demographic history of populations therefore influences LD. Chromosome-region-specific effects, namely the pattern and frequency of homologous recombination, were next delineated by the analysis of chromosome 6p21, including the HLA region. Patterns of global LD in this region were very similar in the UK and Sardinian populations despite their entirely distinct demographies, and correlate well with the pattern of recombinations. Nevertheless, haplotypes extend across recombination hot spots indicative of selection of certain haplotypes. Subisolate aside, chromosome-regionspecific differences in LD patterns appear to be more important than the differences in intermarker LD between distinct populations.

Linkage disequilibrium in young genetically isolated Dutch population

European Journal of Human Genetics, 2004

The design and feasibility of genetic studies of complex diseases are critically dependent on the extent and distribution of linkage disequilibrium (LD) across the genome and between different populations. We have examined genomewide and region-specific LD in a young genetically isolated population identified in the Netherlands by genotyping approximately 800 Short Tandem Repeat markers distributed genomewide across 58 individuals. Several regions were analyzed further using a denser marker map. The permutationcorrected measure of LD was used for analysis. A significant (Po0.0004) relation between LD and genetic distance on a genomewide scale was found. Distance explained 4% of the total LD variation. For finemapping data, distance accounted for a larger proportion of LD variation (up to 39%). A notable similarity in the genomewide distribution of LD was revealed between this population and other young genetically isolated populations from Micronesia and Costa Rica. Our study population and experiment was simulated in silico to confirm our knowledge of the history of the population. High agreement was observed between results of analysis of simulated and empirical data. We conclude that our population shows a high level of LD similar to that demonstrated previously in other young genetic isolates. In Europe, there may be a large number of young genetically isolated populations that are similar in history to ours. In these populations, a similar degree of LD is expected and thus they may be effectively used for linkage or LD mapping.

The Extent of Linkage Disequilibrium in Four Populations with Distinct Demographic Histories

The American Journal of Human Genetics, 2000

The design and feasibility of whole-genome-association studies are critically dependent on the extent of linkage disequilibrium (LD) between markers. Although there has been extensive theoretical discussion of this, few empirical data exist. The authors have determined the extent of LD among 38 biallelic markers with minor allele frequencies 1.1, since these are most comparable to the common disease-susceptibility polymorphisms that association studies aim to detect. The markers come from three chromosomal regions-1,335 kb on chromosome 13q12-13, 380 kb on chromosome 19q13.2, and 120 kb on chromosome 22q13.3-which have been extensively mapped. These markers were examined in ∼1,600 individuals from four populations, all of European origin but with different demographic histories; Afrikaners, Ashkenazim, Finns, and East Anglian British. There are few differences, either in allele frequencies or in LD, among the populations studied. A similar inverse relationship was found between LD and distance in each genomic region and in each population. Mean D is .68 for marker pairs !5 kb apart and is .24 for pairs separated by 10-20 kb, and the level of LD is not different from that seen in unlinked marker pairs separated by 1500 kb. However, only 50% of marker pairs at distances !5 kb display sufficient LD ( ) to be D 1 .3 useful in association studies. Results of the present study, if representative of the whole genome, suggest that a whole-genome scan searching for common disease-susceptibility alleles would require markers spaced р5 kb apart.

Accuracy of haplotype estimation in a region of low linkage disequilibrium

BMC Genetics, 2005

We compared the accuracy of haplotype inferences at a 6 Mb region on chromosome 7 where significant linkage between a brain oscillation phenotype and a cholinergic muscarinic receptor gene was previously reported. Individual haplotype assignments and haplotype frequencies were estimated using 5, 10, and 14 consecutive Illumina single-nucleotide polymorphisms (SNPs) within the 1-LOD unit support interval of the chromosome 7 linkage peak. Initially, haplotypes were constructed incorporating phase information provided by relatives using the pedigree analysis package MERLIN. Population-based haplotypes were inferred using the haplotype estimation software HAPLO.STATS and PHASE, using unrelated individuals.