Efficient study designs for test of genetic association using sibship data and unrelated cases and controls - PubMed (original) (raw)
. 2006 May;78(5):778-792.
doi: 10.1086/503711. Epub 2006 Mar 20.
Affiliations
- PMID: 16642434
- PMCID: PMC1474028
- DOI: 10.1086/503711
Efficient study designs for test of genetic association using sibship data and unrelated cases and controls
Mingyao Li et al. Am J Hum Genet. 2006 May.
Abstract
Linkage mapping of complex diseases is often followed by association studies between phenotypes and marker genotypes through use of case-control or family-based designs. Given fixed genotyping resources, it is important to know which study designs are the most efficient. To address this problem, we extended the likelihood-based method of Li et al., which assesses whether there is linkage disequilibrium between a disease locus and a SNP, to accommodate sibships of arbitrary size and disease-phenotype configuration. A key advantage of our method is the ability to combine data from different family structures. We consider scenarios for which genotypes are available for unrelated cases, affected sib pairs (ASPs), or only one sibling per ASP. We construct designs that use cases only and others that use unaffected siblings or unrelated unaffected individuals as controls. Different combinations of cases and controls result in seven study designs. We compare the efficiency of these designs when the number of individuals to be genotyped is fixed. Our results suggest that (1) when the disease is influenced by a single gene, the one sibling per ASP-control design is the most efficient, followed by the ASP-control design, and familial cases contribute more association information than singleton cases; (2) when the disease is influenced by multiple genes, familial cases provide more association information than singleton cases, unless the effect of the locus being tested is much smaller than at least one other untested disease locus; and (3) the case-control design can be useful for detecting genes with small effect in the presence of genes with much larger effect. Our findings will be helpful for researchers designing and analyzing complex disease-association studies and will facilitate genotyping resource allocation.
Figures
Figure 1
Association study designs. The black arrows denote individuals to be genotyped at the candidate SNP. The number of individuals to be genotyped at the SNP is fixed at 1,000 for each study design.
Figure 2
Histograms of ranks for different study designs. Results are based on 2,000 replicates of the corresponding sampling units for each study design. All models have disease prevalence of
_K_=5%
and sibling recurrence risk ratio of
λ_s_=1.02
. Power is assessed at the 1% level. For each disease model in table 2 and at each level of disease-SNP LD (
_r_2=.25
, .50, .75, and 1), the seven study designs are ranked by estimated power.
Figure 3
Comparison of case-control design and one sibling per ASP–control design, under five-locus disease models, when the effect size of the test locus increases and the effect size of the four remaining disease loci are fixed. Results are based on 2,000 replicate data sets. The disease prevalence
_K_=5%
. The disease is influenced by five unlinked disease loci, each with a predisposing-allele frequency of 0.1. The SNP, with a minor-allele frequency of 0.1, is completely linked to the first disease locus, and
_r_2
between the two loci is 0.5. All disease loci follow an additive model, with locus-specific
λ_s_
at the test locus increasing from 1.02 to 1.25 and the locus-specific
λ_s_
for the four remaining disease loci fixed at 1.02. Power is assessed at the 1% level. The solid line is for design with 500 cases (one sibling per ASP) and 500 controls. The dashed line is for design with 500 cases and 500 controls.
Figure 4
Power comparison of case-control design and one sibling per ASP–control design, under multilocus disease models, when the effect size of each disease locus is fixed and the number of disease loci increases. Results are based on 2,000 replicate data sets. The disease prevalence
_K_=5%
. The disease is influenced by L (
2⩽_L_⩽10
) unlinked disease loci, each with a predisposing-allele frequency of 0.1. The SNP, with a minor-allele frequency of 0.1, is completely linked to the first disease locus, and
_r_2
between the two loci is 0.5. All disease loci follow a dominant, an additive, or a recessive model, with locus-specific
λ_s_
at each disease locus fixed at 1.02. Power is assessed at the 1% level. The solid line is for design with 500 cases (one sibling per ASP) and 500 controls. The dashed line is for design with 500 cases and 500 controls.
Figure 5
Power comparison of case-control design and one sibling per ASP–control design, under five-locus disease models, when the effect size of the large-effect background disease locus increases and the effect size of the small-effect disease loci, including the test locus, is fixed. Results are based on 2,000 replicate data sets. The disease prevalence
_K_=5%
. The disease is influenced by five unlinked disease loci, each with a predisposing-allele frequency of 0.1. The SNP, with a minor-allele frequency of 0.1, is completely linked to the first disease locus, and
_r_2
between the two loci is 0.5. All disease loci follow a dominant, an additive, or a recessive model, with locus-specific fixed
λ_s_
at the large effect background disease locus increasing from 1.02 to 1.7 and the locus-specific
λ_s_
for the small effect disease loci, including the testing locus, fixed at 1.02. Power is assessed at the 1% level. The solid line is for design with 500 cases (one sibling per ASP) and 500 controls. The dashed line is for design with 500 cases and 500 controls.
Similar articles
- Joint modeling of linkage and association: identifying SNPs responsible for a linkage signal.
Li M, Boehnke M, Abecasis GR. Li M, et al. Am J Hum Genet. 2005 Jun;76(6):934-49. doi: 10.1086/430277. Epub 2005 Apr 5. Am J Hum Genet. 2005. PMID: 15877278 Free PMC article. - Detection of disease genes by use of family data. I. Likelihood-based theory.
Whittemore AS, Tu IP. Whittemore AS, et al. Am J Hum Genet. 2000 Apr;66(4):1328-40. doi: 10.1086/302851. Epub 2000 Mar 29. Am J Hum Genet. 2000. PMID: 10739758 Free PMC article. - The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases I. DNA pooling.
Risch N, Teng J. Risch N, et al. Genome Res. 1998 Dec;8(12):1273-88. doi: 10.1101/gr.8.12.1273. Genome Res. 1998. PMID: 9872982 Review. - Sibship T2 association tests of complex diseases for tightly linked markers.
Fan R, Knapp M. Fan R, et al. Hum Genomics. 2005 Jun;2(2):90-112. doi: 10.1186/1479-7364-2-2-90. Hum Genomics. 2005. PMID: 16004725 Free PMC article. Review.
Cited by
- Novel genetic variant associated with globoid cell leukodystrophy in a family of mixed breed dogs.
Hammack S, Hague DW, Vieson MD, Esdaile E, Hughes SS, Bellone RR, McCoy AM. Hammack S, et al. J Vet Intern Med. 2023 Sep-Oct;37(5):1710-1715. doi: 10.1111/jvim.16822. Epub 2023 Aug 18. J Vet Intern Med. 2023. PMID: 37593836 Free PMC article. - Population genetics of marmosets in Asian primate research centers and loci associated with epileptic risk revealed by whole-genome sequencing.
Yang X, Mao Y, Wang XK, Ma DN, Xu Z, Gong N, Henning B, Zhang X, He G, Shi YY, Eichler EE, Li ZQ, Takahashi E, Li WD. Yang X, et al. Zool Res. 2023 Sep 18;44(5):837-847. doi: 10.24272/j.issn.2095-8137.2022.514. Zool Res. 2023. PMID: 37501399 Free PMC article. - A Rare Variant Nonparametric Linkage Method for Nuclear and Extended Pedigrees with Application to Late-Onset Alzheimer Disease via WGS Data.
Zhao L, He Z, Zhang D, Wang GT, Renton AE, Vardarajan BN, Nothnagel M, Goate AM, Mayeux R, Leal SM. Zhao L, et al. Am J Hum Genet. 2019 Oct 3;105(4):822-835. doi: 10.1016/j.ajhg.2019.09.006. Am J Hum Genet. 2019. PMID: 31585107 Free PMC article. - Rediscovering the value of families for psychiatric genetics research.
Glahn DC, Nimgaonkar VL, Raventós H, Contreras J, McIntosh AM, Thomson PA, Jablensky A, McCarthy NS, Charlesworth JC, Blackburn NB, Peralta JM, Knowles EEM, Mathias SR, Ament SA, McMahon FJ, Gur RC, Bucan M, Curran JE, Almasy L, Gur RE, Blangero J. Glahn DC, et al. Mol Psychiatry. 2019 Apr;24(4):523-535. doi: 10.1038/s41380-018-0073-x. Epub 2018 Jun 28. Mol Psychiatry. 2019. PMID: 29955165 Free PMC article. Review. - Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease.
Fernández MV, Budde J, Del-Aguila JL, Ibañez L, Deming Y, Harari O, Norton J, Morris JC, Goate AM; NIA-LOAD family study group; NCRAD; Cruchaga C. Fernández MV, et al. Front Neurosci. 2018 Apr 4;12:209. doi: 10.3389/fnins.2018.00209. eCollection 2018. Front Neurosci. 2018. PMID: 29670507 Free PMC article.
References
Web Resource
- University of Michigan Center for Statistical Genetics, http://www.sph.umich.edu/csg/abecasis/lamp/
References
- Risch N, Teng J (1998) The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases. I. DNA pooling. Genome Res 8:1273–1288 - PubMed
Publication types
MeSH terms
Grants and funding
- R01 EY012562/EY/NEI NIH HHS/United States
- R01 HG002651/HG/NHGRI NIH HHS/United States
- R01 HG000376/HG/NHGRI NIH HHS/United States
- EY12562/EY/NEI NIH HHS/United States
- HG02651/HG/NHGRI NIH HHS/United States
- HG00376/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources