Detecting heritable phenotypes without a model using fast permutation testing for heritability and set-tests - PubMed (original) (raw)

Comparative Study

Detecting heritable phenotypes without a model using fast permutation testing for heritability and set-tests

Regev Schweiger et al. Nat Commun. 2018.

Abstract

Testing for association between a set of genetic markers and a phenotype is a fundamental task in genetic studies. Standard approaches for heritability and set testing strongly rely on parametric models that make specific assumptions regarding phenotypic variability. Here, we show that resulting p-values may be inflated by up to 15 orders of magnitude, in a heritability study of methylation measurements, and in a heritability and expression quantitative trait loci analysis of gene expression profiles. We propose FEATHER, a method for fast permutation-based testing of marker sets and of heritability, which properly controls for false-positive results. FEATHER eliminated 47% of methylation sites found to be heritable by the parametric test, suggesting a substantial inflation of false-positive findings by alternative methods. Our approach can rapidly identify heritable phenotypes out of millions of phenotypes acquired via high-throughput technologies, does not suffer from model misspecification and is highly efficient.

PubMed Disclaimer

Conflict of interest statement

R.S. is an employee of MyHeritage Ltd. The remaining authors declare no competing interests.

Figures

Fig. 1

Discrepancy in _p_-values in a methylation study. _p_-values from 10,000 permutations, compared to GCTA _p_-values assuming asymptotics (in log scale). Evaluated on 431,366 methylation sites on all autosomal chromosomes, from the KORA dataset, with 1799 individuals, and with sex, age, and smoking status as covariates. Sites with ĥ2=0 or with a parametric p < 10−20 omitted for clarity of presentation, with 99.995% confidence intervals (CIs) shown. Parametric _p_-values are often smaller than the exact _p_-values obtained by the permutation test, frequently by several orders of magnitude, resulting in many false positives

Fig. 2

Discrepancy in _p_-values in a cis-eQTL study. _p_-values from 10,000 permutations, compared to GCTA _p_-values assuming asymptotics (in log scale). Evaluated on 22,171 gene expression profiles in whole-blood samples, from the GTEx dataset, with 338 individuals. Sites with ĥ2=0 (8604 profiles) omitted for clarity of presentation, with 99.995% CIs. Parametric _p_-values are often smaller than the exact _p_-values obtained by the permutation test, frequently by several orders of magnitude, resulting in many false positives

Fig. 3

Discrepancy in _p_-values, with quantile normalization. _p_-values after quantile normalization, from 10,000 permutations, compared to GCTA _p_-values assuming asymptotics (in log scale). Parametric _p_-values show large discrepancies compared to the exact _p_-values obtained by the permutation test, frequently by several orders of magnitude, resulting in many false positives and negatives

Fig. 4

Performance of SAMC. _p_-values from 10,000 permutations, compared to SAMC _p_-values with _t_0 = 1,000 and 1,000,000 permutations (in log scale). Evaluated on 7989 methylation sites on chromosome 22, from the KORA dataset. Sites with ĥ2=0 (3,779 sites), omitted for clarity of presentation, showing a total of 4210 sites, with 99.95% CIs shown. SAMC is well calibrated

References

1. Price AL, et al. Single-tissue and cross-tissue heritability of gene expression via identity-by-descent in related or unrelated individuals. PLoS Genet. 2011;7:e1001317. doi: 10.1371/journal.pgen.1001317. -DOI -PMC -PubMed
1. Wright FA, et al. Heritability and genomics of gene expression in peripheral blood. Nat. Genet. 2014;46:430–437. doi: 10.1038/ng.2951. -DOI -PMC -PubMed
1. Lloyd-Jones LR. The genetic architecture of gene expression in peripheral blood. Am J Hum Genet. 2017;100:228–237. doi: 10.1016/j.ajhg.2016.12.008. -DOI -PMC -PubMed
1. Sun S, et al. Differential expression analysis for RNAseq using Poisson mixed models. Nucleic Acids Res. 2017;45:e106–e106. doi: 10.1093/nar/gkx204. -DOI -PMC -PubMed
1. Bell JT, Spector TD. DNA methylation studies using twins: what are they telling us? Genome Biol. 2012;13:172. doi: 10.1186/gb-2012-13-10-172. -DOI -PMC -PubMed

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Detecting heritable phenotypes without a model using fast permutation testing for heritability and set-tests - PubMed (original) (raw)