Large-scale discovery of novel genetic causes of developmental disorders - PubMed (original) (raw)
. 2015 Mar 12;519(7542):223-8.
doi: 10.1038/nature14135. Epub 2014 Dec 24.
Collaborators
- PMID: 25533962
- PMCID: PMC5955210
- DOI: 10.1038/nature14135
Large-scale discovery of novel genetic causes of developmental disorders
Deciphering Developmental Disorders Study. Nature. 2015.
Abstract
Despite three decades of successful, predominantly phenotype-driven discovery of the genetic causes of monogenic disorders, up to half of children with severe developmental disorders of probable genetic origin remain without a genetic diagnosis. Particularly challenging are those disorders rare enough to have eluded recognition as a discrete clinical entity, those with highly variable clinical manifestations, and those that are difficult to distinguish from other, very similar, disorders. Here we demonstrate the power of using an unbiased genotype-driven approach to identify subsets of patients with similar disorders. By studying 1,133 children with severe, undiagnosed developmental disorders, and their parents, using a combination of exome sequencing and array-based detection of chromosomal rearrangements, we discovered 12 novel genes associated with developmental disorders. These newly implicated genes increase by 10% (from 28% to 31%) the proportion of children that could be diagnosed. Clustering of missense mutations in six of these newly implicated genes suggests that normal development is being perturbed by an activating or dominant-negative mechanism. Our findings demonstrate the value of adopting a comprehensive strategy, both genome-wide and nationwide, to elucidate the underlying causes of rare genetic disorders.
Figures
EDF1. Characteristics of the families
A. Gestation Adjusted Decimal Age at Last Clinical Assessment. Histogram showing the distribution of the gestation adjusted decimal age at last clinical assessment across the 1133 probands. The dashed red line shows the median age. B. Frequency of HPO Term Usage. Bar plot showing, for each used HPO term, the number of times it was observed across the 1133 proband patient records. C. Projection PCA plot of the 1133 probands. PCA plot of 1133 DDD probands projected onto a PCA analysis using 4 different HapMap populations from the 1000 genomes project. Black: African, Red: European, Green: East Asian, Blue: South Asian and the 1133 DDD probands are represented by orange triangles. D. Self Declared and Genetically Defined Consanguinity. Overlaid histogram showing the distribution of kinship coefficients from KING comparing parental samples for each trio. Green: Trios where consanguinity was not entered in the patient record on DECIPHER. Red: Trios consanguinity was declared in the patient record on DECIPHER.
EDF2. Number of Validated de novo SNVs and indels per Proband
Bar plot showing the distribution of the observed number of validated SNVs and indels per proband sample, and the expected distribution assuming a Poisson distribution with the same mean as the observed distribution.
EDF3. Number of Diagnoses per Gene
Histogram showing the number of diagnoses per gene for genes with at least two diagnoses from different proband samples.
EDF4. Burden of Large CNVs in 1133 DDD Proband Samples
Plot comparing the frequency of rare CNVs in three sample groups against CNV size. Y-axis is the on a log scale. Red: DDD probands who have not had previous microarray based genetic testing, Purple: DDD probands who have had negative previous microarray based genetic testing Green: DDD controls.
EDF5. Expected and observed numbers of de novo mutations
The expected and observed numbers of mutations of different functional consequences in three mutually exclusive sets of genes are shown, along with the p value from an assessment of a statistical excess of observed mutations. The three classes of genes are described in the main text.
EDF6. Haploinsufficiency analyses
A. Saturation analysis for detecting haploinsufficient DD genes. A boxplot showing the distribution of statistical power to detect a significant enrichment of LoF mutations across 18,272 genes in the genome, for different numbers of trios studied, from 1,000 trios to 12,000 trios. B. Distribution of haplinsufficiency scores in selected sets of de novo mutations. Violin plot of haploinsufficiency scores in five sets of de novo mutations: Silent - all synonymous mutations, Diagnostic - mutations in known DD genes in diagnosed individuals, Undiagnosed_Func - all functional mutations in undiagnosed individuals, Undiagnosed_LoF - All LoF mutations in undiagnosed individuals, Undiagnosed_recur - mutations in genes with recurrent functional mutations in undiagnosed individuals. P values for a Mann-Whitney test comparing each of the latter four distributions to that observed for the silent (synonymous) variants are plotted at the top of each violin.
Figure 1. Excess of recurrently mutated genes
Each panel shows the observed number of recurrently mutated genes (diamond) and the distribution of the number of recurrently mutated genes in 10,000 simulations (box indicates interquartile range, whiskers indicates 95% confidence interval) under a model of no gene-specific enrichment of mutations: a. all protein-altering mutations in all DDD children and undiagnosed DDD children, b. all LoF mutations in all DDD children and undiagnosed DDD children. Each diamond is annotated with the median excess of recurrently mutated genes, with 95% confidence intervals in brackets. P value of observed excess is <0.0001 for all four tests.
Figure 2. Gene-specific significance of enrichment for DNMs
The –log10(p) value of testing for mutation enrichment is plotted only for each gene with at least one mutation in DDD children. On the X-axis is the p value of the most significant test in the DDD dataset, and on the Y-axis is the minimal p value from the significance testing in the meta-analysis dataset. Red indicates genes already known to be associated with DDs (in DDG2P). Only genes with a p value of less than 0.05/18,272 (red lines) are labeled.
Figure 3. Five novel genes with clustered mutations
The domains (blue), post-translational modifications, and mutation locations (red stars) are shown for five proteins with highly clustered de novo mutations in unrelated children with severe, undiagnosed DDs. For two proteins (COL4A3BP and PCGF2) where all observed mutations are identical, photos are shown to highlight the facial similarities of patients carrying the same mutation.
Figure 4. Candidate gene Loss of Function modeling in zebrafish reveals enrichment for developmentally important proteins
a, Examples of developmental phenotypes: Knockdown of pkn2a results in reduced cartilaginous jaw structures (black arrows), knockdown of fryl results in cardiac and craniofacial defects (white arrowheads and arrows, respectively), while knockdown of psmd3 results in smaller ear primordia (red arrows), and mis-patterned CNS neurons (compare red double arrows and brackets). b, Knockdown outcomes of 7 genes with variants present in microcephaly patients: Interocular measurements of brightfield images from control and LoF embryos reveal significant decreases in head size. A neuronal antibody stain (anti-HuC/D, green channel) labels the brains of control and morphant zebrafish. Measurements taken across the widest extent of the midbrain identify significant reductions in brain size, likely underlying the concomitant head size reductions seen in brightfield. In b, tables show average percentage reduction in head and brain width, and p-values of a _t-_test.
Similar articles
- Prevalence and architecture of de novo mutations in developmental disorders.
Deciphering Developmental Disorders Study. Deciphering Developmental Disorders Study. Nature. 2017 Feb 23;542(7642):433-438. doi: 10.1038/nature21062. Epub 2017 Jan 25. Nature. 2017. PMID: 28135719 Free PMC article. - Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data.
Wright CF, Fitzgerald TW, Jones WD, Clayton S, McRae JF, van Kogelenberg M, King DA, Ambridge K, Barrett DM, Bayzetinova T, Bevan AP, Bragin E, Chatzimichali EA, Gribble S, Jones P, Krishnappa N, Mason LE, Miller R, Morley KI, Parthiban V, Prigmore E, Rajan D, Sifrim A, Swaminathan GJ, Tivey AR, Middleton A, Parker M, Carter NP, Barrett JC, Hurles ME, FitzPatrick DR, Firth HV; DDD study. Wright CF, et al. Lancet. 2015 Apr 4;385(9975):1305-14. doi: 10.1016/S0140-6736(14)61705-0. Epub 2014 Dec 17. Lancet. 2015. PMID: 25529582 Free PMC article. - Whole exome sequencing in family trios reveals de novo mutations in PURA as a cause of severe neurodevelopmental delay and learning disability.
Hunt D, Leventer RJ, Simons C, Taft R, Swoboda KJ, Gawne-Cain M; DDD study; Magee AC, Turnpenny PD, Baralle D. Hunt D, et al. J Med Genet. 2014 Dec;51(12):806-13. doi: 10.1136/jmedgenet-2014-102798. Epub 2014 Oct 23. J Med Genet. 2014. PMID: 25342064 Free PMC article. - Description of neurodevelopmental phenotypes associated with 10 genetic neurodevelopmental disorders: A scoping review.
Hanly C, Shah H, Au PYB, Murias K. Hanly C, et al. Clin Genet. 2021 Mar;99(3):335-346. doi: 10.1111/cge.13882. Epub 2020 Nov 18. Clin Genet. 2021. PMID: 33179249 Review. - [XXX syndrome].
Ohashi H. Ohashi H. Nihon Rinsho. 2006 Jun 28;Suppl 2:522-4. Nihon Rinsho. 2006. PMID: 16817456 Review. Japanese. No abstract available.
Cited by
- Epigenomic newborn screening for conditions with intellectual disability and autistic features in Australian newborns.
Alshawsh M, Wake M, Gecz J, Corbett M, Saffery R, Pitt J, Greaves R, Williams K, Field M, Cheong J, Bui M, Arora S, Sadedin S, Lunke S, Wall M, Amor DJ, Godler DE. Alshawsh M, et al. Epigenomics. 2024;16(18):1203-1214. doi: 10.1080/17501911.2024.2402681. Epub 2024 Oct 4. Epigenomics. 2024. PMID: 39365098 Free PMC article. - Oxygen-induced stress reveals context-specific gene regulatory effects in human brain organoids.
Umans BD, Gilad Y. Umans BD, et al. bioRxiv [Preprint]. 2024 Sep 3:2024.09.03.611030. doi: 10.1101/2024.09.03.611030. bioRxiv. 2024. PMID: 39282424 Free PMC article. Preprint. - Investigation of Pogz Gene Variants in Non-Syndromic Autism Spectrum Disorder.
Tozkır J, Yıldırım G, Demir S, Palabıyık O, Görker I, Gürkan H. Tozkır J, et al. Noro Psikiyatr Ars. 2024 Aug 9;67(3):208-212. doi: 10.29399/npa.28625. eCollection 2024. Noro Psikiyatr Ars. 2024. PMID: 39258134 Free PMC article. - Bayesian Rare Variant Analysis Identifies Novel Schizophrenia Putative Risk Genes.
Han S. Han S. J Pers Med. 2024 Aug 2;14(8):822. doi: 10.3390/jpm14080822. J Pers Med. 2024. PMID: 39202013 Free PMC article. - Thirteen New Patients of PPP2R5D Gene Mutation and the Fine Profile of Genotype-Phenotype Correlation Unraveling the Pathogenic Mechanism Underlying Macrocephaly Phenotype.
Jiang Y, Wu B, Zhang X, Yang L, Wang S, Li H, Zhou S, Qian Y, Wang H. Jiang Y, et al. Children (Basel). 2024 Jul 26;11(8):897. doi: 10.3390/children11080897. Children (Basel). 2024. PMID: 39201832 Free PMC article.
References
- OMIM. Online Mendelian Inheritance in Man, OMIM. 2014 http://omim.org
Publication types
MeSH terms
Substances
Grants and funding
- CZD/16/6/CSO_/Chief Scientist Office/United Kingdom
- DH_/Department of Health/United Kingdom
- WT098051/WT_/Wellcome Trust/United Kingdom
- 15934/CRUK_/Cancer Research UK/United Kingdom
- 098395/WT_/Wellcome Trust/United Kingdom
- WT_/Wellcome Trust/United Kingdom
- 100140/WT_/Wellcome Trust/United Kingdom
- MC_PC_U127561093/MRC_/Medical Research Council/United Kingdom
- 091986/WT_/Wellcome Trust/United Kingdom
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases
Miscellaneous