A systematic genome-wide analysis of zebrafish protein-coding gene function - PubMed (original) (raw)
. 2013 Apr 25;496(7446):494-7.
doi: 10.1038/nature11992. Epub 2013 Apr 17.
Elisabeth M Busch-Nentwich, Steven A Harvey, Christopher M Dooley, Ewart de Bruijn, Freek van Eeden, Ian Sealy, Richard J White, Colin Herd, Isaac J Nijman, Fruzsina Fényes, Selina Mehroke, Catherine Scahill, Richard Gibbons, Neha Wali, Samantha Carruthers, Amanda Hall, Jennifer Yen, Edwin Cuppen, Derek L Stemple
Affiliations
- PMID: 23594742
- PMCID: PMC3743023
- DOI: 10.1038/nature11992
A systematic genome-wide analysis of zebrafish protein-coding gene function
Ross N W Kettleborough et al. Nature. 2013.
Abstract
Since the publication of the human reference genome, the identities of specific genes associated with human diseases are being discovered at a rapid rate. A central problem is that the biological activity of these genes is often unclear. Detailed investigations in model vertebrate organisms, typically mice, have been essential for understanding the activities of many orthologues of these disease-associated genes. Although gene-targeting approaches and phenotype analysis have led to a detailed understanding of nearly 6,000 protein-coding genes, this number falls considerably short of the more than 22,000 mouse protein-coding genes. Similarly, in zebrafish genetics, one-by-one gene studies using positional cloning, insertional mutagenesis, antisense morpholino oligonucleotides, targeted re-sequencing, and zinc finger and TAL endonucleases have made substantial contributions to our understanding of the biological activity of vertebrate genes, but again the number of genes studied falls well short of the more than 26,000 zebrafish protein-coding genes. Importantly, for both mice and zebrafish, none of these strategies are particularly suited to the rapid generation of knockouts in thousands of genes and the assessment of their biological activity. Here we describe an active project that aims to identify and phenotype the disruptive mutations in every zebrafish protein-coding gene, using a well-annotated zebrafish reference genome sequence, high-throughput sequencing and efficient chemical mutagenesis. So far we have identified potentially disruptive mutations in more than 38% of all known zebrafish protein-coding genes. We have developed a multi-allelic phenotyping scheme to efficiently assess the effects of each allele during embryogenesis and have analysed the phenotypic consequences of over 1,000 alleles. All mutant alleles and data are available to the community and our phenotyping scheme is adaptable to phenotypic analysis beyond embryogenesis.
Figures
Figure 1. Exome sequencing
a, ENU-mutagenised G0 males are outcrossed to create a population of F1s. Genomic DNA is taken from F1s, which are either outcrossed or cryopreserved as sperm samples. b, F1 genomic DNA is then subjected to exome sequencing. Illumina libraries are made and hybridised to the 120mer biotinylated RNA whole exome baits. Streptavidin coated magnetic beads capture genomic DNA hybridised to the RNA baits, and all other DNA is discarded. Exome-enriched DNA fragments are sequenced. Blue represents exonic and red non-coding genomic DNA.
Figure 2. Mutation detection
a, The cumulative detection of nonsense and essential splice alleles. As each mutagenised library displayed different rates of mutagenesis the order that exomes were sequenced was randomised. b, The detection of non-synonymous mutations. Sequencing 808 exomes resulted in the identification of 85,338 non-synonymous alleles in 19655 genes corresponding to 75% of all protein-coding genes.
Figure 3. Phenotypic analysis of alleles
a, F1 individuals were outcrossed to produce an F2 family. The induced disruptive alleles for one family are shown. b, F2s were incrossed and genotyped. c, First round, phenotypically wild-type embryos were collected from each clutch at 5 dpf and genotyped for the mutations heterozygous in both parents. The number of homozygous mutant F3 embryos was assessed using a Chi-squared test (p-value cut-off <0.05). Mutations homozygous in less than 25% of embryos were suspected to cause a phenotype. Here, there were no homozygous embryos in the phenotypically wild-type set for the alleles sa365, sa371 and sa379. d-e, Second round, phenotypes present within each incross were genotyped for putative phenotypic mutations. d, slc22a7bsa365 shows pigment phenotype. e, mphosph10sa371 shows small head and pericardiac oedema phenotype. lamc1sa379 mutants are shown in Fig. 4 l, n.
Figure 4. Confirmation of causality through complementation crosses
Depicted are four examples, polymerase I polypeptide a (polr1a) (a-d), midasin homologue (yeast) (mdn1) (e-h), titin a (ttna) (i-k) and laminin subunit gamma-1 (lamc1) (l-n), where heterozygous carriers of two independent alleles in the same gene were used to generate compound heterozygote offspring. Where possible incrosses of individual alleles are shown as well. In all images non-phenotypic siblings are above and phenotypic homozygous mutant or compound heterozygous embryos below. a-c, At 48 hpf embryos homozygous for either sa1376, sa2745 or compound heterozygous for sa1376 and sa2745 have small eyes, a hydrocephalic hindbrain and pericardiac oedema. d, sa2745 disrupts a splice donor site through a G>A transition at the first intronic nucleotide 3′ of coding nucleotide 985. Allele sa1376 is a C>T transition producing a premature STOP codon at amino acid (aa) 1487. e-g, At 96 hpf embryos homozygous for either sa1349, sa6631 or compound heterozygous for sa1349 and sa6631 have smaller heads with malformed jaws and mild pericardiac oedema. h, sa1349 and sa6631 produce premature STOP codons at aa 4597 (T>A transversion) and aa 5333 (G>A transition), respectively. i, j, At 48 hpf embryos homozygous for sa787 or compound heterozygous for sa787 and sa2492 are growth retarded, paralysed and have pericardiac oedema. k, Alleles sa787 and sa2492 produce premature STOP codons at aa 24946 (C>T transition) and aa 27471 (C>T transition), respectively. l, m, At 24 hpf embryos homozygous for sa379 or compound heterozygous for sa379 and m466 are shorter with an undifferentiated notochord, and brain and eye malformations. n, Allele m466 is a G>A transition producing a premature STOP codon at aa 13. Allele sa379 disrupts a splice acceptor site through a G>A transition one nucleotide 5′ of coding nucleotide 975.
Comment in
- Genomics: Zebrafish earns its stripes.
Schier AF. Schier AF. Nature. 2013 Apr 25;496(7446):443-4. doi: 10.1038/nature12094. Epub 2013 Apr 17. Nature. 2013. PMID: 23594741 No abstract available. - Genomics: New zebrafish genome resources.
Burgess DJ. Burgess DJ. Nat Rev Genet. 2013 Jun;14(6):368-9. doi: 10.1038/nrg3500. Epub 2013 Apr 30. Nat Rev Genet. 2013. PMID: 23629549 No abstract available.
Similar articles
- A large-scale zebrafish gene knockout resource for the genome-wide study of gene function.
Varshney GK, Lu J, Gildea DE, Huang H, Pei W, Yang Z, Huang SC, Schoenfeld D, Pho NH, Casero D, Hirase T, Mosbrook-Davis D, Zhang S, Jao LE, Zhang B, Woods IG, Zimmerman S, Schier AF, Wolfsberg TG, Pellegrini M, Burgess SM, Lin S. Varshney GK, et al. Genome Res. 2013 Apr;23(4):727-35. doi: 10.1101/gr.151464.112. Epub 2013 Feb 4. Genome Res. 2013. PMID: 23382537 Free PMC article. - Multi-allelic phenotyping--a systematic approach for the simultaneous analysis of multiple induced mutations.
Dooley CM, Scahill C, Fényes F, Kettleborough RN, Stemple DL, Busch-Nentwich EM. Dooley CM, et al. Methods. 2013 Aug 15;62(3):197-206. doi: 10.1016/j.ymeth.2013.04.013. Epub 2013 Apr 23. Methods. 2013. PMID: 23624102 Free PMC article. - A Multifunctional Mutagenesis System for Analysis of Gene Function in Zebrafish.
Quach HN, Tao S, Vrljicak P, Joshi A, Ruan H, Sukumaran R, Varshney GK, LaFave MC; Ds Screen Team; Burgess SM, Winkler C, Emelyanov A, Parinov S, Sampath K. Quach HN, et al. G3 (Bethesda). 2015 Apr 2;5(6):1283-99. doi: 10.1534/g3.114.015842. G3 (Bethesda). 2015. PMID: 25840430 Free PMC article. - Functional genomics tools for the analysis of zebrafish pigment.
Pickart MA, Sivasubbu S, Nielsen AL, Shriram S, King RA, Ekker SC. Pickart MA, et al. Pigment Cell Res. 2004 Oct;17(5):461-70. doi: 10.1111/j.1600-0749.2004.00189.x. Pigment Cell Res. 2004. PMID: 15357832 Review. - Mutagenesis and phenotyping resources in zebrafish for studying development and human disease.
Varshney GK, Burgess SM. Varshney GK, et al. Brief Funct Genomics. 2014 Mar;13(2):82-94. doi: 10.1093/bfgp/elt042. Epub 2013 Oct 26. Brief Funct Genomics. 2014. PMID: 24162064 Free PMC article. Review.
Cited by
- Aligning with the 3Rs: alternative models for research into muscle development and inherited myopathies.
Mehmood H, Kasher PR, Barrett-Jolley R, Walmsley GL. Mehmood H, et al. BMC Vet Res. 2024 Oct 18;20(1):477. doi: 10.1186/s12917-024-04309-z. BMC Vet Res. 2024. PMID: 39425123 Review. - Functional role of myosin-binding protein H in thick filaments of developing vertebrate fast-twitch skeletal muscle.
Mead AF, Wood NB, Nelson SR, Palmer BM, Yang L, Previs SB, Ploysangngam A, Kennedy GG, McAdow JF, Tremble SM, Zimmermann MA, Cipolla MJ, Ebert AM, Johnson AN, Gurnett CA, Previs MJ, Warshaw DM. Mead AF, et al. J Gen Physiol. 2024 Dec 2;156(12):e202413604. doi: 10.1085/jgp.202413604. Epub 2024 Oct 7. J Gen Physiol. 2024. PMID: 39373654 - Zebrafish Models for Skeletal and Extraskeletal Osteogenesis Imperfecta Features: Unveiling Pathophysiology and Paving the Way for Drug Discovery.
Masiero C, Aresi C, Forlino A, Tonelli F. Masiero C, et al. Calcif Tissue Int. 2024 Sep 25. doi: 10.1007/s00223-024-01282-5. Online ahead of print. Calcif Tissue Int. 2024. PMID: 39320469 Review. - Presynaptic Nrxn3 is essential for ribbon-synapse maturation in hair cells.
Jukic A, Lei Z, Cebul ER, Pinter K, Tadesse Y, Jarysta A, David S, Mosqueda N, Tarchini B, Kindt K. Jukic A, et al. Development. 2024 Oct 1;151(19):dev202723. doi: 10.1242/dev.202723. Epub 2024 Oct 10. Development. 2024. PMID: 39254120 Free PMC article. - The metalloproteinase PAPP-A is required for IGF-dependent chondrocyte differentiation and organization.
Harboe M, Kjaer-Sorensen K, Füchtbauer EM, Fenton RA, Thomsen JS, Brüel A, Oxvig C. Harboe M, et al. Sci Rep. 2024 Aug 29;14(1):20161. doi: 10.1038/s41598-024-71062-6. Sci Rep. 2024. PMID: 39215168 Free PMC article.
References
- Gossler A, Joyner AL, Rossant J, Skarnes WC. Mouse embryonic stem cells and reporter constructs to detect developmentally regulated genes. Science. 1989;244:463–465. - PubMed
- Skarnes WC, Auerbach BA, Joyner AL. A gene trap approach in mouse embryonic stem cells: the lacZ reported is activated by splicing, reflects endogenous gene expression, and is mutagenic in mice. Genes Dev. 1992;6:903–918. - PubMed
- Waterston RH, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi:10.1038/nature01262. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- 5R01HG00481/HG/NHGRI NIH HHS/United States
- R01 HG004819/HG/NHGRI NIH HHS/United States
- WT_/Wellcome Trust/United Kingdom
- 098051/WT_/Wellcome Trust/United Kingdom
- G0777791/MRC_/Medical Research Council/United Kingdom
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous