MAQGene: software to facilitate C. elegans mutant genome sequence analysis (original) (raw)

C. elegans Mutant Identification with a One-Step Whole-Genome-Sequencing and SNP Mapping Strategy

PLoS ONE, 2010

Whole-genome sequencing (WGS) is becoming a fast and cost-effective method to pinpoint molecular lesions in mutagenized genetic model systems, such as Caenorhabditis elegans. As mutagenized strains contain a significant mutational load, it is often still necessary to map mutations to a chromosomal interval to elucidate which of the WGS-identified sequence variants is the phenotype-causing one. We describe here our experience in setting up and testing a simple strategy that incorporates a rapid SNP-based mapping step into the WGS procedure. In this strategy, a mutant retrieved from a genetic screen is crossed with a polymorphic C. elegans strain, individual F2 progeny from this cross is selected for the mutant phenotype, the progeny of these F2 animals are pooled and then whole-genome-sequenced. The density of polymorphic SNP markers is decreased in the region of the phenotype-causing sequence variant and therefore enables its identification in the WGS data. As a proof of principle, we use this strategy to identify the molecular lesion in a mutant strain that produces an excess of dopaminergic neurons. We find that the molecular lesion resides in the Pax-6/Eyeless ortholog vab-3. The strategy described here will further reduce the time between mutant isolation and identification of the molecular lesion.

Caenorhabditis elegans mutant allele identification by whole-genome sequencing

Nature Methods, 2008

Identification of the molecular lesion in Caenorhabditis elegans mutants isolated through forward genetic screens usually involves time-consuming genetic mapping. We used Illumina deep sequencing technology to sequence a complete, mutant C. elegans genome and thus pinpointed a single-nucleotide mutation in the genome that affects a neuronal cell fate decision. This constitutes a proof-of-principle for using whole-genome sequencing to analyze C. elegans mutants. Correspondence should be addressed to I.P. (ip2169@columbia.edu) or O.H. (or38@columbia.edu).. AUTHOR CONTRIBUTIONS S.S. isolated and mapped lsy-12(ot177), performed the manual resequencing analysis and the RNAi analysis; S.P. performed the bioinformatic analysis; I.P. designed, performed and supervised the bioinformatic analysis; M.M.O. performed complementation tests and rescue analysis; and O.H. initiated and supervised the project and wrote the paper.

Whole-genome sequencing and variant discovery in C. elegans

Nature methods, 2008

Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and fo...

Rapid Mapping and Identification of Mutations in Caenorhabditis elegans by Restriction Site-Associated DNA Mapping and Genomic Interval Pull-Down Sequencing

Genetics, 2011

Forward genetic screens provide a powerful approach for inferring gene function on the basis of the phenotypes associated with mutated genes. However, determining the causal mutation by traditional mapping and candidate gene sequencing is often the rate-limiting step, especially when analyzing many mutants. We report two genomic approaches for more rapidly determining the identity of the affected genes in Caenorhabditis elegans mutants. First, we report our use of restriction site-associated DNA (RAD) polymorphism markers for rapidly mapping mutations after chemical mutagenesis and mutant isolation. Second, we describe our use of genomic interval pull-down sequencing (GIPS) to selectively capture and sequence megabase-sized portions of a mutant genome. Together, these two methods provide a rapid and cost-effective approach for positional cloning of C. elegans mutant loci, and are also applicable to other genetic model systems.

High-Throughput Gene Mapping in Caenorhabditis elegans

Genome Research

Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 ± 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18.

Whole-Genome Profiling of Mutagenesis in Caenorhabditis elegans

Genetics, 2010

Deep sequencing offers an unprecedented view of an organism's genome. We describe the spectrum of mutations induced by three commonly used mutagens: ethyl methanesulfonate (EMS), N-ethyl-Nnitrosourea (ENU), and ultraviolet trimethylpsoralen (UV/TMP) in the nematode Caenorhabditis elegans. Our analysis confirms the strong GC to AT transition bias of EMS. We found that ENU mainly produces A to T and T to A transversions, but also all possible transitions. We found no bias for any specific transition or transversion in the spectrum of UV/TMP-induced mutations. In 10 mutagenized strains we identified 2723 variants, of which 508 are expected to alter or disrupt gene function, including 21 nonsense mutations and 10 mutations predicted to affect mRNA splicing. This translates to an average of 50 informative mutations per strain. We also present evidence of genetic drift among laboratory wild-type strains derived from the Bristol N2 strain. We make several suggestions for best practice using massively parallel short read sequencing to ensure mutation detection.

Rapid and Efficient Identification of Caenorhabditis elegans Legacy Mutations Using Hawaiian SNP-Based Mapping and Whole Genome Sequencing

G3 (Bethesda, Md.), 2015

The production of viable embryos requires the coordination of many cellular processes including protein synthesis, cytoskeletal reorganization, establishment of polarity, cell migration, cell division, and in Caenorhabditis elegans, eggshell formation. Defects in any of these processes can lead to embryonic lethality. We examined six temperature sensitive mutants as well as one non-conditional mutant that were previously identified in genetic screens as either embryonic lethal (maternal-effect or zygotic lethal) or eggshell defective. The responsible molecular lesion for each had never been determined. After confirmation of temperature sensitivity and lethality, we performed whole-genome sequencing using a SNP mapping strategy to pinpoint the molecular lesions. Gene candidates were confirmed by RNAi phenocopy and/or complementation tests and one mutant was further validated by CRISPR/Cas9 gene editing. This approach identified new alleles of several genes that had only been previous...

Whole genome sequencing facilitates intragenic variant interpretation following modifier screening in C. elegans

BMC Genomics

Background Intragenic modifiers (in-phase, second-site variants) are known to have dramatic effects on clinical outcomes, affecting disease attributes such as severity or age of onset. However, despite their clinical importance, the focus of many genetic screens in model systems is on the discovery of extragenic variants, with many labs still relying upon more traditional methods to identify modifiers. However, traditional methods such as PCR and Sanger sequencing can be time-intensive and do not permit a thorough understanding of the intragenic modifier effects in the context of non-isogenic genomic backgrounds. Results Here, we apply high throughput approaches to identify and understand intragenic modifiers using Caenorhabditis elegans. Specifically, we applied whole genome sequencing (WGS) to a mutagen-induced forward genetic screen to identify intragenic suppressors of a temperature-sensitive zyg-1(it25) allele in C. elegans. ZYG-1 is a polo kinase that is important for centriol...

Towards a mutation in every gene in Caenorhabditis elegans

Briefings in functional genomics & proteomics, 2008

The combined efforts of the Caenorhabditis elegans Knockout Consortium and individuals within the worm community are moving us closer to the goal of identifying mutations in every gene in the nematode C. elegans. At present, we count about 7000 deletion alleles that fall within 5500 genes. The principal method used to detect deletion mutations in the nematode utilizes polymerase chain reaction (PCR). More recently, the Moerman group has incorporated array comparative genome hybridization (aCGH) to detect deletions across the entire coding genome. Other methods used to detect mutant alleles in C. elegans include targeting induced local lesion in genomes (TILLING), transposon tagging, using either Tc1 or Mos1 and resequencing. These combined strategies have improved the overall throughput of the gene-knockout labs, and have broadened the types of mutations that we, and others, can identify. In this review, we will discuss these different approaches.