Identification of Rat Genes by TWINSCAN Gene Prediction, RT–PCR, and Direct Sequencing (original) (raw)
- Jia Qian Wu1,
- David Shteynberg2,
- Manimozhiyan Arumugam2,
- Richard A. Gibbs1, and
- Michael R. Brent2,3
- 1 Human Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- 2 Laboratory for Computational Genomics, Washington University, St. Louis, Missouri 63130, USA
Abstract
The publication of a draft sequence of a third mammalian genome—that of the rat—suggests a need to rethink genome annotation. New mammalian sequences will not receive the kind of labor-intensive annotation efforts that are currently being devoted to human. In this paper, we demonstrate an alternative approach: reverse transcription-polymerase chain reaction (RT–PCR) and direct sequencing based on dual-genome de novo predictions from TWINSCAN. We tested 444 TWINSCAN-predicted rat genes that showed significant homology to known human genes implicated in disease but that were partially or completely missed by methods based on protein-to-genome mapping. Using primers in exons flanking a single predicted intron, we were able to verify the existence of 59% of these predicted genes. We then attempted to amplify the complete predicted open reading frames of 136 genes that were verified in the single-intron experiment. Spliced sequences were amplified in 46 cases (34%). We conclude that this procedure for elucidating gene structures with native cDNA sequences is cost-effective and will become even more so as it is further optimized.
Footnotes
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1959604.
↵3 Corresponding author. E-MAIL brent{at}cse.wustl.edu; FAX (314) 935-7302.
- Accepted November 17, 2003.
- Received September 11, 2003.
Cold Spring Harbor Laboratory Press