Gene finding for the helical cytokines - PubMed (original) (raw)
Gene finding for the helical cytokines
Darrell Conklin et al. Bioinformatics. 2005.
Abstract
Motivation: Gene finding remains an open problem well after the sequencing of the human genome. The low gene sensitivity of current methods is a problem for divergent protein families, because fairly accurate exon assemblies are required before sensitive fold recognition algorithms can be applied. This paper presents a new genomic threading algorithm which integrates the gene finding and fold recognition steps into a single process. The method is applicable to evolutionarily divergent protein families that have retained some trace of their common ancestry, number and phase of introns, sizes of exons and placement of structural elements on specific exons. Such conserved structural signals may be visible despite dramatic evolution of protein sequence.
Results: The method is evaluated on the family of helical cytokines by cross-validation sensitivity analysis. The method has also been applied to all intergenic regions of the human genome, and an expression and cloning approach has been coupled with the predictions of the method. Two genes discovered by this method are discussed.
Supplementary information: All data used and the results obtained in the cross-validation analysis are available at http://www.soi.city.ac.uk/\~conklin/papers/GT/
Similar articles
- PALMA: mRNA to genome alignments using large margin algorithms.
Schulze U, Hepp B, Ong CS, Rätsch G. Schulze U, et al. Bioinformatics. 2007 Aug 1;23(15):1892-900. doi: 10.1093/bioinformatics/btm275. Epub 2007 May 30. Bioinformatics. 2007. PMID: 17537755 - Common structural patterns in human genes.
Hill A, Sorscher E. Hill A, et al. Bioinformatics. 2004 Jul 10;20(10):1632-5. doi: 10.1093/bioinformatics/bth134. Epub 2004 Feb 26. Bioinformatics. 2004. PMID: 14988119 - CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes.
Parra G, Bradnam K, Korf I. Parra G, et al. Bioinformatics. 2007 May 1;23(9):1061-7. doi: 10.1093/bioinformatics/btm071. Epub 2007 Mar 1. Bioinformatics. 2007. PMID: 17332020 - Advances in the Exon-Intron Database (EID).
Shepelev V, Fedorov A. Shepelev V, et al. Brief Bioinform. 2006 Jun;7(2):178-85. doi: 10.1093/bib/bbl003. Epub 2006 Mar 9. Brief Bioinform. 2006. PMID: 16772261 Review. - [Physical mapping of disease-associated regions and functional analysis of pathogenic genes: gene hunting by genomic DNA sequencing and cDNA screening].
Shimizu N, Kudoh J, Minoshima S, Kawasaki K, Asakawa S. Shimizu N, et al. Tanpakushitsu Kakusan Koso. 1997 Dec;42(17 Suppl):2729-36. Tanpakushitsu Kakusan Koso. 1997. PMID: 9455185 Review. Japanese. No abstract available.
Cited by
- Vstm3 is a member of the CD28 family and an important modulator of T-cell function.
Levin SD, Taft DW, Brandt CS, Bucher C, Howard ED, Chadwick EM, Johnston J, Hammond A, Bontadelli K, Ardourel D, Hebb L, Wolf A, Bukowski TR, Rixon MW, Kuijper JL, Ostrander CD, West JW, Bilsborough J, Fox B, Gao Z, Xu W, Ramsdell F, Blazar BR, Lewis KE. Levin SD, et al. Eur J Immunol. 2011 Apr;41(4):902-15. doi: 10.1002/eji.201041136. Epub 2011 Mar 18. Eur J Immunol. 2011. PMID: 21416464 Free PMC article. - The role of genomic data in the discovery, annotation and evolutionary interpretation of the interferon-lambda family.
Fox BA, Sheppard PO, O'Hara PJ. Fox BA, et al. PLoS One. 2009;4(3):e4933. doi: 10.1371/journal.pone.0004933. Epub 2009 Mar 20. PLoS One. 2009. PMID: 19300512 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous