PatternHunter: faster and more sensitive homology search
Comparative Study
Bin Ma et al. Bioinformatics. 2002 Mar.
Abstract
Motivation: Genomics and proteomics studies routinely depend on homology searches that find short seed matches and then extend them. The explosive growth of genomic data presents a dilemma for DNA homology search techniques: increasing seed size decreases sensitivity, whereas decreasing seed size slows down computation.
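The sensitivity half of this trade-off can be illustrated with a small calculation. The sketch below is not taken from the paper: it assumes a simplified model in which each position of a homologous region matches independently with a fixed probability, and the function name `contiguous_hit_probability` is hypothetical. It computes the chance that such a region contains at least one run of `k` consecutive matches, which is what a contiguous seed of size `k` needs in order to fire.

```python
def contiguous_hit_probability(k, region_len, identity):
    """Probability that a region of `region_len` positions, each matching
    independently with probability `identity` (a modelling assumption),
    contains at least one run of k consecutive matches."""
    # run[r] = probability that no hit has occurred yet and the current
    # run of consecutive matches has length r (0 <= r < k).
    run = [0.0] * k
    run[0] = 1.0
    hit = 0.0
    for _ in range(region_len):
        nxt = [0.0] * k
        for r, pr in enumerate(run):
            nxt[0] += pr * (1.0 - identity)   # a mismatch resets the run
            if r + 1 == k:
                hit += pr * identity          # the run reaches k: seed hit
            else:
                nxt[r + 1] += pr * identity   # the run grows by one match
        run = nxt
    return hit


if __name__ == "__main__":
    # Larger seeds are hit less often in the same region.
    for k in (8, 11, 14):
        print(k, round(contiguous_hit_probability(k, 64, 0.7), 3))
```

Under this model the hit probability falls as `k` grows, which is the loss of sensitivity the abstract refers to. The cost of small seeds comes from the other direction: shorter words match by chance far more often, so more spurious hits have to be processed.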
Results: We present a new homology search algorithm 'PatternHunter' that uses a novel seed model for increased sensitivity and new hit-processing techniques for significantly increased speed. At Blast levels of sensitivity, PatternHunter is able to find homologies between sequences as large as human chromosomes, in mere hours on a desktop.
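To make the idea of a seed model concrete, here is a minimal sketch of seed matching with a seed pattern in which `1` marks positions that must match and `0` marks positions that are ignored. The abstract does not spell out the seed; the pattern `111010010100110111` (11 required positions over a span of 18) is the one usually attributed to PatternHunter in the literature and is used here as an assumption. The helper names `seed_key` and `seed_hits` and the toy sequences are hypothetical, and the sketch covers only hit finding, not extension or the paper's hit-processing techniques.

```python
from collections import defaultdict

# Assumed spaced seed (weight 11, span 18); not stated in the abstract itself.
PH_SEED = "111010010100110111"
# A contiguous seed of the same weight, for comparison.
CONTIGUOUS_SEED = "1" * 11


def seed_key(seq, start, seed):
    """Letters of seq under the '1' positions of the seed placed at start."""
    return "".join(seq[start + i] for i, c in enumerate(seed) if c == "1")


def seed_hits(a, b, seed):
    """Return (i, j) pairs where a at offset i and b at offset j agree on
    every '1' position of the seed."""
    span = len(seed)
    index = defaultdict(list)
    for i in range(len(a) - span + 1):
        index[seed_key(a, i, seed)].append(i)
    hits = []
    for j in range(len(b) - span + 1):
        for i in index.get(seed_key(b, j, seed), ()):
            hits.append((i, j))
    return hits


if __name__ == "__main__":
    a = "ACGTTGCAAGCTTAGGCT"
    b = "ACGTTACAAGGTTAGGCT"  # differs from a at positions 5 and 10
    print(seed_hits(a, b, PH_SEED))          # [(0, 0)]
    print(seed_hits(a, b, CONTIGUOUS_SEED))  # []
```

In this toy pair both mismatches fall on ignored positions of the spaced pattern, so it reports a hit, while every window of 11 consecutive letters contains a mismatch and the contiguous seed reports none. The example is constructed to show the mechanism and says nothing about average sensitivity.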
Availability: PatternHunter is available at http://www.bioinformaticssolutions.com, as a commercial package. It runs on all platforms that support Java. PatternHunter technology is being patented; commercial use requires a license from BSI, while non-commercial use will be free.
Similar articles
- D-ASSIRC: distributed program for finding sequence similarities in genomes.
Vincens P, Badel-Chagnon A, André C, Hazout S. Bioinformatics. 2002 Mar;18(3):446-51. doi: 10.1093/bioinformatics/18.3.446. PMID: 11934744
- Patternhunter II: highly sensitive and fast homology search.
Li M, Ma B, Kisman D, Tromp J. J Bioinform Comput Biol. 2004 Sep;2(3):417-39. doi: 10.1142/s0219720004000661. PMID: 15359419
- PatternHunter II: highly sensitive and fast homology search.
Li M, Ma B, Kisman D, Tromp J. Genome Inform. 2003;14:164-75. PMID: 15706531
- Finding homologs to nucleic acid or protein sequences using the framesearch program.
Healy M. Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.2. doi: 10.1002/0471250953.bi0302s00. PMID: 18792937 Review.
- Finding homologs to nucleotide sequences using network BLAST searches.
Ladunga I. Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.3. doi: 10.1002/0471250953.bi0303s00. PMID: 18792938 Review.
Cited by
- On the Maximal Independent Sets of k-mers with the Edit Distance.
Ma L, Chen K, Shao M. ACM BCB. 2023 Sep;2023:42. doi: 10.1145/3584371.3612982. Epub 2023 Oct 4. PMID: 38050580 Free PMC article.
- Comparative analysis of Mycobacterium and related Actinomycetes yields insight into the evolution of Mycobacterium tuberculosis pathogenesis.
McGuire AM, Weiner B, Park ST, Wapinski I, Raman S, Dolganov G, Peterson M, Riley R, Zucker J, Abeel T, White J, Sisk P, Stolte C, Koehrsen M, Yamamoto RT, Iacobelli-Martinez M, Kidd MJ, Maer AM, Schoolnik GK, Regev A, Galagan J. BMC Genomics. 2012 Mar 28;13:120. doi: 10.1186/1471-2164-13-120. PMID: 22452820 Free PMC article.
- The Rauvolfia tetraphylla genome suggests multiple distinct biosynthetic routes for yohimbane monoterpene indole alkaloids.
Stander EA, Lehka B, Carqueijeiro I, Cuello C, Hansson FG, Jansen HJ, Dugé De Bernonville T, Birer Williams C, Vergès V, Lezin E, Lorensen MDBB, Dang TT, Oudin A, Lanoue A, Durand M, Giglioli-Guivarc'h N, Janfelt C, Papon N, Dirks RP, O'connor SE, Jensen MK, Besseau S, Courdavault V. Commun Biol. 2023 Nov 24;6(1):1197. doi: 10.1038/s42003-023-05574-8. PMID: 38001233 Free PMC article.
- Early evolution of conserved regulatory sequences associated with development in vertebrates.
McEwen GK, Goode DK, Parker HJ, Woolfe A, Callaway H, Elgar G. PLoS Genet. 2009 Dec;5(12):e1000762. doi: 10.1371/journal.pgen.1000762. Epub 2009 Dec 11. PMID: 20011110 Free PMC article.
- SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.
Hu X, Friedberg I. Gigascience. 2019 Oct 1;8(10):giz118. doi: 10.1093/gigascience/giz118. PMID: 31648300 Free PMC article.