A local alignment tool for very long DNA sequences
K M Chao et al. Comput Appl Biosci. 1995 Apr.
Abstract
This paper presents a practical program, called sim2, for building local alignments of two sequences, each of which may be hundreds of kilobases long. sim2 first constructs n best non-intersecting chains of 'fragments', such as all occurrences of identical 5-tuples in each of two DNA sequences, for any specified n ≥ 1. Each chain is then refined by delivering an optimal alignment in a region delimited by the chain. sim2 requires only space proportional to the size of the input sequences and the output alignments, and the same source code runs on Unix machines, on Macintoshes, on PCs, and on DEC Alpha PCs. We also describe an application of sim2 for aligning long DNA sequences from Escherichia coli. sim2 facilitates contig-building by providing a complete view of the related sequences, so differences can be analyzed and inconsistencies resolved. Examples are shown using the alignment display and editing functions from the software tool ChromoScope.
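The fragment-and-chain idea in the abstract can be illustrated with a minimal Python sketch. It is not the sim2 implementation: the function names `find_fragments` and `best_chain` are hypothetical, the scoring (each fragment is worth k, with no gap penalties) is an assumption made for brevity, and the quadratic chaining loop finds only a single best chain rather than n non-intersecting chains in linear space. The refinement step, which computes an optimal alignment in the region delimited by the chain, is omitted.

```python
from collections import defaultdict


def find_fragments(a, b, k=5):
    """Return sorted (i, j) pairs where a[i:i+k] == b[j:j+k]."""
    # Index every k-tuple of the first sequence by its start position.
    index = defaultdict(list)
    for i in range(len(a) - k + 1):
        index[a[i:i + k]].append(i)
    # Look up every k-tuple of the second sequence in that index.
    fragments = []
    for j in range(len(b) - k + 1):
        for i in index.get(b[j:j + k], ()):
            fragments.append((i, j))
    fragments.sort()
    return fragments


def best_chain(fragments, k=5):
    """Return one highest-scoring chain of non-overlapping fragments.

    Assumes `fragments` is sorted, so that every valid predecessor of a
    fragment appears before it. Each fragment scores k (an assumption).
    """
    if not fragments:
        return []
    score = [k] * len(fragments)
    prev = [-1] * len(fragments)
    for t, (i, j) in enumerate(fragments):
        for s in range(t):
            pi, pj = fragments[s]
            # s may precede t only if it ends before t starts in both sequences.
            if pi + k <= i and pj + k <= j and score[s] + k > score[t]:
                score[t] = score[s] + k
                prev[t] = s
    # Trace back from the best-scoring end fragment.
    t = max(range(len(fragments)), key=score.__getitem__)
    chain = []
    while t != -1:
        chain.append(fragments[t])
        t = prev[t]
    chain.reverse()
    return chain


if __name__ == "__main__":
    frags = find_fragments("ACGTATTTTTGGCAC", "ACGTACCGGCAC")
    print(best_chain(frags))
```

In this toy input the two sequences share the 5-tuples ACGTA and GGCAC, and both fragments are collinear, so they form a single chain. A program working at the scale the abstract describes would need a more economical chaining method than this all-pairs loop.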