SequenceMatrix: concatenation software for the fast assembly of multi-gene datasets with character set and codon information
Gaurav Vaidya et al. Cladistics. 2011 Apr.
Free article
Abstract
We present SequenceMatrix, software that is designed to facilitate the assembly and analysis of multi-gene datasets. Genes are concatenated by dragging and dropping FASTA, NEXUS, or TNT files with aligned sequences into the program window. A multi-gene dataset is concatenated and displayed in a spreadsheet; each sequence is represented by a cell that provides information on sequence length, number of indels, the number of ambiguous bases ("Ns"), and the availability of codon information. Alternatively, GenBank numbers for the sequences can be displayed and exported. Matrices with hundreds of genes and taxa can be concatenated within minutes and exported in TNT, NEXUS, or PHYLIP formats, preserving both character set and codon information for TNT and NEXUS files. SequenceMatrix also creates taxon sets listing taxa with a minimum number of characters or gene fragments, which helps assess preliminary datasets. Entire taxa, whole gene fragments, or individual sequences for a particular gene and species can be excluded from export. Data matrices can be re-split into their component genes and the gene fragments can be exported as individual gene files. SequenceMatrix also includes two tools that help to identify sequences that may have been compromised through laboratory contamination or data management error. One tool lists identical or near-identical sequences within genes, while the other compares the pairwise distance pattern of one gene against the pattern for all remaining genes combined. SequenceMatrix is Java-based and compatible with the Microsoft Windows, Apple MacOS X and Linux operating systems. The software is freely available from http://code.google.com/p/sequencematrix/. © The Willi Hennig Society 2010.
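The concatenation step described in the abstract can be illustrated with a minimal Python sketch. This is not SequenceMatrix's implementation (the program is a Java desktop application); the function names `concatenate` and `taxa_with_min_chars`, the use of `?` for missing data, and the choice to count every symbol other than `-` and `?` as a character are assumptions made for illustration only. Each gene is given as a mapping from taxon name to aligned sequence, and character sets are recorded as 1-based inclusive column ranges, as is usual in NEXUS files.

```python
from typing import Dict, List, Tuple

Alignment = Dict[str, str]  # taxon name -> aligned sequence


def concatenate(
    genes: Dict[str, Alignment], missing: str = "?"
) -> Tuple[Dict[str, str], Dict[str, Tuple[int, int]]]:
    """Concatenate per-gene alignments into a single matrix.

    Returns (matrix, charsets). `matrix` maps each taxon to its
    concatenated sequence; `charsets` maps each gene to its
    (start, end) columns, 1-based and inclusive.
    """
    # Every taxon seen in any gene gets a row in the combined matrix.
    taxa = sorted({taxon for aln in genes.values() for taxon in aln})
    parts: Dict[str, List[str]] = {taxon: [] for taxon in taxa}
    charsets: Dict[str, Tuple[int, int]] = {}
    offset = 0
    for gene, aln in genes.items():
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError(f"{gene}: sequences are not aligned to one length")
        length = lengths.pop()
        charsets[gene] = (offset + 1, offset + length)
        for taxon in taxa:
            # Taxa lacking this gene are padded with the missing-data symbol.
            parts[taxon].append(aln.get(taxon, missing * length))
        offset += length
    matrix = {taxon: "".join(chunks) for taxon, chunks in parts.items()}
    return matrix, charsets


def taxa_with_min_chars(matrix: Dict[str, str], min_chars: int) -> List[str]:
    """List taxa with at least `min_chars` symbols that are not '-' or '?'."""
    return [
        taxon
        for taxon, seq in matrix.items()
        if sum(ch not in "-?" for ch in seq) >= min_chars
    ]


genes = {
    "COI": {"A": "ACGT", "B": "AC-T"},
    "28S": {"A": "GGN", "C": "GGA"},
}
matrix, charsets = concatenate(genes)
print(matrix)    # {'A': 'ACGTGGN', 'B': 'AC-T???', 'C': '????GGA'}
print(charsets)  # {'COI': (1, 4), '28S': (5, 7)}
print(taxa_with_min_chars(matrix, 4))  # ['A']
```

Keeping the column ranges alongside the matrix is what allows a concatenated dataset to be split back into its component genes later, as the abstract describes.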