Tabix: fast retrieval of sequence features from generic TAB-delimited files - PubMed (original) (raw)
Tabix: fast retrieval of sequence features from generic TAB-delimited files
Heng Li. Bioinformatics. 2011.
Abstract
Tabix is the first generic tool that indexes position sorted files in TAB-delimited formats such as GFF, BED, PSL, SAM and SQL export, and quickly retrieves features overlapping specified regions. Tabix features include few seek function calls per query, data compression with gzip compatibility and direct FTP/HTTP access. Tabix is implemented as a free command-line tool as well as a library in C, Java, Perl and Python. It is particularly useful for manually examining local genomic features on the command line and enables genome viewers to support huge data files and remote custom tracks over networks.
Availability and implementation: http://samtools.sourceforge.net.
Similar articles
- The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets.
Nicol JW, Helt GA, Blanchard SG Jr, Raja A, Loraine AE. Nicol JW, et al. Bioinformatics. 2009 Oct 15;25(20):2730-1. doi: 10.1093/bioinformatics/btp472. Epub 2009 Aug 4. Bioinformatics. 2009. PMID: 19654113 Free PMC article. - SCALCE: boosting sequence compression algorithms using locally consistent encoding.
Hach F, Numanagic I, Alkan C, Sahinalp SC. Hach F, et al. Bioinformatics. 2012 Dec 1;28(23):3051-7. doi: 10.1093/bioinformatics/bts593. Epub 2012 Oct 9. Bioinformatics. 2012. PMID: 23047557 Free PMC article. - The Sequence Alignment/Map format and SAMtools.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R; 1000 Genome Project Data Processing Subgroup. Li H, et al. Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8. Bioinformatics. 2009. PMID: 19505943 Free PMC article. - Genome data mining for everyone.
Lee GW, Kim S. Lee GW, et al. BMB Rep. 2008 Nov 30;41(11):757-64. doi: 10.5483/bmbrep.2008.41.11.757. BMB Rep. 2008. PMID: 19017486 Review. - UCSC genome browser tutorial.
Zweig AS, Karolchik D, Kuhn RM, Haussler D, Kent WJ. Zweig AS, et al. Genomics. 2008 Aug;92(2):75-84. doi: 10.1016/j.ygeno.2008.02.003. Epub 2008 Jun 2. Genomics. 2008. PMID: 18514479 Review.
Cited by
- GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture.
Guðbjartsson H, Georgsson GF, Guðjónsson SA, Valdimarsson RÞ, Sigurðsson JH, Stefánsson SK, Másson G, Magnússon G, Pálmason V, Stefánsson K. Guðbjartsson H, et al. Bioinformatics. 2016 Oct 15;32(20):3081-3088. doi: 10.1093/bioinformatics/btw199. Epub 2016 Jun 23. Bioinformatics. 2016. PMID: 27339714 Free PMC article. - hipFG: high-throughput harmonization and integration pipeline for functional genomics data.
Cifello J, Kuksa PP, Saravanan N, Valladares O, Wang LS, Leung YY. Cifello J, et al. Bioinformatics. 2023 Nov 1;39(11):btad673. doi: 10.1093/bioinformatics/btad673. Bioinformatics. 2023. PMID: 37947320 Free PMC article. - Ferret: a user-friendly Java tool to extract data from the 1000 Genomes Project.
Limou S, Taverner AM, Winkler CA. Limou S, et al. Bioinformatics. 2016 Jul 15;32(14):2224-6. doi: 10.1093/bioinformatics/btw147. Epub 2016 Mar 18. Bioinformatics. 2016. PMID: 27153588 Free PMC article. - Deeper genomic insights into tomato CLE genes repertoire identify new active peptides.
Carbonnel S, Falquet L, Hazak O. Carbonnel S, et al. BMC Genomics. 2022 Nov 17;23(1):756. doi: 10.1186/s12864-022-08980-0. BMC Genomics. 2022. PMID: 36396987 Free PMC article. - lncRNAKB, a knowledgebase of tissue-specific functional annotation and trait association of long noncoding RNA.
Seifuddin F, Singh K, Suresh A, Judy JT, Chen YC, Chaitankar V, Tunc I, Ruan X, Li P, Chen Y, Cao H, Lee RS, Goes FS, Zandi PP, Jafri MS, Pirooznia M. Seifuddin F, et al. Sci Data. 2020 Oct 5;7(1):326. doi: 10.1038/s41597-020-00659-z. Sci Data. 2020. PMID: 33020484 Free PMC article.
References
- Alekseyenko AV, Lee CJ. Nested containment list (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases. Bioinformatics. 2007;23:1386–1393. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources