Weighting aligned protein or nucleic acid sequences to correct for unequal representation - PubMed (original) (raw)
Comparative Study
Weighting aligned protein or nucleic acid sequences to correct for unequal representation
P R Sibbald et al. J Mol Biol. 1990.
Abstract
Aligned sequences from the same family (e.g. the haemoglobins) are seldom representative of the entire family. This is because (1) the sequence databases are heavily skewed toward a small number of organisms and (2) only a minute fraction of all the different family members have been sequenced. For many applications, such as using alignments or profiles to perform database searches for distantly related family members, such unequal representation requires correction. An algorithm to perform appropriate weighting of individual sequences is presented along with examples illustrating its efficacy.
Similar articles
- Hidden Markov models in computational biology. Applications to protein modeling.
Krogh A, Brown M, Mian IS, Sjölander K, Haussler D. Krogh A, et al. J Mol Biol. 1994 Feb 4;235(5):1501-31. doi: 10.1006/jmbi.1994.1104. J Mol Biol. 1994. PMID: 8107089 - Identification and characterization of a shrimp white spot syndrome virus (WSSV) gene that encodes a novel chimeric polypeptide of cellular-type thymidine kinase and thymidylate kinase.
Tsai MF, Yu HT, Tzeng HF, Leu JH, Chou CM, Huang CJ, Wang CH, Lin JY, Kou GH, Lo CF. Tsai MF, et al. Virology. 2000 Nov 10;277(1):100-10. doi: 10.1006/viro.2000.0597. Virology. 2000. PMID: 11062040 - Low molecular weight proteins: a challenge for post-genomic research.
Rudd KE, Humphery-Smith I, Wasinger VC, Bairoch A. Rudd KE, et al. Electrophoresis. 1998 Apr;19(4):536-44. doi: 10.1002/elps.1150190413. Electrophoresis. 1998. PMID: 9588799 - Thymidine kinase.
Kit S. Kit S. Microbiol Sci. 1985 Dec;2(12):369-75. Microbiol Sci. 1985. PMID: 3939993 Review. No abstract available.
Cited by
- TwinCons: Conservation score for uncovering deep sequence similarity and divergence.
Penev PI, Alvarez-Carreño C, Smith E, Petrov AS, Williams LD. Penev PI, et al. PLoS Comput Biol. 2021 Oct 29;17(10):e1009541. doi: 10.1371/journal.pcbi.1009541. eCollection 2021 Oct. PLoS Comput Biol. 2021. PMID: 34714829 Free PMC article. - A phylogenetic approach for weighting genetic sequences.
De Maio N, Alekseyenko AV, Coleman-Smith WJ, Pardi F, Suchard MA, Tamuri AU, Truszkowski J, Goldman N. De Maio N, et al. BMC Bioinformatics. 2021 May 28;22(1):285. doi: 10.1186/s12859-021-04183-8. BMC Bioinformatics. 2021. PMID: 34049487 Free PMC article. - Phylogenetic weighting does little to improve the accuracy of evolutionary coupling analyses.
Hockenberry AJ, Wilke CO. Hockenberry AJ, et al. Entropy (Basel). 2019 Oct;21(10):1000. doi: 10.3390/e21101000. Epub 2019 Oct 12. Entropy (Basel). 2019. PMID: 31662602 Free PMC article. - Maximum diversity weighting for biomarkers with application in HIV-1 vaccine studies.
He Z, Fong Y. He Z, et al. Stat Med. 2019 Sep 10;38(20):3936-3946. doi: 10.1002/sim.8212. Epub 2019 Jun 19. Stat Med. 2019. PMID: 31215662 Free PMC article. - Charting the landscape of tandem BRCT domain-mediated protein interactions.
Woods NT, Mesquita RD, Sweet M, Carvalho MA, Li X, Liu Y, Nguyen H, Thomas CE, Iversen ES Jr, Marsillac S, Karchin R, Koomen J, Monteiro AN. Woods NT, et al. Sci Signal. 2012 Sep 18;5(242):rs6. doi: 10.1126/scisignal.2002255. Sci Signal. 2012. PMID: 22990118 Free PMC article.