A Simple Approach to Building Ensembles of Naive Bayesian Classifiers for Word Sense Disambiguation

An Ensemble Approach to Corpus Based Word Sense Disambiguation

2000

This paper presents a corpus-based approach to word sense disambiguation that combines a number of Naive Bayesian classifiers into an ensemble that performs disambiguation via a majority vote. Each of the member classifiers is based on collocation and co-occurrence features found in varying sized windows of context. This approach is motivated by the observation that, in general, enhancing the feature set or learning algorithm used by a corpus-based approach does not improve disambiguation accuracy beyond ...
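To make the setup concrete, here is a minimal sketch of an ensemble of Naive Bayes classifiers, one per context window width, combined by a hard majority vote. It assumes scikit-learn; the toy sense-tagged instances, window widths, and plain bag-of-words features are illustrative stand-ins rather than the paper's exact collocation and co-occurrence feature set.

```python
# Sketch: majority-vote ensemble of Naive Bayes classifiers, one per window width.
# Assumes scikit-learn; the toy data and window widths are illustrative only.
from collections import Counter
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

def window(tokens, position, width):
    """Words within +/- `width` positions of the ambiguous word, as one string."""
    lo, hi = max(0, position - width), position + width + 1
    return " ".join(tokens[lo:position] + tokens[position + 1:hi])

# Toy sense-tagged instances: (tokenized sentence, target index, sense label).
train = [
    ("deposit the check at the bank before noon".split(), 5, "finance"),
    ("the bank raised its interest rates again".split(), 1, "finance"),
    ("we walked along the bank of the river".split(), 4, "river"),
    ("fish swam near the muddy bank at dusk".split(), 5, "river"),
]
widths = (2, 5, 10)  # each member classifier sees a different amount of context

members = []
for w in widths:
    texts = [window(toks, pos, w) for toks, pos, _ in train]
    labels = [sense for _, _, sense in train]
    clf = make_pipeline(CountVectorizer(), MultinomialNB())
    members.append((w, clf.fit(texts, labels)))

def disambiguate(tokens, position):
    """Each member votes on its own window; the majority sense wins."""
    votes = [clf.predict([window(tokens, position, w)])[0] for w, clf in members]
    return Counter(votes).most_common(1)[0][0]

print(disambiguate("she opened an account at the bank".split(), 6))
```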

Naive Bayes and exemplar-based approaches to word sense disambiguation revisited

2000

This paper describes an experimental comparison between two standard supervised learning methods, namely Naive Bayes and Exemplar-based classification, on the Word Sense Disambiguation (WSD) problem. The aim of the work is twofold. Firstly, it attempts to help clarify some confusing information in the related literature about the comparison between the two methods. In doing so, several directions are explored, including testing several modifications of the basic learning algorithms and varying the feature space. Secondly, an improvement of both algorithms is proposed in order to deal with large attribute sets. This modification, which basically consists of using only the positive information appearing in the examples, greatly improves the efficiency of the methods with no loss in accuracy. The experiments have been performed on the largest sense-tagged corpus available, containing the most frequent and ambiguous English words. Results show that the Exemplar-based approach to WSD is generally superior to the Bayesian approach, especially when a specific metric for dealing with symbolic attributes is used.
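For illustration, here is a minimal sketch of the kind of head-to-head comparison described above, pitting Naive Bayes against an exemplar-based learner (approximated by 1-nearest-neighbour) on identical bag-of-words features. It assumes scikit-learn and toy data; the paper's "positive information only" modification and its metric for symbolic attributes are not reproduced here.

```python
# Sketch: Naive Bayes vs. an exemplar-based (nearest-neighbour) learner on the
# same bag-of-words features.  Assumes scikit-learn; toy data only.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline

contexts = [
    "deposit the check at the bank before noon",
    "the bank raised its interest rates again",
    "the bank approved the mortgage application",
    "we walked along the bank of the river",
    "fish swam near the muddy bank at dusk",
    "erosion wore away the steep river bank",
]
senses = ["finance", "finance", "finance", "river", "river", "river"]

for name, learner in [("Naive Bayes", MultinomialNB()),
                      ("Exemplar-based (1-NN)", KNeighborsClassifier(n_neighbors=1))]:
    clf = make_pipeline(CountVectorizer(), learner)
    scores = cross_val_score(clf, contexts, senses, cv=3)
    print(f"{name}: mean accuracy {scores.mean():.2f}")
```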

A comparison between supervised learning algorithms for word sense disambiguation

Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning, 2000

This paper describes a set of comparative experiments, including cross-corpus evaluation, between five alternative algorithms for supervised Word Sense Disambiguation (WSD), namely Naive Bayes, Exemplar-based learning, SNoW, Decision Lists, and Boosting. Two main conclusions can be drawn: 1) The LazyBoosting algorithm outperforms the other four state-of-the-art algorithms in terms of accuracy and ability to tune to new domains; 2) The domain dependence of WSD systems seems very strong and suggests that some kind of adaptation or tuning is required for cross-corpus application.

A Naïve Bayes Approach for Word Sense Disambiguation

Word sense disambiguation (WSD) is the task of automatically selecting the correct sense of a word given a context, and it helps in solving many ambiguity problems that exist inherently in all natural languages. Statistical Natural Language Processing (NLP), which is based on probabilistic, stochastic and statistical methods, has been used to solve many NLP problems. The Naive Bayes algorithm, one of the supervised learning techniques, has worked well in many classification problems. In the present work, the WSD task of disambiguating the senses of different words from the standard corpora available in the 1998 SENSEVAL Word Sense Disambiguation (WSD) shared task is performed by applying the Naïve Bayes machine learning technique. It is observed that senses of ambiguous words with fewer parts of speech are disambiguated more accurately. Another key observation is that the fewer the senses to be disambiguated, the higher the chances of a word being disambiguated with the correct sense.

I. INTRODUCTION

Ambiguity in the senses of words exists inherently in all natural languages used by humans. Every language has many words that carry more than one meaning. For example, the word "chair" has one sense meaning a piece of furniture and another sense meaning a person chairing, say, a session. We therefore need some context to select the correct sense in a given situation. Automatically selecting the correct sense given a context is at the core of solving many ambiguity problems. Word sense disambiguation (WSD) is the task of automatically determining which of the senses of an ambiguous (target) word is intended in a specific use of the word, taking into consideration the context of the word's use [1, 2]. Accurate and reliable word sense disambiguation has long been a goal of the natural language community. The motivation and belief behind performing word sense disambiguation is that many tasks performed under the umbrella of NLP benefit greatly from properly disambiguated word senses. Statistical NLP, an approach to NLP based on probabilistic, stochastic and statistical methods, uses machine learning algorithms to solve many NLP problems. As a branch of artificial intelligence, machine learning involves computationally learning patterns from given data and applying the learned patterns to new or unseen data. Machine learning is defined by Tom M. Mitchell as: "A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E" [3]. Learning algorithms can generally be classified into three types: supervised learning, semi-supervised learning and unsupervised learning. Supervised learning is based on the idea of studying the features of positive and negative examples over a large collection of annotated corpora. Semi-supervised learning uses both labeled and unlabeled data in the learning process to reduce the dependence on training data. In unsupervised learning, decisions are made on the basis of unlabeled data; unsupervised methods are mostly built upon clustering techniques, similarity-based functions and distributional statistics. For automatic WSD, supervised learning is one of the most successful approaches.
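As a worked illustration of the underlying decision rule, the sketch below scores each candidate sense by P(sense) · Π P(word | sense) over the context words, with add-one smoothing, and picks the argmax. The toy training data and plain bag-of-words context features are assumptions for illustration, not the corpora or features used in the paper.

```python
# Minimal from-scratch sketch of Naive Bayes sense selection:
# choose argmax_s  P(s) * prod_w P(w | s)  over the context words,
# with Laplace (add-one) smoothing.  Toy data; features are illustrative.
import math
from collections import Counter, defaultdict

train = [
    ("finance", "deposit the check at the bank before noon".split()),
    ("finance", "the bank raised its interest rates again".split()),
    ("river",   "we walked along the bank of the river".split()),
    ("river",   "fish swam near the muddy bank at dusk".split()),
]

sense_counts = Counter(sense for sense, _ in train)
word_counts = defaultdict(Counter)          # word_counts[sense][word]
vocab = set()
for sense, words in train:
    word_counts[sense].update(words)
    vocab.update(words)

def disambiguate(context_words):
    best_sense, best_logp = None, float("-inf")
    for sense in sense_counts:
        # log P(sense)
        logp = math.log(sense_counts[sense] / len(train))
        total = sum(word_counts[sense].values())
        for w in context_words:
            # log P(word | sense) with add-one smoothing
            logp += math.log((word_counts[sense][w] + 1) / (total + len(vocab)))
        if logp > best_logp:
            best_sense, best_logp = sense, logp
    return best_sense

print(disambiguate("she opened an account at the bank".split()))
```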

A New Supervised Learning Algorithm for Word Sense Disambiguation

1997

The Naive Mix is a new supervised learning algorithm that is based on a sequential method for selecting probabilistic models. The usual objective of model selection is to find a single model that adequately characterizes the data in a training sample. However, during model selection a sequence of models is generated that consists of the best-fitting model at each level of model complexity. The Naive Mix utilizes this sequence of models to define a probabilistic model which is then used as a probabilistic classifier to perform word-sense disambiguation. The models in this sequence are restricted to the class of decomposable log-linear models. This class of models offers a number of computational advantages. Experiments disambiguating twelve different words show that a Naive Mix formulated with a forward sequential search and Akaike's Information Criteria rivals established supervised learning algorithms such as decision trees (C4.5), rule induction (CN2) and nearest-neighbor classification (PEBLS).

Combining heterogeneous classifiers for word-sense disambiguation

Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions, 2002

This paper discusses ensembles of simple but heterogeneous classifiers for word-sense disambiguation, examining the Stanford-CS224N system entered in the SENSEVAL-2 English lexical sample task. First-order classifiers are combined by a second-order classifier, which variously uses majority voting, weighted voting, or a maximum entropy model. While individual first-order classifiers perform comparably to middle-scoring teams' systems, the combination achieves high performance. We discuss trade-offs and empirical performance. Finally, we present an analysis of the combination, examining how ensemble performance depends on error independence and task difficulty.
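A minimal sketch of a second-order combiner over heterogeneous first-order classifiers, using weighted (soft) voting, one of the combination schemes mentioned above. It assumes scikit-learn; the particular member models, weights, and toy data are illustrative and not the Stanford-CS224N configuration.

```python
# Sketch: a second-order combiner (weighted soft vote) over heterogeneous
# first-order classifiers.  Assumes scikit-learn; members, weights, and data
# are illustrative only.
from sklearn.ensemble import VotingClassifier
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline

contexts = [
    "deposit the check at the bank before noon",
    "the bank raised its interest rates again",
    "we walked along the bank of the river",
    "fish swam near the muddy bank at dusk",
]
senses = ["finance", "finance", "river", "river"]

first_order = [
    ("nb",  make_pipeline(CountVectorizer(), MultinomialNB())),
    ("knn", make_pipeline(TfidfVectorizer(), KNeighborsClassifier(n_neighbors=1))),
    ("lr",  make_pipeline(CountVectorizer(ngram_range=(1, 2)), LogisticRegression())),
]

# Second-order classifier: weighted (soft) vote over the members' sense probabilities.
combiner = VotingClassifier(first_order, voting="soft", weights=[2, 1, 1])
combiner.fit(contexts, senses)
print(combiner.predict(["she opened an account at the bank"]))
```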

Simple features for statistical word sense disambiguation

In this paper, we describe our experiments on statistical word sense disambiguation (WSD) using two systems based on different approaches: Naïve Bayes on word tokens and Maximum Entropy on local syntactic and semantic features. In the first approach, we consider a context window and a sub-window within it around the word to disambiguate. Within the outer window, only content words are considered, but within the sub-window, all words are taken into account. Both window sizes are tuned by the system for each word to disambiguate, and accuracies of 75% and 67% were obtained for coarse- and fine-grained evaluations respectively. In the second system, sense resolution is done using an approximate syntactic structure as well as the semantics of neighboring nouns as features to a Maximum Entropy learner. Accuracies of 70% and 63% were obtained for coarse- and fine-grained evaluations.
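A small sketch of the two-window idea described above: every word inside a narrow sub-window is kept, while only content words are kept in the wider outer window. The window sizes, the stop-word list, and the example sentence are illustrative assumptions; the paper tunes the window sizes per target word.

```python
# Sketch: two-window context feature extraction -- all words inside a narrow
# sub-window, content words only in the wider outer window.  Sizes and the
# stop-word list are illustrative assumptions.
STOP_WORDS = {"the", "a", "an", "of", "at", "in", "on", "to", "and", "its", "we"}

def context_features(tokens, position, outer=10, inner=3):
    feats = []
    for i, tok in enumerate(tokens):
        if i == position:
            continue
        dist = abs(i - position)
        if dist <= inner:                                   # sub-window: keep every word
            feats.append(tok.lower())
        elif dist <= outer and tok.lower() not in STOP_WORDS:
            feats.append(tok.lower())                       # outer window: content words only
    return feats

tokens = "we walked along the grassy bank of the winding river".split()
print(context_features(tokens, tokens.index("bank")))
```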

Applying a Naive Bayes Similarity Measure to Word Sense Disambiguation

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2014

We replace the overlap mechanism of the Lesk algorithm with a simple, general-purpose Naive Bayes model that measures many-to-many association between two sets of random variables. Even with simple probability estimates such as maximum likelihood, the model gains significant improvement over the Lesk algorithm on word sense disambiguation tasks. With additional lexical knowledge from WordNet, performance is further improved to surpass the state-of-the-art results.
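As a toy illustration of the contrast with Lesk, the sketch below compares an exact-overlap count against a smoothed, co-occurrence-based association score between the context and gloss word sets, so related but non-identical words can still contribute. The counts, smoothing, and scoring function here are illustrative assumptions, not the paper's actual Naive Bayes model or its estimates.

```python
# Toy contrast: Lesk counts exact word overlaps between context and gloss,
# while an association-style score sums smoothed per-word co-occurrence
# probabilities.  Counts and estimates below are illustrative only.
import math
from collections import Counter

def lesk_overlap(context, gloss):
    """Classic Lesk: number of word types shared by context and gloss."""
    return len(set(context) & set(gloss))

def association_score(context, gloss, cooc, vocab_size, alpha=1.0):
    """Sum over context words of a smoothed (add-alpha) probability that the
    word co-occurs with some gloss word, from raw co-occurrence counts."""
    total = sum(cooc.values())
    score = 0.0
    for w in context:
        count = sum(cooc[(w, g)] for g in gloss)
        score += math.log((count + alpha) / (total + alpha * vocab_size))
    return score

# Illustrative co-occurrence counts, e.g. harvested from a corpus.
cooc = Counter({("deposit", "money"): 8, ("account", "money"): 12,
                ("account", "institution"): 5, ("river", "water"): 9})
gloss_finance = ["financial", "institution", "money"]
context = ["deposit", "account"]

print(lesk_overlap(context, gloss_finance))                       # 0: no exact overlap
print(association_score(context, gloss_finance, cooc, vocab_size=5000))
```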

Learning Rules for Large-Vocabulary Word Sense Disambiguation: A Comparison of Various Classifiers

Lecture Notes in Computer Science, 2000

In this article we compare the performance of various machine learning algorithms on the task of constructing word-sense disambiguation rules from data. The distinguishing characteristic of our work from most of the related work in the field is that we aim at the disambiguation of all content words in the text, rather than focussing on a small number of words. In an earlier study we have shown that a decision tree induction algorithm performs well on this task. This study compares decision tree induction with other popular learning methods and discusses their advantages and disadvantages. Our results confirm the good performance of decision tree induction, which outperforms the other algorithms due to its ability to order the features used for disambiguation according to their contribution to assigning the correct sense.
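A minimal sketch of the property highlighted above: a decision tree learner induces an ordering over disambiguation features by how much each contributes to choosing the correct sense. It assumes scikit-learn; the bag-of-words features and toy data are illustrative, not the article's large-vocabulary setup.

```python
# Sketch: a decision tree orders disambiguation features by their contribution
# to the sense decision.  Assumes scikit-learn; toy data only.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.tree import DecisionTreeClassifier

contexts = [
    "deposit the check at the bank before noon",
    "the bank raised its interest rates again",
    "we walked along the bank of the river",
    "fish swam near the muddy bank at dusk",
]
senses = ["finance", "finance", "river", "river"]

vec = CountVectorizer()
X = vec.fit_transform(contexts)
tree = DecisionTreeClassifier(random_state=0).fit(X, senses)

# Features ranked by their contribution to assigning the correct sense.
ranked = sorted(zip(vec.get_feature_names_out(), tree.feature_importances_),
                key=lambda p: p[1], reverse=True)
print(ranked[:5])
```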

Combining Lexical and Syntactic Features for Supervised Word Sense Disambiguation

2004

The success of supervised learning approaches to word sense disambiguation is largely dependent on the features used to represent the context in which an ambiguous word occurs. Previous work has reached mixed conclusions; some suggest that combinations of syntactic and lexical features perform most effectively, while others have shown that simple lexical features perform well on their own. This paper evaluates the effect of using different lexical and syntactic features, both individually and in combination. We show that a very simple ensemble that utilizes a single lexical feature and a sequence of part-of-speech features can achieve disambiguation accuracy that is near the state of the art.
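To make that feature set concrete, here is a small sketch that extracts a single lexical feature (the word immediately to the right of the target, an illustrative choice) together with a short sequence of part-of-speech features around it. It assumes NLTK with its default POS tagger model downloaded via nltk.download; the exact lexical feature, POS window, and ensemble used in the paper may differ, and an ensemble would combine classifiers trained on views like these.

```python
# Sketch: one lexical feature plus a sequence of part-of-speech features around
# the target word.  Assumes NLTK and its default POS tagger model; the feature
# choices are illustrative only.
import nltk

def lexical_and_pos_features(tokens, position, pos_window=2):
    tagged = nltk.pos_tag(tokens)
    feats = {"right_word": tokens[position + 1].lower()
                           if position + 1 < len(tokens) else "<end>"}
    for offset in range(-pos_window, pos_window + 1):
        i = position + offset
        if 0 <= i < len(tokens) and offset != 0:
            feats[f"pos_{offset}"] = tagged[i][1]   # POS tag at this offset
    return feats

tokens = "we walked along the bank of the river".split()
print(lexical_and_pos_features(tokens, tokens.index("bank")))
```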