CSE 639 (original) (raw)
CIS 639
Statistical Approaches to Natural Language Processing
Spring 2002
(Constantly Under Construction)
Mitch Marcus
Office: Moore461a
Phone: 215-898-2538
See here for course syllabus and overview.
����������� Readings available on the web & additional readings
HMMs:
For more info: Jelinek, F. Statistical Methods for Speech Recognition. MIT Press: Cambridge (1998), Chapter 2.
Eric Brill, A Simple Rule-Based Part Of Speech Tagger, Proceedingsof ANLP-92, 3rd Conference on Applied Natural Language Processing
Eric Brill, Some Advances in Transformation-Based Part of Speech Tagging, AAI, Vol. 1, 1994
�
The class will interleave three modes:
- Lectures on the contents of Section III of� Manning & Sch�tze, Foundations of Statistical Natural Language Processing,�
- �Student-led discussions of recent papers on NP Chunking from the group of papers to be found
- at Erik Tjong Kim Sang�s web site on NP Chunking and the methods behind them including
- A Tutorial on Support Vector Machines for Pattern Recognition - Burges), and
- IGTree: Using Trees for Compression and Classification in Lazy Learning Algorithms - Daelemans, van den Bosch, Weijters).
- Group discussion of the details of maximum entropy and generative probabilistic models for statistical NLP included in Michael Collin's Ph.D. dissertation and Adwait Ratnaparkhi's Ph.D. dissertation.�
Required work will include leading a discussion of selected papers, a final paper or course project, and two or three exercises during the semester.