CSE 517 - Natural Language Processing (original) (raw)
Instructor: Yejin Choi (yejin at cs dot washington dot edu) Office hours: Mondays at 3 - 4pm at CSE 578 (and by appointment) | TA: Eunsol Choi (eunsol at cs dot washington dot edu) Office hours: Wednesdays 11am - noon at CSE 394 (and by appointment) TA: Andrea M. Kahn (amkahn at uw dot edu) Office hours: Thursdays 2 - 3pm at CSE 021 (and by appointment) |
---|
Schedule (subject to change)
Week | Dates | Topics & Lecture Slides | Notes (Required) | Textbook | Supplementary Readings |
---|---|---|---|---|---|
1 | Jan 5, 7 | Introduction [Slides]; Language Models (LM) [Slides] | LM Notes | J&M 4.1-4; M&S 6 | [Large LMs] [Berkeley LM] |
2 | Jan 12, 14 | Sequences: Language Models and Smoothing; Hidden Markov Models (HMMs) [Slides] | HMM Notes | J&M 4.5-7; M&S 6 | [Smoothing] |
3 | Jan 21 | Hidden Markov Models (HMMs) [Slides (Jan26)]& Part-Of-Speech Tagging [Slides (Jan26)] | J&M 5.1-5.3; 6.1-6.4; M&S 9, 10.1-10.3 | [TnT Tagger] [Stanford Tagger] [SOTA POS] | |
4 | Jan 26, 28 | Trees: Probabilistic Context Free Grammars (PCFG) and Parsing [Slides (Feb2)] | PCFG Notes, Lexicalized PCFGs | J&M 13-14; M&S 11-12 | [Syntax Intro] [Incremental] [Best First] [A* Parsing] [Lexicalized] [Unlexicalized] [Split Merge] |
5 | Feb 2, 4 | More Parsing [Slides (Feb2)]; Expectation Maximization (EM)[Slides] | EM Notes, Forward-backward, Inside-outside | J&M 6.5; M&S 9.3-4; 11.3-4 | [Semi-supervised Naive Bayes] [EM Tutorial] [EM for Feature-Rich] |
6 | Feb 9, 11 | Semantics: Frame Semantics [Slides (Feb2)]; Distributional Semantics | J&M 19.4; 20.7; 20.9; M&S 8 | [Fillmore-Tribute Workshop] [Frame-Semantic Parsing] [Composition in Distributional Models] | |
7 | Feb 18 | Machine Translation (MT): Word Alignment [Slides] | IBM Models 1 and 2 | J&M 25.1-6; M&S 13 | [IBM Models] [HMM Model] [MERT Training] |
8 | Feb 23, 25 | Phrase-based MT [Slides]; Syntax-based MT [Slides I] [II] | Phrase-based Notes | J&M 25.6-10; M&S 13 | [SCFG Tutorial] [Hiero] [Tree-to-String] [Tree-to-Tree] |
9 | Mar 2, 4 | Log-Linear / Feature-Rich Models: Conditional Random Fields (CRFs) [Slides] | Log-linear models CRF Notes | J&M 6.6-6.8; M&S 16.2-16.3 | [MaxExt] [CRF Tutorial] [CRF LM] [CRF Parsing] |
10 | Mar 9, 11 | Knowledge & Semantic Relations: Information Extraction; Entailment; [Slides] | J&M 22 | [Entailment Graphs] [Paraphrasing w/ MT] [Paraphrasing and Entailment] |
Textbooks
- Recommended: D. Jurafsky & James H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition, Prentice Hall, Second Edition, 2009. (J&M)
- Optional: C.D. Manning & H. Schuetze, Foundations of Statistical Natural Language Processing, Cambridge: MIT Press, 1999 (available online, free if accessed from UW computers) (M&S)
Contact
- Please feel free to email the course staff, addresses above, and come to office hours. Let us know if you need to meet outside of the scheduled hours, we will do our best to accomodate.
- We also have a GoPost discussion board. Please consider posting your questions there; everyone will benefit. We also encourage you to try to answer questions, which will count as class participation. We will monitor daily and contribute as long as the boards are being used.
- Grades: Assignment grades are posted in the online CSE 517 Gradebook. Please let us know if you see any errors.
Homeworks
We will have 4 programming-based homework assignments (60% of grade). Data/code/instruction are linked at Dropbox
- Assignment 1: Language Models (Due Tue Jan 20, 5pm)
- Assignment 2: HMMs (Due Fri Feb 6, 5pm)
- Assignment 3: Parsing (Due Fri Feb 20, 5pm)
- Assignment 4: MT (Due Fri Mar 6, 5pm) In addition, there will be lightweight 2 written homework assignments (15% of grade).
- Assignment 1: LM & HMMs (Due Mon Mar 2, 5pm)
- Assignment 2: MT (Due Tue Mar 10, 5pm) Please submit all your assignments to the online DropBox.
Final Mini-project
A final mini-project (20% of grade) will be completed during the last weeks of the term. Students are encouraged to design projects that overlap with their research interests.
- Proposal (Due Feb 13, 5pm)
- Final report (Due Mar 20, 5pm)
Grading
The final grade will consist of programming-based homeworks (60%), a final mini-project (20%), non-programming assignments (15%) and course/discussion board participation (5%). No midterm or final exam.
Course Administration and Policies
- Assignments must be done individually unless otherwise specified. You may discuss the subject matter with other students in the class, but all final answers must be your own work. You are expected to maintain the utmost level of academic integrity in the course.
- Each assignment may be handed in up to three days late, at a penalty of 10% of the maximum grade per day. You have 3 panelty-free late day credits that you can use at any time during the quarter. Above 10% substraction will apply only after you have used all your late day credits. Being late by a partial day (e.g., 1 hour) will be rounded up to 1 full day. This late day policy does not apply to the final project submission due to tight grading schedule at the end of the quarter.
- Comments can be sent to the instructor or TA using this anonymous feedback form.
![]() |
Department of Computer Science & Engineering University of Washington Box 352350 Seattle, WA 98195-2350 (206) 543-1695 voice, (206) 543-2969 FAX |