CS 6740/INFO 6300, Spring 2010 (original) (raw)

Instructor: Prof. Lillian Lee. Office hours: usually Fridays 10:45-11:45 or by appointment, but please check my webpage, http://www.cs.cornell.edu/home/llee, for updates.
Lectures: TR 10:10–11:25, Upson 315
Midterm: March 18th, in class; Final: Thursday May 13, 2-4:30pm, Upson 315

The content below renders in tab-organized format if javascript is enabled. This page sets a cookie so that the next time you return to this page (if ever ...), the tab from which you left will be open.

Administration

Brief overview

Philosophy, Spring 2010: This class is a graduate-level introduction to research fundamentals for information retrieval and natural language processing. The course focuses on the development and derivation of major ideas, and aims to promote research skills for students working in and outside of language technologies.While this course is thus not primarily a survey of the field, pointers to related/current work will be provided. Because of the wealth of Cornell machine-learning courses, learning is not an emphasis of this class (despite its immense importance in the field) to avoid overlap.

Please see the Course description and policies handout for more information.

Tentative syllabus: See “Lectures” section/tab.

Prerequisites: (Firm) knowledge of elementary computer science, probability, and linear algebra. Neither CS/INFO 4300 nor CS/COGST/LING 4740 are prerequisites.

Administrative handouts

General resources

Homepage for my previous running of this course click here for Fall 07

Reference texts

Pointers to papers

Alistair Moffat, Justin Zobel, and David Hawking, Recommended reading for IR research students, SIGIR Forum 39(2):3–14, 2005. [pdf,pdf2]

Cornell IR/NLP courses

Datasets and software:

0. (Jan 26) A prefatory lecture

1. (Jan 28) Information-retrieval basics (setting, evaluation); intro to the vector-space model

0. (Jan 26) A prefatory lecture

1. (Jan 28) Information-retrieval basics (setting, evaluation); intro to the vector-space model

1. (Jan 28) Information-retrieval basics (setting, evaluation); intro to the vector-space model

2. (Feb 2) length normalization (who'da thunk?)

3. (Feb 4) pivoted document-length normalization

1. (Jan 28) Information-retrieval basics (setting, evaluation); intro to the vector-space model

2. (Feb 2) length normalization (who'da thunk?)

3. (Feb 4) pivoted document-length normalization

4. (Feb 9) Evaluation: annotation and experimental design

5. (Feb 11) Introduction to (Robertson/Spärck Jones) probabilistic retrieval

6. (Feb 16) RSJ probabilistic retrieval: binary models and the IDF

7. (Feb 18) Two-Poisson models and BM weighting

5. (Feb 11) Introduction to (Robertson/Spärck Jones) probabilistic retrieval

6. (Feb 16) RSJ probabilistic retrieval: binary models and the IDF

7. (Feb 18) Two-Poisson models and BM weighting

8 (Feb 23) Intro to the language-modeling approach to IR

9 (Feb 25) About query likelihood; relevance LMs

10 (Mar 2) More on language models

11 (Mar 4) The Good-Turing estimate

12 (Mar 9) Smoothing; LM evaluation

13 (Mar. 16) Zipf's law and Miller's monkeys

8 (Feb 23) Intro to the language-modeling approach to IR

9 (Feb 25) About query likelihood; relevance LMs

10 (Mar 2) More on language models

11 (Mar 4) The Good-Turing estimate

12 (Mar 9) Smoothing; LM evaluation

13 (Mar. 16) Zipf's law and Miller's monkeys

14 (Mar 30) Relevance feedback

15 (Apr 1) Clickthrough data as implicit relevance feedback

16 (Apr 13) End relevance feedback; begin syntactic structure

14 (Mar 30) Relevance feedback

15 (Apr 1) Clickthrough data as implicit relevance feedback

16 (Apr 13) End relevance feedback; begin syntactic structure

16 (Apr 13) End relevance feedback; begin syntactic structure

17 (Apr 15) Feature-based CFGs with unification constraints

18 (Apr 20). Feature-based CFGs; TAGs

19 (Apr 22) More on TAGs

20 (April 27) Feature-based TAGs

16 (Apr 13) End relevance feedback; begin syntactic structure

17 (Apr 15) Feature-based CFGs with unification constraints

18 (Apr 20). Feature-based CFGs; TAGs

19 (Apr 22) More on TAGs

20 (April 27) Feature-based TAGs

21 (Apr 29) PCFGs and EM

22. Finish EM, start discourse

21 (Apr 29) PCFGs and EM

22. Finish EM, start discourse

22. Finish EM, start discourse

23 (May 6) Local and global theories of discourse

22. Finish EM, start discourse

23 (May 6) Local and global theories of discourse