CS 664 Computer Vision - Spring 2008 (original) (raw)

CS 664 Computer Vision � Spring2008
Cornell University

Class TR 2:55-4:10, 315 Upson

Professor: Dan Huttenlocher
4133 Upson
Office Hours, Wednesday 1-2pm
dph "at" cs.cornell.edu

Brief overview:

This course is intended for graduate students and advanced undergraduates who are interested in processing image and video data, in order to extract information about the scene that is being imaged.� There is no textbook for the course.�Handouts and papers will be made available online.� A recommended text is Forsyth and Ponce�s book �Computer Vision:� A Modern Approach�, but it covers topics that we won�t and vice versa.

The course has a more algorithmic flavor than many introductory computer vision courses.� We will focus on efficient algorithms, precise problem definitions and methods that work well in practice.�

We use material from various areas of algorithms and mathematics as well as requiring programming assignments, but this course does not teach algorithms, mathematics or programming. Thus we expect that students have good programming skills (using C or C++), a good mathematics background, and knowledge of algorithms. Students will be expected to pick up new mathematical and algorithmic techniques during the semester, as covered in lecture, and to relate the concepts from lecture to the programming assignments.

Assignments:

Assignment 1, filtering and edge detection
Assignment 2, project proposal

To hand in an assignment, login at http://cms.csuglab.cornell.eduwith your Cornell netid and password and go to assignment 1 to upload a single zip or tar file with your source, executable and writeup.��

Data for the final project:

Data files are available at http://web3.cs.cornell.edu/cs664/.�The README file provides information about the data files and sensor configuration.� For those who might care, the camera lens is Pentax Model#: C30405TH (4.8mm F/1.8).

Movies corresponding to the raw data files may also be useful, and are available as log1, log2part1, log2part2 and log2part3.� You may need the FFDShowcodec to view them.

Course outline:

Here is an outline of the topics to be covered and approximate schedule.�This schedule (and possibly also the topics) will be updated during the semester.� Readings will be handed out in class and added to the schedule as we go along.

1. Jan 22 Intro
2. Jan 24 Filtering Edge Detection Handout, Wells paper
3. Jan 29 NO CLASS
4. Jan 31 Edge detection Edge Detection Handout
5. Feb 5 Corner Detection (Crandall lecturer)
6. Feb 7 2D Geometry/Transforms
7. Feb 12 Interest points � SIFT features Lowe SIFT paper
8. Feb 14 Distance Transforms
9. Feb 19 Matching: Chamfer and Hausdorff Hausdorff Matching paper
10. Feb 21 3D camera geometry
11. Feb 26 3D camera geometry (cont�d)
12. Feb 28 Stereo Scharstein and Szeliski paper
13. Mar 4 CLASS CANCELLED
14. Mar 6 Markov Random Field models
15. Mar 11 MRF Inference � Loopy BP Felzenszwalb loopy bp paper
16. Mar 13 MRF Inference � Graph Cuts (Crandall) Boykov et al paper
17. Mar 25 Visual motion/optical flow
18. Mar 27 Parametric motion
19. Apr 1 Structure from motion Tomasi and Kanade paper
20. Apr 3 Some Recent Advances
21. Apr 8 Image Segmentation Felzenszwalb segmentation paper
22. Apr 10 Image Segmentation Weiss eigenvector segmentation paper
23. Apr 15 Face Recognition, Subspace Methods (Crandall)
24. Apr 17 Object Category Recognition, k-fans (Crandall) Crandall k-fans paper
25. Apr 22 Pictorial Structures and fast algorithms
26. Apr 24 Image Matching and Robust Fitting
27. Apr 29 Recognition using latent SVM�s Felzenszwalb et al paper
28. May 1 TBD

Course Requirements:

There will be a few short in-class quizzes, two assignments and a final project. The assignments and project will require programming, testing with image or video data, and a well thought-out write-up explaining what was done and what was learned.

The programming is probably best done in C or C++ due to the availability of libraries such as OpenCV, but Matlab can also be an option (Java is not recommended).�