Computer Vision: Fall 2024
Computer Vision (CMU 16-385)
This course provides a comprehensive introduction to computer vision. Major topics include image processing, detection and recognition, geometry-based and physics-based vision, and video analysis. Students will learn the basic concepts of computer vision and gain hands-on experience solving real-life vision problems.
Basic Info
Mon/Wed 11:00am-12:20pm
Tepper 1403
See the Course Info page for more info on policies and logistics.
Getting Started
To get started with the class you need to do just three things:
- Sign up for the course Piazza.
- Sign up for an account on this webpage. (The signup code is on Canvas.)
- Carefully read through the Course Info.
Fall 2024 Schedule
Assignments
Assignments will be released via Piazza. A list of assignments is available below. Reference material is available on the Lectures page.
| Due | Assignment |
|---|---|
| Sep 18 | Programming Assignment 1: Image Filtering and Hough Transform |
| Oct 2 | Programming Assignment 2: Augmented Reality with Planar Homographies |
| Oct 23 | Programming Assignment 3: 3D Reconstruction |
| Nov 6 | Programming Assignment 4: Scene Recognition with Bag of Words |
| Nov 20 | Programming Assignment 5: Neural Networks for Recognition |
| Dec 6 | Programming Assignment 6: Video Tracking |
Acknowledgments
The lecture notes have been pieced together from many different people and places. Special thanks to colleagues for sharing their slides: Matt O'Toole, Kris Kitani, Bob Collins, Srinivasa Narasimhan, Martial Hebert, Alyosha Efros, Ali Farhadi, Deva Ramanan, Yaser Sheikh, and Todd Zickler. Many thanks also to the following people for making their lecture notes and materials available online: Steve Seitz, Richard Szeliski, Larry Zitnick, Noah Snavely, Lana Lazebnik, Kristen Grauman, Yung-Yu Chuang, Tinne Tuytelaars, Fei-Fei Li, Antonio Torralba, Rob Fergus, David Claus, and Dan Jurafsky.