Lectures and Readings : Computer Vision : Fall 2024 (original) (raw)

Computer Vision (CMU 16-385)

The lecture slides for this course can be found here: Lecture Slides Folder

(Overview of computer vision)

(Image transformations, point image processing, linear shift-invariant image filtering, convolution, image gradients)

Basic reading:

(Image downsampling, aliasing, Gaussian image pyramid, Laplacian image pyramid, Fourier series, frequency domain, Fourier transform, frequency-domain filtering, sampling)

Basic reading:

Additional reading:

(Finding boundaries, line fitting, line parameterization, Hough transform, Hough circles)

Basic reading:

(Visualizing quadratics, Harris corner detector, multi-scale detection)

Basic reading:

(Designing feature descriptors, MOPS descriptor, GIST descriptor, Histogram of Textons descriptor, HOG descriptor, SIFT)

Basic reading:

(2D transformations, projective geometry, classification of 2D transformations, determining unknown 2D transformations)

Basic reading:

Additional reading:

(Panoramas, Image homographies, Computing with homographies, direct linear transform (DLT), random sample consensus (RANSAC))

Basic reading:

Additional reading:

(Introduction to learning-based vision, image classification, bag-of-words, K-means clustering, classification, K-nearest neighbors, naive Bayes, support vector machines)

Basic reading:

(Perceptron, neural networks, training perceptrons, gradient descent, backpropagation, stochastic gradient descent)

Basic reading (No standard textbooks yet!):

(Intro to vision for video, optical flow, constant flow, Horn-Schunck flow)

Basic reading:

(Motion magnification using optical flow, image alignment, Lucas-Kanade alignment, Baker-Matthews alignment, inverse alignment, KLT tracking, mean-shift tracking, modern trackers)

Basic reading: