Recent Advances in Computer Vision
CSE 291 Topics in Computer Vision, Winter 2018
Instructor: Manmohan Chandraker
Email: mkchandraker [AT] eng [DOT] ucsd [DOT] edu
TA: Shashank Shastry
Email: scshastr [AT] eng [DOT] ucsd [DOT] edu
Lectures: WF 5-6:20pm at CSE 4140
Instructor office hours: F 3-4pm at CSE 4122
TA office hours: Tu noon-1pm at CSE B215
Overview
The advent of deep learning has led to significant gains in computer vision. In this course, we will study advances in several areas of computer vision, such as image classification, object detection, semantic segmentation, 3D reconstruction and activity recognition. We will study these by discussing a mix of canonical and recent publications. A focus area will also be to study weakly supervised, self-supervised and domain adaptation frameworks in computer vision.
Prerequisites
This is an advanced class, covering recent developments in computer vision research. Students at all levels including undergraduates, masters and PhD, with a strong interest in computer vision may enroll. Comfort with optimization, linear algebra, probability and statistics is necessary. Prior background in computer vision and machine learning is desirable, preferably through research experience or as covered by CSE 152, 252 and similar offerings. Contact the instructor if unsure about meeting criteria for enrollment.
Course Format and Requirements
The course will involve presenting and discussing recent papers related to computer vision. The papers will be grouped by concept. These will be supported with lectures by the instructor to provide historical background or introduce fundamentals. Students will be required to do a project or write a critical analysis on a focused topic. Wide flexibility is available for the choice of topic, but should be discussed with the instructor beforehand. The best projects may consist of conceptual development that pushes the boundaries of the state-of-art, or a practical implementation that goes beyond the ones associated with the papers read in class.
Students may take the class with a letter grade option, which requires one or two in-class presentations (depending on enrollment strength), critical reviews, a final exam and completion of the project. Alternately, a 1 unit, S-U or P-NP registration is available, which requires in-class presentations and reviews. You may also audit, but a registration is preferred.
Grades will be weighted as 20% for in-class presentations, 20% for reviews, 20% for final exam, 10% for class particiation and 30% for the project. The project may be in groups of three. The goal of the course is to develop an understanding of the current state of computer vision and gain appreciation of its limits and potential.
Topics
The course will cover a diverse range of topics in computer vision. An inexhaustive list includes:
- Correspondence estimation
- Optical flow
- Stereo
- 3D Reconstruction
- Structure from motion
- Image classification
- Face recognition
- Human pose estimation
- Material estimation
- Object detection
- Semantic segmentation
- Action recognition
- Behavior prediction
- Adversarial learning
- Domain adaptation
Outline
Jan 10: Lecture
- Course overview, introduction to computer vision [PDF] [Compressed]
Jan 12: Lecture
Jan 17: Lecture
- Optimization and training of deep neural networks [PDF] [Compressed]
Jan 19: Lecture
Jan 24: Correspondence and stereo
- UCN and LIFT [PDF]
- Stereo CNN (Presenter: Chunyi Lyu) [PDF]
Jan 26: Optical flow
- Lecture [PDF]
- FlowNet (Presenter: Prahal Arora) [PDF]
Jan 31: Depth estimation
- Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture (Presenter: Dewal Gupta) [PDF]
- Unsupervised Monocular Depth Estimation with Left-Right Consistency (Presenter: Jiageng Zhang) [PDF]
Feb 02: Face recognition
- DeepFace: Closing the Gap to Human-Level Performance in Face Verification (Presenter: Zhongjian Zhu) [PDF]
- Deep Learning Face Representation by Joint Identification-Verification (Presenter: Tongzhou Mu) [PDF]
Face alignment
- Face Alignment Across Large Poses: A 3D Solution (Presenter: Shixin Li) [PDF]
- Deep Convolutional Inverse Graphics Network (Presenter: Patrick Hayes) [PDF]
Human pose estimation
- DeepPose: Human Pose Estimation via Deep Neural Networks (Presenter: Ling-Yi Liao) [PDF]
- Convolutional Pose Machines (Presenter: Aditi Ashutosh Mavalankar) [PDF]
Material and illumination
- Material Editing Using a Physically Based Rendering Network (Presenter: Hanxiao He) [PDF]
- Learning to Predict Indoor Illumination from a Single Image (Presenter: Chih-Hui Ho) [PDF]
Object detection
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (Presenter: Tushar Bansal) [PDF]
- You Only Look Once: Unified, Real-Time Object Detection (Presenter: Chun-Yan Yao) [PDF]
Object detection
- R-FCN: Object Detection via Region-based Fully Convolutional Networks (Presenter: Tianyu Bai) [PDF]
- SSD: Single Shot MultiBox Detector (Presenter: Siyu Jiang) [PDF]
Image captioning
- Show and Tell: A Neural Image Caption Generator (Presenter: Mithun Chakravarti) [PDF]
Multi-target tracking
- Online Multi-Target Tracking Using Recurrent Neural Networks (Presenter: Haotian Zhang) [PDF]
- Recurrent Autoregressive Networks for Online Multi-Object Tracking (Presenter: Ishan Gupta) [PDF]
Action recognition
- Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset (Presenter: Zhisheng Huang) [PDF]
Semantic segmentation
- Fully Convolutional Networks for Semantic Segmentation (Presenter: Nicolas Jourdan) [PDF]
- Multi-scale Context Aggregation by Dilated Convolutions (Presenter: Yian Li) [PDF]
Unsupervised domain adaptation
- Unsupervised Domain Adaptation by Backpropagation (Presenter: Xingyu Gu) [PDF]
- Domain Separation Networks (Presenter: Abhilash Kulkarni) [PDF]
Generative Adversarial Networks
- Generative Adversarial Text to Image Synthesis (Presenter: Jingyao Zhan) [PDF]
- Generating Videos with Scene Dynamics (Presenter: Sai Bi) [PDF]
Resources
- Books: There are no books required for this course. Any chapters of books that are extensively referenced in class will be provided as hand-outs.
- Papers: The papers will be provided as PDFs or made available for download through provided links.
Manmohan Chandraker
Last modified: Tue, Mar 20, 2018