Readings in Media Processing:

3D Computer Vision and Video Computing

CSc 83300, Spring 2006

Prof. Zhigang Zhu
Associate Professor of Computer Science
The City College of New York  and Graduate Center
The City University of New York (CUNY)

Time: Tuesday 6:30 - 8:30 pm
Room: 3305
Credits: 3.0

Office Hours: Tuesday 4:30 - 6:00 pm Rm 4439

Course Description

Computer vision has a rich history of work on stereo and visual motion, which has dealt with the problems of 3D reconstruction from stereo or multiple images, and structure from motion from video sequences. Recently, in addition to these traditional problems, the stereo and motion information present in multiple images or a video sequence is also being used to solve several other problems, for instance video mosaicing, video synthesis, video segmentation, video compression, video registration, and video surveillance an monitoring. This is summarized as Video Computing. Computer vision is playing an important and somewhat different role in solving these problems in video computing than the original image analysis considered in the early days of vision research.

The course "Video Computing and 3D Computer Vision" will include advanced topics in video computing as well as fundamentals in stereo and motion.

Course Organization

The course will consist of lectures by the instructor (about 30%), talks by a few invited speakers from both academia and industry doing active research in computer vision (about 10%),  and presentations by students for their readings and projects (60%).

 The course will be roughly divided into three stages:

(1) 3D Computer Vision Basics (lectures)
    - Introduction - Jan 30, 2006  (Homework #1 - Resume)
    - Camera models - Feb 07, 2006
    - Camera calibration - Feb 07, 2006 (Assignment #2- Due Feb 14)
    - Stereo vision - Feb 14, 2006 (Assignment #3- Due March 07, Grading for all the three Assignments)
    - Visual motion - Feb 28, 2006 ( Reading List)

(2) Video Computing (readings and presentations)

   -  Seminar on Airborne Remote Sensing, by Prof. Harvey Rhody at RIT - March 07, 2006

   -  Project/Reading Discussions (15~20 minutes for each student)- March 14, 2006 ( Reading List)

   -  Student Reading Presentations (Hu,Calle - 60 minutes each), March 21, 2004
   -  Student Reading Presentations (Ganic,Wang - 60 minutes each), March 28, 2004
   -  Student Reading Presentations (Elsayed, Wong - 60 minutes each), April  4, 2004

   -  CUNY Conference on 3D Imaging and Visualization,  April 4, 2006, 9:00 am - 4:00 pm
        ( you are required to attend at least part of the event;  no class meet on April 11)

Please continue your project designs and implementations before and during the spring break 

   -  Student Project/Reading Presentations (Wang, Ganic, Calle - 40 minutes each), April  25, 2004
   -  Student Project/Reading Presentations (Hu, Elsayed,Wong - 40 minutes each ), May 02, 2004

   -  Student Project Presentations (Wang, Ganic, Elsayed - 40 minutes each), May 09, 2004
   -  Student Project Presentations (Hu, Calle, Wong - 40 minutes each), May 16, 2004

Grading

Students who take the course for credits will be required
(1) to finish 2 assignments consisting of mainly paperwork  (20%)
(2) to submit a term paper or complete a project on a topic related to the material presented in the lectures/readings (40%), and
(3) to give at least two presentations to the class in the middle and at the end of the semester (40%). 

Textbook, References and Reading List

Textbook (for basic preparation):
    “Introductory Techniques for 3-D Computer Vision,”  Emanuele Trucco and Alessandro Verri, Prentice Hall, Inc., 1998  (ISBN: 0132611082, 343 pages ).

References:

  1.   “Computer Vision – A Modern Approach,” David A. Forsyth, Jean Ponce, Prentice Hall, 2003 (ISBN: 0130851981 , 693 pages).
  2.   “Three Dimensional Computer Vision: A Geometric Viewpoint,” Olivier Faugeras, The MIT Press, November 19, 1993 (ISBN: 0262061589 , 695 pages)


Copyright @ Zhigang Zhu (email zhu@cs.ccny.cuny.edu ), 2005-2006.