Motion Layer Extraction in the Presence of Occlusion

CCNY Seminar in Computer Vision, Perceptual Robotics and Video Computing

---------------------------------------------------------------------------------------------------

Motion Layers Extraction in the Presence of Occlusion Using Graph Cuts

Dr. Mubarak Shah
Computer Vision Lab, School of Computer Science
University of Central Florida, Orlando, FL 32816

http://www.cs.ucf.edu/~vision/

Date: October 11, 2004

Time: 1:30 pm – 2:30 pm

Location: NAC 8/207, City College of New York

Host: Prof. Zhigang Zhu <zzhu@ccny.cuny.edu>

Extracting layers from video is very important for video representation, analysis, compression, and synthesis. Assuming that a scene can be approximately described by multiple planar regions, we have developed a robust and novel approach to automatically extract a set of affine or projective transformations induced by these regions, correctly detect the occlusion pixels over multiple consecutive frames, and accurately segment the scene into several motion layers. First, using correspondences in two frames a number of seed regions are determined, and the regions are expanded and the outliers are rejected employing the two-terminal graph cuts method integrated with the level sets representation. Second, these initial regions are merged into several initial layers according to motion similarity. Third, the occlusion order constraints on multiple frames are explored, which guarantee that the occlusion area increases with the temporal order in a short period and effectively maintains segmentation consistency over multiple consecutive frames. Then, the correct layer segmentation is obtained by using the alpha expansion algorithm for solving the multiple labeling problem, and the occlusions between the overlapping layers are explicitly determined.

In this talk several experimental results will be demonstrated to show that our approach is effective and robust. Furthermore, based on motion segmentation, three novel applications: layer-based compression, layer-based video registration and layer- based video completion will be demonstrated.

For more details visit:

http://www.cs.ucf.edu/~vision/projects/motion_layer_extraction/

Dr. Mubarak Shah, a professor of Computer Science, and the founding director of the Computer Vision Laboratory at University of Central Florida (UCF), is a researcher in computer vision. He is a co-author of two books Video Registration (2003) and Motion-Based Recognition (1997), both by Kluwer Academic Publishers. He has supervised several Ph.D., M.S., and B.S. students to completion, and is currently directing twenty Ph.D. and several B.S. students. He has published close to one hundred fifty papers in leading journals and conferences on topics including activity and gesture recognition, violence detection, event ontology, object tracking (fixed camera, moving camera, multiple overlapping and non-overlapping cameras), video segmentation, story and scene segmentation, view morphing, ATR, wide-baseline matching, and video registration. . Dr. Shah is a fellow of IEEE, was an IEEE Distinguished Visitor speaker for 1997-2000, and is often invited to present seminars, tutorials and invited talks all over the world. He received the Harris Corporation Engineering Achievement Award in 1999, the TOKTEN awards from UNDP in 1995, 1997, and 2000; Teaching Incentive Program award in 1995 and 2003, Research Incentive Award in 2003, and IEEE Outstanding Engineering Educator Award in 1997. He is an editor of international book series on “Video Computing”; editor in chief of Machine Vision and Applications journal, and an associate editor Pattern Recognition journal. He was an associate editor of the IEEE Transactions on PAMI, and a guest editor of the special issue of International Journal of Computer Vision on Video Computing.