Open Access Open Access  Restricted Access Subscription or Fee Access

Abstracting Videos Using Gaussian Pyramids

K.S. Bhagat, D.T. Ingole

Abstract


Video and digital cameras provide relatively low resolution images, covering a limited field of view. Both the lower resolution and the limited field of view problems can be overcome by combining several images into an extended image mosaic.

The paper investigates the problem of how information contained in multiple, overlapping images of the same scene may be combined to produce images of superior quality. This area, generally titled image mosaic, offers the possibility of reducing noise, extending the field of view, removing blur, increasing spatial resolution and improving dynamic range. As such, this research has many applications in fields as diverse as forensic image restoration, computer generated special effects, video image compression, and digital video editing.

A coarse-to-fine method is used to produce better estimates. Images are put through the following pipeline. They are first smoothed with a 5 by 5 Gaussian filter with a standard deviation of 1.0. For each pair of neighboring images, they are registered under a coarse-to-fine hierarchy using the Gaussian pyramid [34] to produce better estimate. At the coarest level the complete graph is built. The shortest path is found by considering all nodes. This path is interpolated and the optical flow is determined from this path. The new interpolated path is used in next finer pyramid levels.

Throughout this work, the performance of the algorithm is evaluated using real image sequences.


Keywords


Gaussian Pyramids,video image compression and digital video editing.

Full Text:

PDF

References


Y. Wexler, D. Simakov, “Space Time Scene Manifold”, Dept. Of Computer Science and Applied Math,The Weizmann Institute of Science Rehovot, 76100 Israel

A. Rav-Acha and Y. Shor and S. Peleg, “Mosaicing with Parallax using Time Warping” Second IEEE Workshop on Image and Video Registration (IVR’04), 2004. Http://www.cs.huji.ac.il/_alexis/

M. Irani and B. Rousso and S. Peleg, “Recovery of Ego- Motion Using Region Alignment,” IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 19(3), pp. 268- 272, 1997

A. Zomet and D. Feldman and S. Peleg and D. Weinshall, “Mosaicing New Views: The Crossed-Slits Projection” IEEE Trans. On Pattern Analysis and Machine Intelligence, Vol. 25(6), pp 741-754, 2003

E.W. Dijkstra, “A Note on Two Problems in Connection with Graphs.” Numerische Math. Vol 1, pp. 269-271, 1959.

Y. Wexler and E. Shechtman and M. Irani, “Space-Time Video Completion”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’04), Vol. 1, Pp. 120-127, 2004.

S. Seitz and J. Kim, “The Space of All Stereo Images” IJCV, Marr Prize Special Issue, Vol. 48(1), pp. 21-38, 2002.

Http:/grail.cs.washington.edu/projects/stereo/

A.W. Fitzgibbon and Y. Wexler and A. Zisserman. “Imagebased rendering using image-based priors” International Conference on Computer Vision (ICCV), pp. 1176-1183, 2003.

V. Kwatra and A. Schdl and I. Essa and G. Turk and A. Bobick, “Graphcut Textures: Image and Video Synthesis Using Graph Cuts”, ACM Transactions on Graphics, SIGGRAPH 2003, Vol. 22(3), pp. 277-286, 2003.

A.A. Efros and T.K. Leung, “Texture Synthesis by Nonparametric Sampling” IEEE International Conference on Computer Vision, pp. 1033-1038, 1999.

A. Agarwala, C. Zheng, C. Pal, M. Agrawala, M. Cohen, B. Curless, D. Salesin, R. Szeliski, “Panoramic Video Textures” ACM Transactions on Graphics, SIGGRAPH 2005, 2005.

A. Klein, P. Sloan, A. Finkelstein and M. Cohen”, “Stylized video cubes”, ACM SIGGRAPH Symposium on Computer Animation, pp. 15-22, 2002.

Michal Irani, P Anandan, “ Efficient Representation of Video Sequences and Their Applications”,David Sarnoff Research Center, U.S.A

H. Wilson and J. Bergen' "A four mechanism model for threshold special vision", Vision Research. Vol. 19, pp. L9-31, 1979.

C. Anderson, "An alternative to the Burt pyramid algorithm", memo in preparation.

P Burt and E. Adelson, "The Laplacian Pyramid as a Compact Image Code," IEEE Transactions on Communication, COM-31 pp. 532-540, 1983a.

P. Burt, X. Xu and C. Yen, "Multi-Resolution Flow-Through Motion Analysis, " RCA Technical Report,PRRL-84-TR-009, 1984.

P. Burt and E. Adelson, "Multiresolution Spline with Application to Image Mosaics." ACM Transactions on Graphics, Vol. 2, pp. 217-236, 1983b.

Sing Bing Kang, “A survey of Image Based Rendering Techniques”, Cambridge Research Laboratory, Technical Report Series, August !997.

Carig B Knowles, “Temporal Image Mosaic And Its artistic Applications”,Queen’s University, December 2003.

E. H. Adelson , C. H. Anderson ,J. R. Bergen ,P. J. Burt ,J. M. Ogden,” Pyramid Methods in Image Processing”, RCA Engineer 29-6 Nov/Dec 1984

J. Torborg and J. T. Kajiya. Talisman: “Commodity realtime 3D graphics for the PC.” Computer Graphics (SIGGRAPH’96), pages 353–363, August 1996.

S. Peleg and J. Herman. “Panoramic mosaics by manifold projection.” In Conference on Computer Vision and Pattern Recognition, pages 338–343, San Juan, Puerto Rico, June 1997.

Svetlana Lazebnik, Cordelia Schmid, “Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories” Beckman InstituteUniversity of Illnois

M Sankar Kishore And K Veerabhadra Rao, “ A study of correlation technique on pyramid Processed images” sadhana, Vol. 25, Part 1, February 2000, pp. 37±43

Peter J. Burt, “The Laplacian Pyramid As A Compact Image Code”, Ieee Transactions On Communications, Vol. Com-3l, No. 4, April 1983

J.M. Ogden E.H. Adelson J R. Bergen |P.J. Burt, “Pyramid-based computer graphics”, RCA Corporation Final manuscript received October 21, 1985

P.Saravanan, Narayanan .C.K., P.V.S.S Prakash, and Prabhakara Rao .G.V, “Techniques for Video Mosaicing” Proceedings Of World Academy Of Science, Engineering And Technology Volume 5 April 2005 Issn 1307-6884

Craig B. Knowles, “The Temporal Image Mosaic and its Artistic Applications in Filmmaking”, Queen’s University Kingston, Ontario, Canada December 2003

E. H. Adelson C. H. Anderson J. R. Bergen P. J. Burt,J. M. Ogden, “Pyramid methods in image processing”, RCA Corporation Final manuscript received November 12, 1984

Michal Irani, P Anandan, “Efficient Representation of Video Sequences and Their Applications”, David Sarnoff Research Center, U.S.A

David Peter Capel, “Image Mosaicing and Super-resolution” Robotics Research Group Department of Engineering Science University of Oxford Trinity Term, 2001

Bruce D. Lucas Takeo Kanade, “An Iterative Image Registration Technique with an Application to Stereo Vision”

S. Coorg, N.Master, and S. Teller. “Acquisition of a large pose-mosaic dataset.” In Proc.IEEE Conference on Computer Vision and Pattern Recognition, Santa Barbara, pages 872.878, 1998.

E.W. Dijkstra, “A Note on Two Problems in Connection with Graphs.” Numerische Math. Vol 1, pp. 269-271, 1959.

S. Coorg and S. Teller. “Spherical mosaics with quaternions and dense correlation.” International Journal of Computer Vision, 37(3):259.273, June 2000.

J.E. Davis. “Mosaics of scenes with moving objects.” In Proc. IEEE Conference on Computer Vision and Pattern Recognition, Santa Barbara, pages 354.360, 1998.

E. Adelson. “ Layered representations for vision and video” In ICCV Workshop on the Representation of Visual Scenes, 1995.

M. Irani and S. Peleg. “Motion analysis for image enhancement:resolution, occlusion,And transparency.” Journal of Visual Communication and Image Representation,4:324.335, 1993.

S.B. Kang And R. Szeliski. “3-D Scene Data Recovery Using Omnidirectional Multibaseline Stereo.” In Proc. IEEE Conference On Computer Vision And Pattern Recognition, Pages 364.370, 1996.

Chapter 3 of: I. Drori, D. Cohen-Or, H. Yeshurun, “Fragment-Based Image Completion,” ACM SIGGRAPH, pp. 303 - 312, July 2003.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.