Open Access Open Access  Restricted Access Subscription or Fee Access

Automatic Text Extraction in a Complex Background and Different Font Styles Regions of Moving Videos

M.B. Suresh, T.R. Mahesh, M. Vinayababu

Abstract


Efficient content based retrieval of image and video databases is an important emerging application due to rapid proliferation of image and digital video data on the internet and corporate intranets and exponential growth of video content in general. Text either embedded or superimposed within video frames is very useful for describing the semantic content of the frames, as it enables both keyword and free-text based search, automatic video logging, and video cataloging. Extracting text directly from video data becomes especially important when closed captioning or speech recognition is not available to generate textual transcripts of audio or when video footage that completely lacks audio needs to be automatically annotated and searched based on frame content. Towards building a video query system, developed a scheme for automatically extracting text from digital image and videos for content annotation and robust text extract which can handle complex backgrounds in video frames, deal with different font sizes, font styles, and font appearances such as normal and inverse videos. The algorithm results in segmented characters from video frames that can be directly processed by an OCR system to produce ASCII text. Results from the experiments obtained from MPEG video streams demonstrate the good performance if our systems in terms of text identification accuracy and computational efficiency.

Keywords


Video, Digital Images, Content-Based Search and Annotation, Automatic Text Extraction, Image Segmentation, Character Recognition

Full Text:

PDF

References


JulindaGllavata, Ralph Ewerth et al.. “A Robust Algorithm for Text Detection in Images “,SFBEK 615, University of Siegen,, Germany.

J. Ohya, A. Shio, and S. Akarnatsu, "Recognizing characters in scene images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 16, pp. 214-220, Feb1994.

Xiaoou Tang, Bo Luo”Video text extraction using temporal feature vectors” in IEEE Proc. 2002

M. Smith and T. Kanade, "Video skimming and characterization through the combination of image and language understanding techniques," in Proc. IEEE Conference on Computer Vision and Pattern Recognition, (Puerto Rico), pp. 775-781, June 1997.

R. Lienhart, "Automatic text recognition for video indexing," in Proc. ACM Multimedia96, (Boston, MA), pp. 11-20, November 1996.

Detection of text on road signs from video, Wen Wu Xilin Chen Jie Yang Intelligent Transportation Systems, IEEE Transactionson On page(s): 378 - 390 , Volume: 6 Issue: 4, Dec. 2005.

Automatic text detection and tracking in digital video, Huiping Li Doermann, D. Kia, O. Image Processing, IEEE Transactions on page(s): 147 - 156 , Volume: 9 Issue: 1, Jan 2000.

Automatic caption localization in compressed video, Yu Zhong Hongjiang Zhang Jain, A.K. Pattern Analysis and Machine Intelligence, IEEE Transactions on page(s): 385 - 392 , Volume: 22 Issue: 4, Apr 2000.

AI at IBM Research, Apte, C. Morgenstern, L. Se June Hong Intelligent Systems and their Applications, IEEE On page(s): 51 - 57 , Volume: 15 Issue: 6, Nov/Dec 2000.

A spatial-temporal approach for video caption detection and recognition, Xiaoou Tang Xinbo Gao Jianzhuang Liu Hongjiang Zhang Neural Networks, IEEE Transactions on On page(s): 961 - 971 , Volume: 13 Issue: 4, Jul 2002.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.