Text Extraction in Video

Main Article Content

Dhanashri Holgare, Rutuja Talewar, Prof. Karishma Dhule

Abstract

The detection and extraction of scene and caption text from unconstrained, general purpose video is an important research problem in the context of content-based retrieval and summarization of visual information. The current state of the art for extracting text from video either makes simplistic assumptions as to the nature of the text to be found, or restricts itself to a subclass of the wide variety of text that can occur in broadcast video. Most published methods only work on artificial text (captions) that is composited on the video frame. Also, these methods have been developed for extracting text from images that have been applied to video frames. They do not use the additional temporal information in video to good effect.This thesis presents a reliable system for detecting, localizing, extracting, tracking and binarizing text from unconstrained, general-purpose video. In developing methods for extraction of text from video it was observed that no single algorithm could detect all forms of text. The strategy is to have a multi-pronged approach to the problem, one that involves multiple methods, and algorithms operating in functional parallelism. The system utilizes the temporal information available in video. The system can operate on JPEG images, MPEG-1 bit streams, as well as live video feeds. It is also possible to operate the methods individually and independently.

Article Details

How to Cite
, D. H. R. T. P. K. D. (2018). Text Extraction in Video. International Journal on Future Revolution in Computer Science &Amp; Communication Engineering, 4(3), 327–331. Retrieved from http://www.ijfrcsce.org/index.php/ijfrcsce/article/view/1315
Section
Articles