Movie Subtitle Extraction

As some movies, and specifically opera videos, contains embedded subtitles without accompanied text files, there is a need for a robust system capable of extracting these subtitles from the movie and into readable text. Extraction of this information involves detection, localization, tracking, enhancement, and recognition of the text from a given image.
However, variations of text due to differences in size, style, orientation, and alignment, as well as low image contrast and complex background make the problem of automatic text extraction challenging. This project is aimed at delivering such a system, for the purpose of movie subtitles extraction and possible translation later on. Whereas the first part of the project was researched and implemented within the Matlab environment, the second project consisted of a C++ implementation using the OpenCV library.

Movie Subtitle Extraction

 

Movie Subtitle Extraction
Movie Subtitle Extraction
Collaboration:

Prof. Boaz Porat