Automated system for text detection in individual video images

dc.contributor.authorDu, Yingze
dc.contributor.authorChang, Chein-I
dc.contributor.authorThouin, Paul D.
dc.date.accessioned2024-06-11T13:30:08Z
dc.date.available2024-06-11T13:30:08Z
dc.date.issued2003-07-1
dc.description.abstractText detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated systemfor text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
dc.description.sponsorshipThe authors would like to thank the U.S. Department of Defense for supporting their work through contract MDA-904-00-C2120. The authors would also like to thank Dr. D. Doermann of the Language and Media Processing Laboratory at the University of Maryland, College Park, for providing the database used for these experiments.
dc.description.urihttps://www.spiedigitallibrary.org/journals/journal-of-electronic-imaging/volume-12/issue-3/0000/Automated-system-for-text-detection-in-individual-video-images/10.1117/1.1584050.full
dc.format.extent13 pages
dc.genrejournal articles
dc.identifierdoi:10.13016/m24vuj-pxrl
dc.identifier.citationDu, Yingze, Chein-I. Chang, and Paul D. Thouin. “Automated System for Text Detection in Individual Video Images.” Journal of Electronic Imaging 12, no. 3 (July 2003): 410–22. https://doi.org/10.1117/1.1584050.
dc.identifier.urihttps://doi.org/10.1117/1.1584050
dc.identifier.urihttp://hdl.handle.net/11603/34564
dc.language.isoen_US
dc.publisherSPIE
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Student Collection
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rightsPublic Domain
dc.rights.urihttps://creativecommons.org/publicdomain/mark/1.0/
dc.titleAutomated system for text detection in individual video images
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5450-4891

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
410_1.pdf
Size:
985.7 KB
Format:
Adobe Portable Document Format