Automated system for text detection in individual video images

Du, Yingze; Chang, Chein-I; Thouin, Paul D.

Automated system for text detection in individual video images

dc.contributor.author	Du, Yingze
dc.contributor.author	Chang, Chein-I
dc.contributor.author	Thouin, Paul D.
dc.date.accessioned	2024-06-11T13:30:08Z
dc.date.available	2024-06-11T13:30:08Z
dc.date.issued	2003-07-1
dc.description.abstract	Text detection in video images is a challenging research problem because of the poor spatial resolution and complex background, which may contain a variety of colors. An automated systemfor text detection in video images is presented. It makes use of four modules to implement a series of processes to extract text regions from video images. The first module, called the multistage pulse code modulation (MPCM) module, is used to locate potential text regions in color video images. It converts a video image to a coded image, with each pixel encoded by a priority code ranging from 7 down to 0 in accordance with its priority, and further produces a binary thresholded image, which segments potential text regions from the background. The second module, called the text region detection module, applies a sequence of spatial filters to remove noisy regions and eliminate regions that are unlikely to contain text. The third module, called the text box finding module, merges text regions and produces boxes that are likely to contain text. Finally, the fourth module, called the optical character recognition (OCR) module, eliminates the text boxes that produce no OCR output. An extensive set of experiments is conducted and demonstrates that the proposed system is effective in detecting text in a wide variety of video images.
dc.description.sponsorship	The authors would like to thank the U.S. Department of Defense for supporting their work through contract MDA-904-00-C2120. The authors would also like to thank Dr. D. Doermann of the Language and Media Processing Laboratory at the University of Maryland, College Park, for providing the database used for these experiments.
dc.description.uri	https://www.spiedigitallibrary.org/journals/journal-of-electronic-imaging/volume-12/issue-3/0000/Automated-system-for-text-detection-in-individual-video-images/10.1117/1.1584050.full
dc.format.extent	13 pages
dc.genre	journal articles
dc.identifier	doi:10.13016/m24vuj-pxrl
dc.identifier.citation	Du, Yingze, Chein-I. Chang, and Paul D. Thouin. “Automated System for Text Detection in Individual Video Images.” Journal of Electronic Imaging 12, no. 3 (July 2003): 410–22. https://doi.org/10.1117/1.1584050.
dc.identifier.uri	https://doi.org/10.1117/1.1584050
dc.identifier.uri	http://hdl.handle.net/11603/34564
dc.language.iso	en
dc.publisher	SPIE
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.relation.ispartof	UMBC Student Collection
dc.rights	This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rights	Public Domain
dc.rights.uri	https://creativecommons.org/publicdomain/mark/1.0/
dc.title	Automated system for text detection in individual video images
dc.type	Text
dcterms.creator	https://orcid.org/0000-0002-5450-4891

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 410_1.pdf
Size:: 985.7 KB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Faculty Collection
UMBC Computer Science and Electrical Engineering Department
UMBC Student Collection