Document understanding using layout styles of title page images
1 August 1992
Louis H. Sharpe II, Basil Manns
Proceedings Volume 1661, Machine Vision Applications in Character Recognition and Industrial Inspection; (1992) https://doi.org/10.1117/12.130273
Event: SPIE/IS&T 1992 Symposium on Electronic Imaging: Science and Technology, 1992, San Jose, CA, United States
Abstract
An important problem in the application of compound document architectures is the input of data from raster images. One technique is to use visual, syntactic cues found in the layout of the raster document to infer its logical structure or semantics. Another is to use context derived from characters recognized within a given block of raster data. Both character-based and image-based information are considered here. A well-constrained environment is defined for use in developing rules that can be applied to basic book title page understanding. This paper identifies the attributes of title page layout objects which aid in mapping them into the fields of a simple bibliographic format. Using as input the raster images of the title page and the verso of the title page, along with the ASCII output of a generic character recognition engine from these same images, a system of rules is defined for generating a marked-up text in which key bibliographic fields may be identified.
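As an illustration of the general idea only, the sketch below combines one layout cue (character height and vertical position) with one character cue (a leading "by") to map title page blocks to bibliographic fields and emit marked-up text. The abstract does not state the paper's actual rules, so the `Block` type, the three rules, the 0.75 position threshold, the field names, and the sample data are all assumptions invented for this example.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """A hypothetical layout object: one text block found on the title page."""
    text: str      # ASCII output of the recognition engine for this block
    top: float     # vertical position, 0.0 = top of page, 1.0 = bottom
    height: float  # approximate character height in pixels


def label_blocks(blocks):
    """Map blocks to field names using three assumed rules."""
    fields = {}
    if not blocks:
        return fields
    # Assumed layout rule: the block set in the tallest type is the title.
    title = max(blocks, key=lambda b: b.height)
    fields["title"] = title.text
    for block in sorted(blocks, key=lambda b: b.top):
        if block is title:
            continue
        text = block.text.strip()
        # Assumed character rule: a leading "by" marks the author statement.
        if text.lower().startswith("by ") and "author" not in fields:
            fields["author"] = text[3:].strip()
        # Assumed layout rule: a block in the bottom quarter is the imprint.
        elif block.top >= 0.75 and "publisher" not in fields:
            fields["publisher"] = text
    return fields


def to_markup(fields):
    """Emit one tagged line per identified field, in a fixed order."""
    order = ["title", "author", "publisher"]
    return "\n".join(
        f"<{name}>{fields[name]}</{name}>" for name in order if name in fields
    )


# Invented sample data standing in for segmented, recognized blocks.
sample = [
    Block("by Jane Doe", top=0.45, height=18),
    Block("A HISTORY OF MAPS", top=0.20, height=40),
    Block("Example Press, 1990", top=0.90, height=14),
]
fields = label_blocks(sample)
markup = to_markup(fields)
print(markup)
```

A real system of this kind would need many more rules, and would also draw on the verso of the title page, which this sketch ignores.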
© (1992) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Louis H. Sharpe II and Basil Manns "Document understanding using layout styles of title page images", Proc. SPIE 1661, Machine Vision Applications in Character Recognition and Industrial Inspection, (1 August 1992); https://doi.org/10.1117/12.130273
CITATIONS
Cited by 3 scholarly publications and 1 patent.
KEYWORDS
Standards development
Raster graphics
Hough transforms
Optical character recognition
Photodynamic therapy
Visualization
Associative arrays

RELATED CONTENT

Similarity measures for pattern matching on-the-fly
Proceedings of SPIE (December 24 2013)
Extraction of text boxes from engineering drawings
Proceedings of SPIE (August 01 1992)
Detection of text strings from mixed text/graphics images
Proceedings of SPIE (December 21 2000)
Location and recovery of text on oriented surfaces
Proceedings of SPIE (December 22 1999)
Compressing images for the Internet
Proceedings of SPIE (January 02 1998)
Benchmarking of document page segmentation
Proceedings of SPIE (December 22 1999)
