05335290 is referenced by 120 patents and cites 7 patents.

In a character recognition system, a method and apparatus for segmenting a document image into areas containing text and non-text. Document segmentation in the present invention is comprised generally of the steps of: providing a bit-mapped representation of the document image, extracting run lengths for each scanline from the bit-mapped representation of the document image; constructing rectangles from the run lengths; initially classifying each of the rectangles as either text or non-text; correcting for the skew in the rectangles; merging associated text into one or more text blocks; and logically ordering the text blocks.

Title
Segmentation of text, picture and lines of a document image
Application Number
7/864423
Publication Number
5335290
Application Date
April 6, 1992
Publication Date
August 2, 1994
Inventor
Koichi Ejiri
Narashino
JP
John F Cullen
Palo Alto
CA, US
Agent
Blakely Sokoloff Taylor & Zafman
Assignee
Ricoh Company
JP
Ricoh Corporation
CA, US
IPC
G06K 9/34
View Original Source