1
Jan O Pedersen, Per Kristian Halvorsen, Douglass R Cutting, John W Tukey, Eric A Bier, Daniel G Bobrow: Iterative technique for phrase query formation and an information retrieval system employing same. Xerox Corporation, Oliff & Berridge, January 11, 1994: US05278980 (435 worldwide citation)

An information retrieval system and method are provided in which an operator inputs one or more query words which are used to determine a search key for searching through a corpus of documents, and which returns any matches between the search key and the corpus of documents as a phrase containing th ...


2
M Margaret Withgott, William Newman, Steven C Bagley, Daniel P Huttenlocher, Ronald M Kaplan, Todd A Cass, Per Kristian Halvorsen, John Seely Brown, Martin Kay: Method and apparatus for supplementing significant portions of a document selected without document image decoding with retrieved information. Xerox Corporation, Oliff & Berridge, May 5, 1998: US05748805 (129 worldwide citation)

A method and apparatus for applying morphological image criteria that identify image units in an undecoded document image having significant information content, and for retrieving related data that supplements the document either from elsewhere within the document or a source external to the docume ...


3
M Margaret Withgott, Steven C Bagley, Dan S Bloomberg, Daniel P Huttenlocher, Ronald M Kaplan, Todd A Cass, Per Kristian Halvorsen, Ramana B Rao, Douglass R Cutting: Methods and apparatus for selecting semantically significant images in a document image without decoding image content. Xerox Corporation, Oliff & Berridge, February 14, 1995: US05390259 (57 worldwide citation)

A method and apparatus for processing a document image, using a programmed general or special purpose computer, includes forming the image into image units, and at least one image unit classifier of at least one of the image units is determined, without decoding the content of the at least one of th ...


4
M Margaret Withgott, Steven C Bagley, Dan S Bloomberg, Per Kristian Halvorsen, Daniel P Huttenlocher, Todd A Cass, Ronald M Kaplan, Ramana R Rao: Method and apparatus for summarizing a document without document image decoding. Xerox Corporation, Oliff & Berridge, February 13, 1996: US05491760 (55 worldwide citation)

A method and apparatus for excerpting and summarizing an undecoded document image, without first converting the document image to optical character codes such as ASCII text, identifies significant words, phrases and graphics in the document image using automatic or interactive morphological image re ...


5
Todd A Cass, Per Kristian Halvorsen, Daniel P Huttenlocher, Ronald M Kaplan, M Margaret Withgott: Method and apparatus for determining the frequency of words in a document without document image decoding. Xerox Corporation, Oliff & Berridge, June 28, 1994: US05325444 (52 worldwide citation)

A method and apparatus for determining word frequency from a document without first converting the document to character codes. The method includes morphological image processing to determine word unit characteristics for placement into equivalence classes utilizing non-content based information. Wo ...


6
Daniel P Huttenlocher, Ronald M Kaplan, M Margaret Withgott, Todd A Cass, Per Kristian Halvorsen, Dan S Bloomberg, Ramana B Rao: Methods and apparatus for automatic modification of semantically significant portions of a document without document image decoding. Xerox Corporation, Oliff & Berridge, January 24, 1995: US05384863 (29 worldwide citation)

Methods and apparatus of processing an undecoded document image in a digital computer to modify the document image so as to emphasize semantically significant portions without first converting the document image to character codes. The document image is segmented into image units, and morphological ...