1
Hinrich Schuetze, Francine R Chen, Peter L Pirolli, James E Pitkow, Ed H Chi, Jun Li, Ullas Gargi: System and method for quantitatively representing data objects in vector space. Xerox Corporation, July 26, 2005: US06922699 (141 worldwide citation)

A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quant ...


2
Francine R Chen, Lynn D Wilcox, Dan S Bloomberg: Word spotting in bitmap images using word bounding boxes and hidden Markov models. Xerox Corporation, Townsend and Townsend Khourie and Crew, August 1, 1995: US05438630 (118 worldwide citation)

Font-independent spotting of user-defined keywords in a scanned image. Word identification is based on features of the entire word without the need for segmentation or OCR, and without the need to recognize non-keywords. Font-independent character models are created using hidden Markov models (HMMs) ...


3
Julian M Kupiec, Jan O Pedersen, Francine R Chen, Daniel C Brotsky, Steven B Putz: Automatic method of generating feature probabilities for automatic extracting summarization. Xerox Corporation, Tracy L Hurt, July 7, 1998: US05778397 (110 worldwide citation)

A method of automatically generating feature probabilities that allow later automatic generation of document extracts. The computer system generates the probabilities by analyzing each document a document at a time. First, the computer system designates one of the documents as a selected document. N ...


4
Francine R Chen, Hinrich Schuetze, Ullas Gargi: System and method for information browsing using multi-modal features. Xerox Corporation, April 27, 2004: US06728752 (100 worldwide citation)

A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for ...


5
Hinrich Schuetze, Francine R Chen, Peter L Pirolli, James E Pitkow, Ed H Chi, Jun Li: System and method for identifying similarities among objects in a collection. Xerox Corporation, September 6, 2005: US06941321 (93 worldwide citation)

A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quant ...


6
Ayman O Farahat, Francine R Chen, Charles R Mathis, Geoffrey D Nunberg: Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections. Xerox Corporation, Oliff & Berridge, Eugene Palazzo, March 6, 2007: US07188117 (54 worldwide citation)

Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, process ...


7
Francine R Chen: Automatic method of generating thematic summaries. Xerox Corporation, Tracy L Hurt, November 18, 1997: US05689716 (53 worldwide citation)

A technique for automatically generating thematic summaries for machine readable representations of documents. The technique begins with determining the number of thematic terms to be used based upon the number of thematic sentence to be extracted. To insure some commonality of theme between extract ...


8
Donald G Kimber, Lynn D Wilcox, Francine R Chen: Method of speaker clustering for unknown speakers in conversational audio data. Xerox Corporation, R Christine Jacobs, Tracy L Hurt, January 28, 1997: US05598507 (53 worldwide citation)

A method for clustering speaker data from a plurality of unknown speakers. The method includes steps of providing a portion of audio data containing speech from at least all the speakers in the audio data and dividing the portion into data clusters. A pairwise distance between each pair of clusters ...


9
Vijay Balasubramanian, Francine R Chen, Philip A Chou, Donald G Kimber, Alex D Poon, Karon A Weber, Lynn D Wilcox: Segmentation of audio data for indexing of conversational speech for real-time or postprocessing applications. Xerox Corporation, R Christine Jacobs, Tracy L Hurt, August 5, 1997: US05655058 (50 worldwide citation)

A method for segmenting audio data, comprising speech from a plurality of individual speakers, according to speaker is provided. The method comprises providing individual HMMs for each individual speaker, each individual HMM including at least one state, and constructing a speaker network HMM by con ...


10
Francine R Chen, Lynn D Wilcox, Dan S Bloomberg: Word spotting in bitmap images using text line bounding boxes and hidden Markov models. Xerox Corporation, Townsend and Townsend and Crew, April 28, 1998: US05745600 (49 worldwide citation)

Font-independent spotting of user-defined keywords in a scanned image. Word identification is based on features of the entire word without the need for segmentation or OCR, and without the need to recognize non-keywords. Font-independent character models are created using hidden Markov models (HMMS) ...