1
William Pugh, Monika H Henzinger: Detecting duplicate and near-duplicate files. Google, John C Pokotylo, Straub & Pokotylo, December 2, 2003: US06658423 (455 worldwide citation)

Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the popu ...


2
Alexander Mark Franz, Monika H Henzinger, Sergey Brin, Brian Christopher Milch: Voice interface for a search engine. Google, Harrity Snyder, April 11, 2006: US07027987 (186 worldwide citation)

A system provides search results from a voice search query. The system receives a voice search query from a user, derives one or more recognition hypotheses, each being associated with a weight, from the voice search query, and constructs a weighted boolean query using the recognition hypotheses. Th ...


3
Luis Gravano, Monika H Henzinger: Systems and methods for using anchor text as parallel corpora for cross-language information retrieval. Google, Harrity Snyder, December 5, 2006: US07146358 (95 worldwide citation)

A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the tr ...


4
Monika H Henzinger: Loadbalancing multiple files across computing devices. Google, Harrity & Harrity, December 8, 2009: US07631310 (93 worldwide citation)

A load balancer evenly distributes processing loads to multiple computing devices. A data structure may be divided into multiple files, each of which corresponds to an estimated load value. The files are assigned to the computing devices in such a way that the processing load at each of the computin ...


5
Georges R Harik, Monika H Henzinger: Document ranking based on semantic distance between terms in a document. Google, Harrity & Harrity, May 11, 2010: US07716216 (89 worldwide citation)

Techniques are disclosed that locate implicitly defined semantic structures in a document, such as, for example, implicitly defined lists in an HTML document. The semantic structures can be used in the calculation of distance values between terms in the documents. The distance values may be used, fo ...


6
Lance M Berc, Sanjay Ghemawat, Monika H Henzinger, Richard L Sites, Carl A Waldspurger, William E Weihl: High frequency sampling of processor performance counters. Digital Equipment Corporation, Dirk Brinkman, August 18, 1998: US05796939 (79 worldwide citation)

In a computer system, an apparatus is configured to collect performance data of a computer system including a plurality of processors for concurrently executing instructions of a program. A plurality of performance counters are coupled to each processor. The performance counters store performance da ...


7
Andrei Z Broder, Michael Burrows, Monika H Henzinger, Sanjay Ghemawat, Puneet Kumar, Suresh Venkatasubramanian: Connectivity server for locating linkage information between Web pages. Alta Vista Company, Pennie & Edmonds, June 6, 2000: US06073135 (77 worldwide citation)

A server computer is provided for representing and navigating the connectivity of Web pages. The Web pages include links to other Web pages. The links and Web page s have associated names (URLs). The names of the Web pages are sorted in a memory of the connectivity server. The sorted names are delta ...


8
William Pugh, Monika H Henzinger: Detecting duplicate and near-duplicate files. Google, Straub and Pokotylo, John C Pokotylo, April 29, 2008: US07366718 (49 worldwide citation)

Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the popu ...


9
Urs Hoelzle, Monika H Henzinger, David Desjardins: Systems and methods for performing in-context searching. Google, Harrity Snyder, December 4, 2007: US07305380 (43 worldwide citation)

A system limits search results based on context information. The system obtains the context information and a search query, and obtains a set of references to documents in response to the search query. The system then filters the set of references based on the context information and presents the fi ...


10
Alexander Mark Franz, Monika H Henzinger, Sergey Brin, Brian Christopher Milch: Voice interface for a search engine. Google, Harrity Snyder, April 29, 2008: US07366668 (38 worldwide citation)

A system provides search results from a voice search query. The system receives a voice search query from a user, derives one or more recognition hypotheses, each being associated with a weight, from the voice search query, and constructs a weighted boolean query using the recognition hypotheses. Th ...