08122026 is referenced by 61 patents and cites 254 patents.

A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity.

Title
Finding and disambiguating references to entities on web pages
Application Number
11/551657
Publication Number
8122026 (B1)
Application Date
October 20, 2006
Publication Date
February 21, 2012
Inventor
Jeffrey Reynar
New York
NY, US
Nikolai V Yakovenko
New York
NY, US
Nikola Jevtic
Newark
NJ, US
Leonardo A Laroco Jr
Philadelphia
PA, US
Agent
Morgan Lewis & Bockius
Assignee
Google
CA, US
IPC
G06F 17/30
View Original Source