07983903 is referenced by 20 patents and cites 15 patents.

Systems and methods for identifying translation pairs from web pages are provided. One disclosed method includes receiving monolingual web page data of a source language, and processing the web page data by detecting the occurrence of a predefined pattern in the web page data, and extracting a plurality of translation pair candidates. Each of the translation pair candidates may include a source language string and target language string. The method may further include determining whether each translation pair candidate is a valid transliteration. The method may also include, for each translation pair that is determined not to be a valid transliteration, determining whether each translation pair candidate is a valid translation. The method may further include adding each translation pair that is determined to be a valid translation or transliteration to a dictionary.

Title
Mining bilingual dictionaries from monolingual web pages
Application Number
11/851402
Publication Number
7983903 (B2)
Application Date
September 7, 2007
Publication Date
July 19, 2011
Inventor
Jianfeng Gao
Kirkland
WA, US
Agent
Alleman Hall McCoy Russell & Tuttle
Assignee
Microsoft Corporation
WA, US
IPC
G06F 17/20
G06F 17/28
G06F 17/21
View Original Source