Patent 07493251 is referenced by 14 patents and cites 5 patents.

A method and apparatus for segmenting text is provided that identifies a sequence of entity types from a sequence of characters and thereby identifies a segmentation for the sequence of characters. Under the invention, the sequence of entity types is identified using probabilistic models that describe the likelihood of a sequence of entities and the likelihood of sequences of characters given particular entities. Under one aspect of the invention, organization name entities are identified from a first sequence of identified entities to form a final sequence of identified entities.
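As a rough illustration of the idea in the abstract, the sketch below picks the segmentation and entity-type sequence that maximizes the product of a source model (likelihood of the entity-type sequence) and a channel model (likelihood of the characters given each entity type). It is not the patented method. The entity types, the bigram form of the source model, the toy English input, and every probability are assumptions made up for this example, and `segment`, `SOURCE`, `CHANNEL`, and `MAX_LEN` are hypothetical names.

```python
import math

# Hypothetical channel model: P(character string | entity type).
CHANNEL = {
    "WORD": {"new": 0.2, "york": 0.05, "is": 0.3, "big": 0.2},
    "LOCATION": {"newyork": 0.5},
}

# Hypothetical source model: P(entity type | previous entity type).
# "<s>" marks the start of the sequence.
SOURCE = {
    ("<s>", "WORD"): 0.7, ("<s>", "LOCATION"): 0.3,
    ("WORD", "WORD"): 0.8, ("WORD", "LOCATION"): 0.2,
    ("LOCATION", "WORD"): 0.9, ("LOCATION", "LOCATION"): 0.1,
}

MAX_LEN = 7  # longest candidate segment considered (assumption)


def segment(chars):
    """Return the best [(segment, entity_type), ...] or None if none exists."""
    # best[(i, t)] = (log probability, backpointer) for the best analysis
    # of chars[:i] whose last segment has entity type t.
    best = {(0, "<s>"): (0.0, None)}
    for end in range(1, len(chars) + 1):
        for start in range(max(0, end - MAX_LEN), end):
            piece = chars[start:end]
            for etype, table in CHANNEL.items():
                p_chan = table.get(piece)
                if p_chan is None:
                    continue
                # Extend every analysis that ends exactly at `start`.
                for (pos, prev), (score, _) in list(best.items()):
                    if pos != start:
                        continue
                    p_src = SOURCE.get((prev, etype))
                    if p_src is None:
                        continue
                    cand = score + math.log(p_src) + math.log(p_chan)
                    key = (end, etype)
                    if key not in best or cand > best[key][0]:
                        best[key] = (cand, (start, prev, piece))
    finals = [(s, k) for k, (s, _) in best.items() if k[0] == len(chars)]
    if not finals:
        return None
    _, key = max(finals)
    # Follow backpointers from the end of the string to the start.
    out = []
    while best[key][1] is not None:
        start, prev, piece = best[key][1]
        out.append((piece, key[1]))
        key = (start, prev)
    return out[::-1]


print(segment("newyorkisbig"))
# [('newyork', 'LOCATION'), ('is', 'WORD'), ('big', 'WORD')]
```

With these made-up numbers, the single LOCATION reading of "newyork" scores higher than the two-WORD reading "new" + "york", so choosing the entity-type sequence also fixes the segmentation.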

Title
Using source-channel models for word segmentation
Application Number
10/448644
Publication Number
7493251 (B2)
Application Date
May 30, 2003
Publication Date
February 17, 2009
Inventor
Ming Zhou, Beijing, CN
Lei Zhang, Beijing, CN
Jian Sun, Beijing, CN
Chang Ning Huang, Beijing, CN
Mu Li, Beijing, CN
Jianfeng Gao, Beijing, CN
Agent
Westman Champlin & Kelly P A
Thomas M Magee
Assignee
Microsoft Corporation, WA, US
IPC
G10L 11/00
G06F 17/27