07043422 is referenced by 107 patents.

A method and apparatus are provided for adapting a language model to a task-specific domain. Under the method and apparatus, the relative frequency of n-grams in a small training set (i.e. task-specific training data set) and the relative frequency of n-grams in a large training set (i.e. out-of-domain training data set) are used to weight a distribution count of n-grams in the large training set. The weighted distributions are then used to form a modified language model by identifying probabilities for n-grams from the weighted distributions.

Title
Method and apparatus for distribution-based language model adaptation
Application Number
9/945930
Publication Number
7043422 (B2)
Application Date
September 4, 2001
Publication Date
May 9, 2006
Inventor
Mingjing Li
Beijing
CN
Jianfeng Gao
Beijing
CN
Agent
Westman Champlin & Kelly P A
Theodore M Magee
Assignee
Microsoft Corporation
WA, US
IPC
G06F 17/27
View Original Source