1
Hai Feng Wang, Chang Ning Huang, Kai Fu Lee, Shuo Di, Jianfeng Gao, Dong Feng Cai, Lee Feng Chien: System and iterative method for lexicon, segmentation and language model joint optimization. Microsoft Corporation, Microsoft, June 7, 2005: US06904402 (77 worldwide citation)

A method for optimizing a language model is presented comprising developing an initial language model from a lexicon and segmentation derived from a received corpus using a maximum match technique, and iteratively refining the initial language model by dynamically updating the lexicon and re-segment ...


2
Ming Zhou, Hua Wu, Yue Zhang, Jianfeng Gao, Chang Ning Huang: Method and system for retrieving confirming sentences. Microsoft Corporation, John D Veldhuis Kroeze, Westman Champlin & Kelly P A, March 20, 2007: US07194455 (22 worldwide citation)

A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engin ...


3
Jianfeng Gao, Mu Li, Chang Ning Huang, Jian Sun, Lei Zhang, Ming Zhou: Using source-channel models for word segmentation. Microsoft Corporation, Thomas M Magee, Westman Champlin & Kelly P A, February 17, 2009: US07493251 (14 worldwide citation)

A method and apparatus for segmenting text is provided that identifies a sequence of entity types from a sequence of characters and thereby identifies a segmentation for the sequence of characters. Under the invention, the sequence of entity types is identified using probabilistic models that descri ...


4
Endong Xun, Ming Zhou, Chang Ning Huang: System and method for identifying base noun phrases. Microsoft Corporation, Joseph R Kelly, Westman Champlin & Kelly P A, February 22, 2005: US06859771 (9 worldwide citation)

A system and method identify base noun phrases (baseNP) in a linguistic input. A part-of-speech tagger identifies N-best part-of-speech tag sequences corresponding to the linguistic input. A baseNP identifier identifies baseNPs in the linguistic input using a unified statistical model that identifie ...


5
Endong Xun, Ming Zhou, Chang Ning Huang: System and method for identifying base noun phrases. Microsoft Corporation, Westman Champlin & Kelly P A, February 24, 2009: US07496501 (5 worldwide citation)

A system and method identify base noun phrases (baseNP) in a linguistic input. A part-of-speech tagger identifies N-best part-of-speech tag sequences corresponding to the linguistic input. A baseNP identifier identifies baseNPs in the linguistic input using a unified statistical model that identifie ...


6
Chang Ning Huang, Hong Qiao Li, Jianfeng Gao: Standardized natural language chunking utility. Microsoft Corporation, Westman Champlin & Kelly P A, March 2, 2010: US07672832 (4 worldwide citation)

A method is disclosed for providing a chunking utility that supports robust natural language processing. A corpus is chunked in accordance with a draft chunking specification. Chunk inconsistencies in the corpus are automatically flagged for resolution, and a chunking utility is provided in which at ...


7
Ming Zhou, Hua Wu, Yue Zhang, Jianfeng Gao, Chang Ning Huang: Method and system for retrieving confirming sentences. Joseph R Kelly, Joseph R Kelly, Westman Champlin & Kelly P A, July 5, 2011: US07974963 (1 worldwide citation)

A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engin ...


8
Hua Wu, Ming Zhou, Chang Ning Huang: Processing noisy data and determining word similarity. Microsoft Corporation, Joseph R Kelly, Westman Champlin & Kelly P A, March 11, 2008: US07343280

The present invention deals with noisy data not by eliminating low frequency dependency structures, but rather by weighting the dependency structures. The dependency structures are weighted to give less weight to dependency structures which are more likely incorrect and to give more weight to depend ...


9
Jianfeng Gao, Mu Li, Chang Ning Huang, Jian Sun, Lei Zhang, Ming Zhou: Method and apparatus using source-channel models for word segmentation. Microsoft Corporation, Theodore M Magee, Westman Champlin & Kelly, December 2, 2004: US20040243408-A1

A method and apparatus for segmenting text is provided that identifies a sequence of entity types from a sequence of characters and thereby identifies a segmentation for the sequence of characters. Under the invention, the sequence of entity types is identified using probabilistic models that descri ...


10
Ming Zhou, Hua Wu, Yue Zhang, Jianfeng Gao, Chang Ning Huang: Method and system for retrieving confirming sentences. John Veldhuis Kroeze, Westman & Champlin & Kelly, March 25, 2004: US20040059718-A1

A method, computer readable medium and system are provided which retrieve confirming sentences from a sentence database in response to a query. A search engine retrieves confirming sentences from the sentence database in response to the query. IN retrieving the confirming sentences, the search engin ...