06738745 is referenced by 24 patents and cites 1 patents.

Methods and apparatus are disclosed for detecting non-target language references in an audio transcription or speech recognition system using a confidence score. The confidence score may be based on (i) a probabilistic engine score provided by a speech recognition system, (ii) additional scores based on background models, or (iii) a combination of the foregoing. The engine score provided by the speech recognition system for a given input speech utterance reflects the degree of acoustic and linguistic match of the utterance with the trained target language. The background models are created or trained based on speech data in other languages, which may or may not include the target language itself. A number of types of background language models may be employed for each modeled language, including one or more of (i) prosodic models; (ii) acoustic models; (iii) phonotactic models; and (iv) keyword spotting models. The engine score can be combined with the background model scores to normalize the engine score for non-target languages. The present invention identifies a non-target language utterance within an audio stream when the confidence score falls below a predefined criteria. A language rejection mechanism can interrupt or modify the transcription process when speech in the non-target language is detected.

Title
Methods and apparatus for identifying a non-target language in a speech recognition system
Application Number
9/544678
Publication Number
6738745 (B1)
Application Date
April 7, 2000
Publication Date
May 18, 2004
Inventor
Mahesh Viswanathan
Yorktown Heights
NY, US
Jiri Navratil
White Plains
NY, US
Agent
Ryan Mason & Lewis
US
Agent
Thu Ann Dang Esq
US
Assignee
International Business Machines Corporation
NY, US
IPC
G10L 15/08
View Original Source