1
Ian M Bennett: Query engine for processing voice based queries including semantic decoding. Phoenix Solutions, PatentBest, Andrew McAleavey, May 25, 2010: US07725307 (189 worldwide citation)

An intelligent query system for processing voiced-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's ...


2
Alexander Mark Franz, Monika H Henzinger, Sergey Brin, Brian Christopher Milch: Voice interface for a search engine. Google, Harrity Snyder, April 11, 2006: US07027987 (186 worldwide citation)

A system provides search results from a voice search query. The system receives a voice search query from a user, derives one or more recognition hypotheses, each being associated with a weight, from the voice search query, and constructs a weighted boolean query using the recognition hypotheses. Th ...


3
Mukund Padmanabhan, Michael Picheny, David Nahamoo, Salim Roukos: Telephone messaging and editing system. International Business Machines Corporation, F Chau & Associates, April 17, 2001: US06219638 (176 worldwide citation)

A messaging system for receiving speech over a telephone and converting the speech to text includes a first server for receiving speech input by a user, a speech recognition system for converting the speech to text, a speech synthesizer for converting the text to speech for playing back the synthesi ...


4
Dragutin Petkovic, Dulce Beatriz Ponceleon, Savitha Srinivasan: System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval. International Business Machines Corporation, John L Rogitz, February 6, 2001: US06185527 (175 worldwide citation)

A system and method for indexing an audio stream for subsequent information retrieval and for skimming, gisting, and summarizing the audio stream includes using special audio prefiltering such that only relevant speech segments that are generated by a speech recognition engine are indexed. Specific ...


5
David E Heckerman, Fileno A Alleva, Robert L Rounthwaite, Daniel Rosen, Mei Yuh Hwang, Yoram Yaacovi, John L Manferdelli: Methods and apparatus for automatically synchronizing electronic audio files with electronic text files. Microsoft Corporation, Michael P Straub, Straub & Pokotylo, July 10, 2001: US06260011 (158 worldwide citation)

Automated methods and apparatus for synchronizing audio and text data, e.g., in the form of electronic files, representing audio and text expressions of the same work or information are described. A statistical language model is generated from the text data. A speech recognition operation is then pe ...


6
Philip Neil Garner, Jason Peter Andrew Charlesworth, Asako Higuchi: Language recognition using a similarity measure. Canon Kabushiki Kaisha, Fitzpatrick Cella Harper & Scinto, December 18, 2007: US07310600 (154 worldwide citation)

A dynamic programming technique is provided for matching two sequences of phonemes both of which may be generated from text or speech. The scoring of the dynamic programming matching technique uses phoneme confusion scores, phoneme insertion scores and phoneme deletion scores which are obtained in a ...


7
Laurence S Gillick, Joel M Gould, Robert Roth, Paul A van Mulbregt, Michael D Bibeault: Speech recognition language models. Dragon Systems, Fish & Richardson P C, December 26, 2000: US06167377 (136 worldwide citation)

Language model results are combined according to a combination expression to produce combined language model results for a set of candidates. A candidate is selected and the combination expression is adjusted using language model results associated with the selected candidate.


8
Michael S Phillips, Etienne Barnard, Jean Guy Dahan, Michael J Metzger: Dynamic semantic control of a speech recognition system. Speechworks International, Fish & Richardson P C, February 11, 2003: US06519562 (122 worldwide citation)

A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is ...


9
Jhing Fa Wang, Jia Ching Wang, Tai Lung Chen, Chin Chan Chang: Speech recognition system. National Cheng Kung University, September 4, 2007: US07266496 (120 worldwide citation)

The present invention discloses a complete speech recognition system having a training button and a recognition button, and the whole system uses the application specific integrated circuit (ASIC) architecture for the design, and also uses the modular design to divide the speech processing into 4 mo ...


10
Malcolm Slaney: System and method for automatic classification of speech based upon affective content. Interval Research Corporation, Burns Doane Swecker & Mathis L, January 9, 2001: US06173260 (115 worldwide citation)

The classification of speech according to emotional content employs acoustic measures in addition to pitch as classification input. In one embodiment, two different kinds of features in a speech signal are analyzed for classification purposes. One set of features is based on pitch information that i ...