1
Chris Weider, Richard Kennewick, Mike Kennewick, Philippe Di Cristo, Robert A Kennewick, Samuel Menaker, Lynn Elise Armstrong: Mobile systems and methods of supporting natural language human-machine interactions. VoiceBox Technologies, Pillsbury Winthrop Shaw Pittman LL, May 24, 2011: US07949529 (402 worldwide citations)

A mobile system is provided that includes speech-based and non-speech-based interfaces for telematics applications. The mobile system identifies and uses context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for users that submit requests and/o ...


2
Pieter J Vermeulen, Robert E Savoie, Stephen Sutton, Forrest S Mozer: Method and apparatus of specifying and performing speech recognition operations. Sensory, Fountainhead Law Group P C, Chad R Walsh, May 18, 2010: US07720683 (372 worldwide citations)

A speech recognition technique is described that has the dual benefits of not requiring collection of recordings for training while using computational resources that are cost-compatible with consumer electronic products. Methods are described for improving the recognition accuracy of a recognizer b ...


3
Philippe Di Cristo, Min Ke, Robert A Kennewick, Lynn Elise Armstrong: Systems and methods for responding to natural language speech utterance. VoiceBox Technologies, Pillsbury Winthrop Shaw Pittman, December 29, 2009: US07640160 (235 worldwide citations)

Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, ...


4
Jay L Gainsboro: Automatic key word or phrase speech recognition for the corrections industry. Opus Telecom L L C, Ward & Olivo, May 16, 2000: US06064963 (221 worldwide citations)

The present invention comprises speaker-independent, continuous speech, multilingual, multi-dialect, Automatic Speech Recognition (ASR) technology. In particular, the present application integrates the ASR technology into call control technology such that it will identify key words in two ways. Firs ...


5
James Hendrickson, Debra Drylie Scott, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk: Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment. Vocollect, Additon Higgins & Pendleton P A, December 16, 2014: US08914290 (217 worldwide citations)

Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more envir ...


6
Avery Li Chun Wang, Christopher Jacques Penrose Barton, Dheeraj Shankar Mukherjee, Philip Inghelbrecht: Method and system for purchasing pre-recorded music. Landmark Digital Services, Woodcock Washburn, December 14, 2010: US07853664 (195 worldwide citations)

A method and system are described that allow users to identify (pre-recorded) sounds such as music, radio broadcasts, commercials, and other audio signals in almost any environment. The audio signal (or sound) must be a recording represented in a database of recordings. The service can quickly ident ...


7
Jonathan Foote: Method for automatic analysis of audio including music and speech. Fuji Xerox, Fliesler Dubb Meyer & Lovejoy, April 1, 2003: US06542869 (186 worldwide citations)

A method for determining points of change or novelty in an audio signal measures the self similarity of components of the audio signal. For each time window in an audio signal, a formula is used to determine a vector parameterization value. The self-similarity as well as cross-similarity between eac ...


8
Ian M Bennett: Method for processing speech signal features for streaming transport. Phoenix Solutions, J Nicholas Gross, May 20, 2008: US07376556 (175 worldwide citations)

Speech signal information is formatted, processed and transported in accordance with a format adapted for TCP/IP protocols used on the Internet and other communications networks. NULL characters are used for indicating the end of a voice segment. The method is useful for distributed speech recogniti ...


9
Marsal Gavalda, Moshe Wasserblat: System and method for improving the accuracy of audio searching. NICE Systems, Pearl Cohen Zedek Latzer, May 25, 2010: US07725318 (173 worldwide citations)

A system and method for improving the accuracy of audio searching using multiple models to process an audio file or stream to obtain search tracks. The search tracks are processed to locate at least one search term and generate multiple search results. The number of search results is equivalent to t ...


10
Michael J Knight, Jonathan Scott, Steven J Yurick, John Hancock: Audio content search engine. Sonic Foundry, Foley & Lardner, July 19, 2011: US07983915 (165 worldwide citations)

A method of generating an audio content index for use by a search engine includes determining a phoneme sequence based on recognized speech from an audio content time segment. The method also includes identifying k-phonemes which occur within the phoneme sequence. The identified k-phonemes are store ...