1
Andreas Stolcke, Malcolm Slaney, Sree Harsha Yella: Neural network-based speech processing. Microsoft Technology Licensing, Alin Corie, Sandy Swain, Micky Minhas, April 26, 2016: US09324320 (3 worldwide citation)

Pairs of feature vectors are obtained that represent speech. Some pairs represent two samples of speech from the same speakers, and other pairs represent two samples of speech from different speakers. A neural network feeds each feature vector in a sample pair into a separate bottleneck layer, with ...


2
Elizabeth Shriberg, Luciana Ferrer, Andreas Stolcke, Martin Graciarena, Nicolas Scheffer: Method and apparatus for speaker-calibrated speaker detection. SRI INTERNATIONAL, Barnes & Thornburg, Thomas J McWilliams, Edward F Behm Jr, September 29, 2015: US09147401 (1 worldwide citation)

The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a ...


3
Gokhan Tur, Horacio E Franco, Elizabeth Shriberg, Gregory K Myers, William S Mark, Norman D Winarsky, Andreas Stolcke, Bart Peintner, Michael J Wolverton, Luciana Ferrer, Martin Graciarena, Neil Yorke Smith, Harry Bratt: Method and apparatus for tailoring the output of an intelligent automated assistant to a user. SRI INTERNATIONAL, Barnes & Thornburg, November 22, 2016: US09501743 (1 worldwide citation)

The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the ...


4
Gokhan Tur, Horacio E Franco, Elizabeth Shriberg, Gregory K Myers, William S Mark, Norman D Winarsky, Andreas Stolcke, Bart Peintner, Michael J Wolverton, Luciana Ferrer, Martin Graciarena, Harry Bratt, Neil Yorke Smith: Method and apparatus for tailoring the output of an intelligent automated assistant to a user. SRI INTERNATIONAL, Barnes & Thornburg, Thomas J McWilliams, Edward F Behm Jr, December 15, 2015: US09213558 (1 worldwide citation)

The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the ...


5
Kristin Precoda, Horacio Franco, Jing Zheng, Michael Frandsen, Victor Abrash, Murat Akbacak, Andreas Stolcke: Method and apparatus for adding new vocabulary to interactive translation and dialogue systems. SRI INTERNATIONAL, Barnes & Thornburg, February 21, 2017: US09576570

The present invention relates to a method and apparatus for adding new vocabulary to interactive translation and dialog systems. In one embodiment, a method for adding a new word to a vocabulary of an interactive dialog includes receiving an input signal that includes at least one word not currently ...


6
Michael Levit, Sarangarajan Parthasarathy, Andreas Stolcke: Language model optimization for in-domain application. Microsoft Technology Licensing, Shook Hardy & Bacon L, May 15, 2018: US09972311

Systems and methods are provided for optimizing language models for in-domain applications through an iterative, joint-modeling approach that expresses training material as alternative representations of higher-level tokens, such as named entities and carrier phrases. From a first language model, an ...


7
Michael Levit, Sarangarajan Parthasarathy, Andreas Stolcke, Shuangyu Chang: Token-level interpolation for class-based language models. Microsoft Technology Licensing, Shook Hardy & Bacon L, August 15, 2017: US09734826

Optimized language models are provided for in-domain applications through an iterative, joint-modeling approach that interpolates a language model (LM) from a number of component LMs according to interpolation weights optimized for a target domain. The component LMs may include class-based LMs, and ...


8
Andreas Stolcke, Geoffrey Zweig, Malcolm Slaney: Modification of visual content to facilitate improved speech recognition. Microsoft Technology Licensing, Alin Corie, Sandy Swain, Micky Minhas, February 28, 2017: US09583105

Technologies described herein relate to modifying visual content for presentment on a display to facilitate improving performance of an automatic speech recognition (ASR) system. The visual content is modified to move elements further away from one another, wherein the moved elements give rise to am ...


9
Elizabeth Shriberg, Luciana Ferrer, Andreas Stolcke, Martin Graciarena, Nicolas Scheffer: Method and apparatus for speaker-calibrated speaker detection. SRI INTERNATIONAL, Barnes & Thornburg, February 7, 2017: US09564134

The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a ...


10
Harry BRATT, Luciana Ferrer, Martin Graciarena, Sachin Kajarekar, Elizabeth Shriberg, Mustafa Sonmez, Andreas Stolcke, Gokhan Tur, Anand Venkataraman: Method and apparatus for speaker recognition. Patterson & Sheridan, Sri International, January 10, 2008: US20080010065-A1

A method and apparatus for speaker recognition is provided. One embodiment of a method for determining whether a given speech signal is produced by an alleged speaker, where a plurality of statistical models (including at least one support vector machine) have been produced for the alleged speaker b ...