1
Tuomo J Raitio, Kishore Sunkeswari Prahallad, Alistair D Conkie, Ladan Golipour, David A Winarsky: Unit-selection text-to-speech synthesis based on predicted concatenation parameters. Apple, Dentons US, April 3, 2018: US09934775 (2 worldwide citation)

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units ...


2
Alistair D Conkie, Ladan Golipour: System and method for unified normalization in text-to-speech and automatic speech recognition. AT&T INTELLECTUAL PROPERTY I, February 5, 2019: US10199034

A system, method and computer-readable storage devices are for using a single set of normalization protocols and a single language lexica (or dictionary) for both TTS and ASR. The system receives input (which is either text to be converted to speech or ASR training text), then normalizes the input. ...