Innovation at Nuance
Explore stories from our R&D and engineering teams, as they explain how to solve real world customer problems by pushing the boundaries of technology.
Innovation at Nuance’s stories
Prediction Network Architecture in RNN-T for ASR

About the author: Dario Albesano is a Principal Research Scientist at Nuance Communications. He received his master degree in Computer Science and Artificial Intelligence from Università di Torino – Italy in 1990. His research interests include neural networks, speech recognition, artificial intelligence, acoustics and music reproduction. Acknowledgements: Jesus Andrés-Ferrer contributed equally to this work; Nicola […]

Training automatic speech recognition models with de-identified data

About the author: Martin Flechl is a research scientist at Nuance. His current research interests include speech recognition and text-to-speech. He received his PhD in high-energy physics from Uppsala University, Sweden, in 2010. Prior to joining Nuance in 2019, he worked on machine learning algorithms to analyse data from the LHC particle collider at CERN […]

Improved far field speech recognition with microphone arrays

About the author: Dushyant Sharma is a Principal Research Scientist, working as part of the Central Research and Development organization at Nuance on front end signal processing and acoustic data augmentation. His research interests include non-intrusive speech signal analysis, single and multi-channel speech enhancement and acoustic data augmentation and simulation. Dushyant joined Nuance in 2012 […]

Data augmentation for real-world multi-channel speech recognition

About the author: Marco Gaudesi is a Senior Research Scientist at Nuance Communications. He received his PhD degree in computer and control engineering from Politecnico di Torino, Italy, in 2015. His research interests include deep learning, speech recognition, and artificial intelligence. Felix Weninger, Dushyant Sharma, and Puming Zhan also contributed to the paper and this […]

Combining the advantages of close-talk and far-talk speech recognition

About the Author: Felix Weninger is a senior tech lead (Senior Principal Research Scientist) at Nuance Communications. His research interests include deep learning, speech recognition, speech emotion recognition, and source separation. He received his PhD degree in computer science from Technical University of Munich (TUM), Germany, in 2015. Prior to joining Nuance, he worked at […]

Improving named entity speech recognition accuracy

About the author: Jesus Andres Ferrer is a Senior Principal Research Scientist at Nuance. Acknowledgments: Dario Albesano contributed equally to this research. Paul Vozila and Puming Zhan contributed to the paper and this blog post. The paper will be presented in September at the Interspeech 2021 conference. Speech is the most natural way for us […]

Improving automatic speech recognition from distant microphones using self-attention

About the author: Rong Gong is a senior research scientist in Nuance Communications Austria. He works on the Audio Video Processing (AVP) team, developing multichannel far-field speech recognition technologies. His main research interests are speech enhancement and far-field speech recognition. In Dragon Ambient eXperience (DAX) research, we utilize a microphone array device to capture far-field […]

Delivering non-intrusive signal intelligence with deep learning

About the author: Dushyant Sharma is a Principal Research Scientist, working as part of the Central Research and Development organization at Nuance on front end signal processing and acoustic data augmentation. His research interests include non-intrusive speech signal analysis, single and multi-channel speech enhancement and acoustic data augmentation and simulation. Dushyant joined Nuance in 2012 […]

Reducing the human labeling effort for training end-to-end speech recognition

About the Author: Felix Weninger is a senior tech lead (Senior Principal Research Scientist) at Nuance Communications. His research interests include deep learning, speech recognition, speech emotion recognition, and source separation. He received his PhD degree in computer science from Technical University of Munich (TUM), Germany, in 2015. Prior to joining Nuance, he worked at […]

Making speech recognizers more robust in the wild

About the author: Franco Mana is a Principal Research Scientist in Nuance’s Central Research department. Franco’s interest in neural networks started in 1990, and since then, he has contributed to various developmental stages of neural network technologies as applied to speech recognition systems. These contributions include adopting graphic cards (GPU) for neural network acceleration and […]

Delivering personalized user experiences with speaker adapted end-to-end speech recognition

About the Author: Felix Weninger is a senior tech lead (Senior Principal Research Scientist) at Nuance Communications. His research interests include deep learning, speech recognition, speech emotion recognition, and source separation. He received his PhD degree in computer science from Technical University of Munich (TUM), Germany, in 2015. Prior to joining Nuance, he worked at […]

DeepAAA: Detecting Abdominal Aortic Aneurysms with deep learning

About the author: Rupert Brooks is a Senior Research Scientist in Nuance Global Research and Development focused on problems relating to diagnostic imaging that affect the longer term development of Nuance’s AI marketplace.  His current research interests include deep learning models for medical imaging, federated learning and privacy preserving approaches to machine learning. He has […]