speech synthesis
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The breakthrough could be…
Respeecher’s ethics-first approach to AI voice cloning locks in new funding
Ukrainian synthetic voice startup Respeecher is finding success despite not just bombs raining down on their city, but a wave of hype that has raised up sometimes controversial competitors. A…
The growing ease with which anyone can create convincing audio in someone else’s voice has a lot of people on edge, and rightly so. Resemble AI’s proposal for watermarking generated…
Instagram adds TikTok-like Text-to-Speech and Voice Effects tools to Reels
Instagram added two features to Reels yesterday: text-to-speech and voice effects. These features are popular already on TikTok, but now, creators can use them on Instagram too. This marks yet…
The voices on Amazon’s Alexa, Google Assistant and others still lack the rhythms and intonation that make speech human. NVIDIA has unveiled new tools that can capture those natural speech…
Apple adds two brand new Siri voices and will no longer default to a female or male voice in iOS
Apple is adding two new voices to Siri’s English offerings, and eliminating the default “female voice” selection in the latest beta version of iOS. This means that every person setting…
WellSaid Labs research takes synthetic speech from seconds-long clips to hours
Millions of homes have voice-enabled devices, but when was the last time you heard a piece of synthesized speech longer than a handful of seconds? WellSaid Labs has pushed the…
Azure Cognitive Services learns more languages
It wouldn’t be a Microsoft Build without a bunch of new capabilities for Azure Cognitive Services, Microsoft’s cloud-based AI tools for developers. The first new feature is what Microsoft calls…
Thanks to modern machine learning techniques, text-to-speech engines have made massive strides over the last few years. It used to be incredibly easy to know that it was a computer…
Google today announced an update to its Cloud Speech-to-Text and Text-to-Speech APIs that introduces a few new features that should be especially interesting to enterprise users, as well as improved…
Google Cloud’s Text-to-Speech and Speech-to-Text APIs are getting a bunch of updates today that introduce support for more languages, make it easier to hear auto-generated voices on different speakers and…
Google’s Tacotron 2 simplifies the process of teaching an AI to speak
Creating convincing artificial speech is a hot pursuit right now, with Google arguably in the lead. The company may have leapt ahead again with the announcement today of Tacotron 2,…