speech synthesis

Researchers at Amazon have trained the largest text-to-speech model yet, which they claim exhibits “emergent” qualities that improve its ability to speak even complex sentences naturally. The breakthrough could be…

Largest text-to-speech AI model yet shows ‘emergent abilities’

Ukrainian synthetic voice startup Respeecher is finding success despite not only bombs raining down on its city, but also a wave of hype that has lifted sometimes controversial competitors. A…

Respeecher’s ethics-first approach to AI voice cloning locks in new funding

The growing ease with which anyone can create convincing audio in someone else’s voice has a lot of people on edge, and rightly so. Resemble AI’s proposal for watermarking generated…

‘Inaudible’ watermark could identify AI-generated voices

Instagram added two features to Reels yesterday: text-to-speech and voice effects. These features are popular already on TikTok, but now, creators can use them on Instagram too. This marks yet…

Instagram adds TikTok-like Text-to-Speech and Voice Effects tools to Reels

The voices on Amazon’s Alexa, Google Assistant and others still lack the rhythms and intonation that make speech human. NVIDIA has unveiled new tools that can capture those natural speech…

NVIDIA’s latest tech makes AI voices more expressive and realistic

Apple is adding two new voices to Siri’s English offerings, and eliminating the default “female voice” selection in the latest beta version of iOS. This means that every person setting…

Apple adds two brand new Siri voices and will no longer default to a female or male voice in iOS

Millions of homes have voice-enabled devices, but when was the last time you heard a piece of synthesized speech longer than a handful of seconds? WellSaid Labs has pushed the…

WellSaid Labs research takes synthetic speech from seconds-long clips to hours

It wouldn’t be a Microsoft Build without a bunch of new capabilities for Azure Cognitive Services, Microsoft’s cloud-based AI tools for developers. The first new feature is what Microsoft calls…

Azure Cognitive Services learns more languages

Thanks to modern machine learning techniques, text-to-speech engines have made massive strides over the last few years. It used to be incredibly easy to know that it was a computer…

AWS’ new text-to-speech engine sounds like a newscaster

Google today announced an update to its Cloud Speech-to-Text and Text-to-Speech APIs that introduces a few new features that should be especially interesting to enterprise users, as well as improved…

Google Cloud’s speech APIs get cheaper and learn new languages

Google Cloud’s Text-to-Speech and Speech-to-Text APIs are getting a bunch of updates today that introduce support for more languages, make it easier to hear auto-generated voices on different speakers and…

Google updates its speech services for developers

Creating convincing artificial speech is a hot pursuit right now, with Google arguably in the lead. The company may have leapt ahead again with the announcement today of Tacotron 2,…

Google’s Tacotron 2 simplifies the process of teaching an AI to speak