DeepL says its tech could be used for real-time translation with meeting tools like Zoom and Microsoft Teams ...
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
Open-source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody, with CER and MOS results.
Dubai-based Camb.AI focuses on speech synthesis and translation for media dubbing. Palabra, backed by Reddit co-founder ...
Neuphonic and Rapport (a division of Speech Graphics) today announced a partnership to deliver what they believe is among the first fully real-time, photorealistic digital human systems running ...
Voice AI models must handle multimodal speech, where one sentence can vary in emotion and emphasis, which raises compute needs.
From social media creators to marketers and podcasters, many professionals now rely on celebrity AI voice generator ...
Google LLC’s DeepMind artificial intelligence unit today rolled out a new text-to-speech model called Gemini 3.1 Flash TTS.
Speechmatics and thymia are combining medical-grade speech-to-text with clinical-grade voice biomarker intelligence to identify ...
Gaming veteran Manish Agarwal and Ishank Gupta's AI startup Humyn Labs announces a $20 million commitment to fund data ...
Explore how Indian voice AI startups are revolutionizing communication with multilingual agents, transforming industries and ...
Can AI restore your voice? A new study reveals a multiaxial strain sensor that converts silent muscle movements into ...