Phoenix-4
Phoenix-4 - Tavus' real-time human rendering model with emotional intelligence and active listening
Qwen3-TTS
Qwen3-TTS - Alibaba's multilingual text-to-speech model with voice cloning using just three seconds of reference audio
Tiny Aya
Tiny Aya - Cohere Labs' new open-source multilingual small model covering 70+ languages in just 3.35B parameters
Raven-1
Raven-1 - Tavus's real-time emotional perception model for AI conversations
Model Council
Model Council - Perplexity's new tool for querying and synthesizing outputs from multiple models into a single answer
Voxtral Transcribe 2
Voxtral Transcribe 2 - A new speech-to-text family for transcription across 13 languages, including an open-weights Realtime model for live transcription.
Project Genie
Project Genie - Google DeepMind's interactive world simulator powered by its Genie 3 model
Scribe v2
Scribe v2 - ElevenLabs' SOTA transcription model with top accuracy, multi-language support, keyterm prompting, and more