Phoenix-4

Phoenix-4 - Tavus's real-time human rendering model with emotional intelligence and active listening

Qwen3-TTS

Qwen3-TTS - Alibaba's multilingual text-to-speech model with voice cloning using just three seconds of reference audio

Tiny Aya

Tiny Aya - Cohere Labs' new open-source multilingual small model covering 70+ languages in just 3.35B parameters

Raven-1

Raven-1 - Tavus's real-time emotional perception model for AI conversations

Model Council

Model Council - Perplexity's new tool for querying and synthesizing outputs from multiple models into a single answer

Voxtral Transcribe 2

Voxtral Transcribe 2 - A new speech-to-text family for transcription across 13 languages, including an open-weights Realtime model for live transcription

Project Genie

Project Genie - Google DeepMind's interactive world simulator powered by its Genie 3 model

Scribe v2

Scribe v2 - ElevenLabs' state-of-the-art transcription model with top accuracy, multi-language support, keyterm prompting, and more