HY-World 2.0
HY-World 2.0 - Tencent's open-source world model that turns text, images, or video into interactive 3D scenes
Gemini 3.1 Flash TTS
Gemini 3.1 Flash TTS - Google's new speech model with inline tags for voice direction across 70+ languages
Lyra 2.0
Lyra 2.0 - NVIDIA's new model that turns text and camera paths into explorable 3D scenes
Harrier
Harrier - Microsoft Bing’s SOTA, open-source embedding model for search and RAG grounding
Google Edge Eloquent
Google Edge Eloquent - Google AI Edge Eloquent — Free voice dictation app that turns messy speech into polished text, runs fully offline, no subscription, no usage caps
MAI-Transcribe-1
MAI-Transcribe-1 - Microsoft's speech-to-text model with best-in-class accuracy across 25 languages
Critique
Critique - Microsoft's multi-model deep research tool that pits AI models against each other
AI CMO
AI CMO - Okara's agent-powered marketing suite that deploys SEO, social, and content agents