Microsoft's New AI Models Go Beyond Just Text ...
Mistral AI launches Voxtral TTS, an open-weight enterprise voice model that runs on a smartphone and challenges ElevenLabs in the fast-growing voice AI market.
OpenAI released its text-to-video artificial intelligence model, Sora, this week after the completion of its testing phase. The Microsoft-backed AI startup first teased the model in February and ...
Microsoft launches three in-house MAI models for transcription, voice and image generation through Foundry, hedging its ...
Mistral AI is expanding its Voxtral model family with its first text-to-speech model. The launch comes amid intensifying ...
Kokoro 82M is an 82-million-parameter text-to-speech model that beats many TTS APIs while running locally on CPUs, including Apple Silicon ...
Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
Google on Friday added a new, experimental “embedding” model for text, Gemini Embedding, to its Gemini developer API. Embedding models translate text inputs like words and phrases into numerical ...
GLM-5V-Turbo is Z.ai's first native multimodal agent foundation model, built for vision-based coding and agentic task ...