• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Artificial Intelligence

Mistral Now Offers New Open-Source Model For Speech Generation

Akinola Ajibola by Akinola Ajibola
March 26, 2026
in Artificial Intelligence, Open source
Share on FacebookShare on Twitter
PARIS, FRANCE – JUNE 11: The logo of the French company Mistral AI, specializing in generative artificial intelligence is displayed during the 9th edition of the VivaTech show at Parc des Expositions Porte de Versailles on June 11, 2025 in Paris, France. VivaTech, the biggest tech show in Europe but also in a unique digital format, for 4 days of reconnection and relaunch thanks to innovation. The event brings together startups, CEOs, investors, tech leaders and all of the digital transformation players who are shaping the future of the Internet. The annual technology conference, also known as VivaTech, was founded in 2016 by Publicis Groupe and Groupe Les Echos and is dedicated to promoting innovation and startups. (Photo by Chesnot/Getty Images)

On Thursday, the French AI startup Mistral unveiled a new open-source text-to-speech model that may be utilised in enterprise use cases such as customer care or by voice AI assistants. Mistral is in direct rivalry with companies like ElevenLabs, Deepgram, and OpenAI thanks to the platform, which enables businesses to create speech assistants for sales and customer engagement.

Voxtral TTS is the first open-source text-to-speech (TTS) model from Mistral AI. The model was introduced as lightweight enough to operate locally on edge devices like laptops, smartphones, and smartwatches.

Nine languages are supported by the new model, known as Voxtral TTS: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.

“A speech model has been requested by our clients. Therefore, we developed a compact speech model that can be used on laptops, smartphones, smartwatches, and other edge devices. In a phone conversation with a press firm, Pierre Stock, vice president of science operations at Mistral AI, stated, “It offers state-of-the-art performance at a fraction of the cost of anything else on the market.”

According to Mistral, the new model can catch features such as minor accents, inflections, intonations, and anomalies in speech flow using a sample of less than five seconds. For use cases like dubbing or real-time translation, the model, which is based on Ministral 3B, can effortlessly convert between languages without losing the voice’s qualities. According to Stock, the company intended for the model to sound human rather than robotic.

The company claims that the model was designed for real-time performance. For a 10-second sample of 500 characters, its time-to-first-audio (TTFA), which measures when the model begins “speaking” after receiving input, is 90 ms. Additionally, the model can render a 10-second clip in about 1.6 seconds thanks to its real-time factor (RTF) of 6x.

These are what to expect with the introduction of the Voxtral TTS in terms of the key features.

  • Edge-friendly: 4 billion parameters, runs on just 3 GB RAM.
  • Low latency: 90 ms time-to-first-audio for real-time use.
  • Voice cloning: Adapts to any voice with under 5 seconds of audio.
  • Multilingual: Supports nine languages, including English, Spanish, Hindi, and Arabic.
  • Expressive: Delivers human-like speech with emotion and varied tone.

Mistral introduced two transcription models earlier this year, one for big batch processing and the other for low-latency real-time use cases. The company’s goal with the new speech model is probably to provide businesses a complete range of voice solutions.

“We intend to create an end-to-end platform that can manage multimodal input streams, such as text, audio, and images, as well as output. The primary advantage of that is that an end-to-end agentic system that allows audio as an input or output gives you a lot more information, according to Stock.

With regard to Mistral’s positioning, because its speech models are open source and customizable, businesses will be more likely to use them than their rivals.

Voxtral TTS completes a full suite of voice AI products, following Mistral’s recent release of speech-to-text (transcription) models, including Voxtral Realtime and Voxtral Mini Transcribe V2. The model can be deployed privately and on-device without relying on the cloud because it is made available with open weights under an Apache 2.0 license.

Related Posts:

  • 260325_TranscribeLaunch_Hero
    Cohere Releases Open-Source Transcription Model
  • tr_20241028-google-cloud-platform-the-smart-persons-guide
    Google Cloud Adds Chirp 3 Audio Generation to Vertex AI
  • Audio_Models_wallpaper_16.9
    OpenAI Launches New Audio Models for Agentic Workflows
  • Grok_AI_15305e720f
    Oracle Cloud to Offer xAI's Grok 3 Model to…
  • PicsArt_09-20-09.50.59-1200x900
    FG Introduces Local Language AI Model
  • media_12153ee7e1793e302d7df9f27b4a1d9a2f00e8e33
    Adobe Launches Firefly AI Audio and Video Tools
  • OpenAI Unveils Enhanced ChatGPT With Voice Commands And Image Interaction
    OpenAI Unveils Enhanced ChatGPT With Voice Commands…
  • openai logo
    OpenAI Released A Voice Cloning Model That Needs…

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: AImistralmistral aiopen sourcespeech generationtext-to-speechVoxtral TTS
Akinola Ajibola

Akinola Ajibola

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • WhatsApp Adds AI-Powered Reply Suggestions March 26, 2026
  • WhatsApp Adds Multiple Accounts For iOS March 26, 2026
  • Cohere Releases Open-Source Transcription Model March 26, 2026
  • Mistral Now Offers New Open-Source Model For Speech Generation March 26, 2026
  • EU Says Porn Sites Breached Online Safety Rules for Minors March 26, 2026
  • Nigerian Crypto Exchange Quidax Cuts Staff As It Shifts Deeper Into B2B March 26, 2026
  • Ecobarter Is Turning Trash Into Currency—and Building Nigeria’s Circular Economy March 26, 2026
  • Meta’s Legal Troubles Deepen as Landmark Ruling Challenges Big Tech Immunity March 26, 2026
  • Report: Apple Tests “Siri Everywhere” Ahead of WWDC March 25, 2026
  • GitHub Expands AI Security Detections Across More Languages March 25, 2026
  • Cloudflare Targets Faster AI Agents with Dynamic Workers March 25, 2026
  • Oracle and OpenAI Reject Microsoft Data Centre Offer March 25, 2026

Browse Archives

March 2026
MTWTFSS
 1
2345678
9101112131415
16171819202122
23242526272829
3031 
« Feb    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.