• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Artificial Intelligence

Mistral Now Offers New Open-Source Model For Speech Generation

Akinola Ajibola by Akinola Ajibola
March 26, 2026
in Artificial Intelligence, Open source
Share on FacebookShare on Twitter
PARIS, FRANCE – JUNE 11: The logo of the French company Mistral AI, specializing in generative artificial intelligence is displayed during the 9th edition of the VivaTech show at Parc des Expositions Porte de Versailles on June 11, 2025 in Paris, France. VivaTech, the biggest tech show in Europe but also in a unique digital format, for 4 days of reconnection and relaunch thanks to innovation. The event brings together startups, CEOs, investors, tech leaders and all of the digital transformation players who are shaping the future of the Internet. The annual technology conference, also known as VivaTech, was founded in 2016 by Publicis Groupe and Groupe Les Echos and is dedicated to promoting innovation and startups. (Photo by Chesnot/Getty Images)

On Thursday, the French AI startup Mistral unveiled a new open-source text-to-speech model that may be utilised in enterprise use cases such as customer care or by voice AI assistants. Mistral is in direct rivalry with companies like ElevenLabs, Deepgram, and OpenAI thanks to the platform, which enables businesses to create speech assistants for sales and customer engagement.

Voxtral TTS is the first open-source text-to-speech (TTS) model from Mistral AI. The model was introduced as lightweight enough to operate locally on edge devices like laptops, smartphones, and smartwatches.

Nine languages are supported by the new model, known as Voxtral TTS: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.

“A speech model has been requested by our clients. Therefore, we developed a compact speech model that can be used on laptops, smartphones, smartwatches, and other edge devices. In a phone conversation with a press firm, Pierre Stock, vice president of science operations at Mistral AI, stated, “It offers state-of-the-art performance at a fraction of the cost of anything else on the market.”

According to Mistral, the new model can catch features such as minor accents, inflections, intonations, and anomalies in speech flow using a sample of less than five seconds. For use cases like dubbing or real-time translation, the model, which is based on Ministral 3B, can effortlessly convert between languages without losing the voice’s qualities. According to Stock, the company intended for the model to sound human rather than robotic.

The company claims that the model was designed for real-time performance. For a 10-second sample of 500 characters, its time-to-first-audio (TTFA), which measures when the model begins “speaking” after receiving input, is 90 ms. Additionally, the model can render a 10-second clip in about 1.6 seconds thanks to its real-time factor (RTF) of 6x.

These are what to expect with the introduction of the Voxtral TTS in terms of the key features.

  • Edge-friendly: 4 billion parameters, runs on just 3 GB RAM.
  • Low latency: 90 ms time-to-first-audio for real-time use.
  • Voice cloning: Adapts to any voice with under 5 seconds of audio.
  • Multilingual: Supports nine languages, including English, Spanish, Hindi, and Arabic.
  • Expressive: Delivers human-like speech with emotion and varied tone.

Mistral introduced two transcription models earlier this year, one for big batch processing and the other for low-latency real-time use cases. The company’s goal with the new speech model is probably to provide businesses a complete range of voice solutions.

“We intend to create an end-to-end platform that can manage multimodal input streams, such as text, audio, and images, as well as output. The primary advantage of that is that an end-to-end agentic system that allows audio as an input or output gives you a lot more information, according to Stock.

With regard to Mistral’s positioning, because its speech models are open source and customizable, businesses will be more likely to use them than their rivals.

Voxtral TTS completes a full suite of voice AI products, following Mistral’s recent release of speech-to-text (transcription) models, including Voxtral Realtime and Voxtral Mini Transcribe V2. The model can be deployed privately and on-device without relying on the cloud because it is made available with open weights under an Apache 2.0 license.

Related Posts:

  • 260325_TranscribeLaunch_Hero
    Cohere Releases Open-Source Transcription Model
  • tr_20241028-google-cloud-platform-the-smart-persons-guide
    Google Cloud Adds Chirp 3 Audio Generation to Vertex AI
  • mistral-ai
    Mistral AI Raises $830M In Debt For Paris-Area Data…
  • Audio_Models_wallpaper_16.9
    OpenAI Launches New Audio Models for Agentic Workflows
  • Grok_AI_15305e720f
    Oracle Cloud to Offer xAI's Grok 3 Model to…
  • PicsArt_09-20-09.50.59-1200x900
    FG Introduces Local Language AI Model
  • media_12153ee7e1793e302d7df9f27b4a1d9a2f00e8e33
    Adobe Launches Firefly AI Audio and Video Tools
  • OpenAI Unveils Enhanced ChatGPT With Voice Commands And Image Interaction
    OpenAI Unveils Enhanced ChatGPT With Voice Commands…

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: AImistralmistral aiopen sourcespeech generationtext-to-speechVoxtral TTS
Akinola Ajibola

Akinola Ajibola

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • NCC Tackles Rising Complaints As TELCOs Commits N2.5tn Into Network Upgrades May 14, 2026
  • KongTuke Hackers Exploits Microsoft Teams To Breach Companies May 14, 2026
  • OpenAI Confirms Hack Linked to TanStack Attack May 14, 2026
  • Apple Sides With Google in EU Fight Over Opening Android to AI Rivals May 14, 2026
  • OpenAI and Apple Partnership Frays as ChatGPT iPhone Deal Faces Legal Threat May 14, 2026
  • Cisco Plans Nearly 4,000 Job Cuts While Pivoting Spending Toward AI and Cybersecurity May 14, 2026
  • New Google Accounts May Start With 5GB Free Storage Unless You Add a Phone Number May 14, 2026
  • Claude AI Helps User Recover Forgotten Bitcoin Wallet Worth Nearly $400,000 After 11-Year Hunt May 14, 2026
  • X Rolls Out History Tabs For Bookmarks, Likes, Videos, & Articles May 14, 2026
  • Anthropic Debuts Claude for Small Business Featuring Pre-Built AI Workflows & Connectors May 14, 2026
  • Google Announces New OS Verification Tool To Fight Fake OS May 14, 2026
  • Google DeepMind Is Turning the Mouse Pointer into an AI Assistant May 14, 2026

Browse Archives

May 2026
MTWTFSS
 123
45678910
11121314151617
18192021222324
25262728293031
« Apr    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.