Hi-TechTechnology

Microsoft Launches MAI Transcribe AI Model

Microsoft has introduced a new AI model called MAI-Transcribe-1. This is the company’s third in-house developed AI model, and it claims to be the most accurate speech-to-text system in the world.

The model has a very low error rate, with an average Word Error Rate of just 3.9%. This means it can convert spoken words into text with high accuracy. It supports 25 languages, including English, French, German, Spanish, Hindi, Chinese, Arabic, and many more, making it useful for users around the world.

Microsoft said that MAI-Transcribe-1 has performed very well in industry tests. It ranked first in the FLUERS benchmark for 11 main languages. In the remaining 14 languages, it performed better than other popular models like Whisper large v3. It also outperformed Google Gemini 3.1 Flash in 11 out of 14 languages tested.

The new model is available through Microsoft Foundry, making it easier for developers and companies to use it in their applications. Microsoft also highlighted its speed and cost benefits. The batch transcription feature is about 2.5 times faster than its earlier Azure Fast service.

In terms of pricing, MAI-Transcribe-1 is available at a cost of $0.36 per hour, making it a cost-effective option for businesses and developers who need fast and accurate transcription services.

Overall, this new AI model shows Microsoft’s continued progress in artificial intelligence, especially in speech recognition technology, and aims to provide better and faster solutions for users worldwide.