Microsoft launches 3 new AI models to create images, voices and text in seconds

Tech giant Microsoft has taken a big step in artificial intelligence by launching three new AI models: MAI-Transcribe-1, MAI-Voice-1 and MAI-Image-2. The models handle tasks such as creating images, generating voices and converting speech to text at high speed. The company claims they outperform competing models from Google and OpenAI, and says they have been designed to be fast and economical to use.

Microsoft’s MAI-Transcribe-1 model is designed to convert speech to text, and the company claims it delivers highly accurate results in 25 major languages. According to Microsoft’s internal testing, it has a lower error rate on the FLEURS benchmark than models such as Gemini 3.1 Flash and GPT Transcribe. If those results hold up in everyday use, users should get more accurate and reliable transcription. The company also describes the model as offering a better balance of price and performance, which could make it an attractive option for developers and businesses.

The MAI-Voice-1 model is designed to generate natural, realistic voices. According to Microsoft, it brings specific improvements in emotion, expression and voice stability. The model can keep a voice consistent even across long content, which was previously a major challenge. Notably, users can create a custom voice from just a few seconds of audio. The technology will also be used in Copilot Audio Expressions and Copilot Podcasts, which should make content creation easier and more professional.

The MAI-Image-2 model is Microsoft’s new image generator. According to the company, it can create more realistic images with better lighting, accurate textures and clear text. It was developed in collaboration with photographers and designers so that it can meet professional needs. Large companies such as WPP have already started adopting it. The model is being rolled out in products including Copilot, Bing and PowerPoint, so users can benefit from it directly in their daily work.
