areeblog.com
MAI-1-preview and MAI-Voice-1: Microsoft’s First Homegrown Foundation Models
Microsoft trained MAI-1-preview across roughly 15,000 NVIDIA H100 GPUs and rolled out MAI-Voice-1, a speech engine that can synthesize about 60 seconds of audio in under one second on a single GPU, a clear signal that Microsoft is pushing for lower cost-per-inference at production scale. Microsoft has quietly shifted a major piece
Leggi l'articolo su areeblog.com