Microsoft's comprehensive speech service offering text-to-speech, speech-to-text, translation, and speaker recognition.
| Type | REST/WEBSOCKET |
| Authentication | API Key / Azure AD |
| Rate Limits | 20 concurrent requests default, adjustable |
$0
usage-based
Looking for something different? Here are the top alternatives to Azure Speech:
Fast inference and fine-tuning platform for open-source models with competitive pricing and OpenAI-compatible endpoints.
Run open-source AI models in the cloud with a simple API. Supports thousands of community and official models.
Google's multimodal AI API supporting text, image, audio, and video understanding natively.
IBM's speech services offering speech-to-text and text-to-speech with customization and enterprise features.
Amazon's automatic speech recognition service for converting audio to text with custom vocabulary and medical transcription support.
AWS text-to-speech service offering lifelike speech synthesis with neural and standard voices in dozens of languages.
Google's speech-to-text API offering real-time transcription with support for 125+ languages and automatic punctuation.
Speech-to-text API backed by Rev's human transcription expertise, offering high-accuracy automatic transcription.