Microsoft's computer vision service for image analysis, OCR, spatial analysis, and image captioning with Florence model.
| Type | REST |
| Authentication | API Key / Azure AD |
| Rate Limits | 20 TPS default, adjustable |
$0
usage-based
custom
Looking for something different? Here are the top alternatives to Azure Computer Vision:
AI training data platform with auto-annotation, model training, and deployment for computer vision workflows.
Open-source text-to-speech toolkit and API offering voice cloning with just a few seconds of audio reference.
OpenAI's speech recognition API based on the Whisper model, offering accurate transcription and translation across 57 languages.
Google Cloud's unified ML platform for building, deploying, and scaling AI models including Gemini and PaLM.
Purpose-built vector database API for similarity search and retrieval-augmented generation at production scale.
AWS text-to-speech service offering lifelike speech synthesis with neural and standard voices in dozens of languages.
High-speed inference platform optimized for serving open-source models with extremely low latency and high throughput.
Microsoft's comprehensive speech service offering text-to-speech, speech-to-text, translation, and speaker recognition.