Ultra-fast inference API powered by Groq's custom LPU hardware, delivering the fastest token generation speeds available.
| Type | REST |
| Authentication | API Key |
| Rate Limits | 30 RPM free, higher on paid |
$0
usage-based
Looking for something different? Here are the top alternatives to Groq API:
Open-source vector database with built-in vectorization modules, hybrid search, and generative capabilities.
Chinese AI company offering large language models optimized for Chinese language understanding and generation tasks.
Fast inference and fine-tuning platform for open-source models with competitive pricing and OpenAI-compatible endpoints.
Google's text-to-speech API using WaveNet and Neural2 technology to produce natural-sounding synthetic speech.
Enterprise-focused NLP API for text generation, embeddings, reranking, and retrieval-augmented generation.
Google's multimodal AI API supporting text, image, audio, and video understanding natively.
Industry-leading AI voice synthesis API for creating natural-sounding speech with voice cloning and multilingual support.
Chinese AI startup offering the Kimi model with ultra-long context windows for document analysis and conversation.