How to Connect Groq to an AI Agent
Auth setup
1. Sign up at console.groq.com. 2. Create API key in dashboard. 3. Use OpenAI SDK with base_url='https://api.groq.com/openai/v1'. 4. Or use curl with Authorization: Bearer {key}.
Key facts
| Base URL | https://api.groq.com/openai/v1/ |
| API version | v1 (OpenAI-compatible) |
| Auth | API key authentication. Generate key at console.groq.com. Pass as Bearer token in Authorization header. OpenAI SDK compatible — just change base_url. |
| Request body | application/json |
| Pagination | N/A — streaming via SSE for chat completions. |
| Rate limit | Free tier: 30 req/min, 14,400 req/day. Paid: higher limits. Model-specific token limits (e.g., Llama 3.1 8B: 6,000 tokens/min free). |
| Error format | JSON: {"error":{"message":"...","type":"...","code":"..."}} |
Key endpoints
| Method | Path | Description |
POST | /chat/completions | Chat completion (Llama, Mixtral, Gemma models) |
POST | /audio/transcriptions | Whisper speech-to-text |
GET | /models | List available models |
Quickstart
POST /openai/v1/chat/completions
Authorization: Bearer {api_key}
Content-Type: application/json
{"model":"llama-3.3-70b-versatile","messages":[{"role":"user","content":"Hello"}]}
Agent pitfalls & tips
- OpenAI SDK drop-in: just set base_url to https://api.groq.com/openai/v1.
- Fastest inference for open models — Llama 3, Mixtral, Gemma available.
- Use 'llama-3.3-70b-versatile' for best quality, 'llama-3.1-8b-instant' for speed.
- Streaming is recommended for chat — set stream:true.
- Rate limits are per-model. Check X-RateLimit-* headers in responses.
Source: curated by KanseiLink from official documentation (docs) and registry checks. Last reviewed: 2026-04-07. Specs change — verify against the official docs before production use.
Frequently Asked Questions
What is Groq's AEO score?
▼
Groq has an AEO score of 0.60 and is rated BBB (Basic agent connectivity available). AEO (Agent Engine Optimization) measures how well a SaaS service works with AI agents. Scores range from 0.00 to 1.00, with grades from AAA (best) to D (not agent-ready).
Is Groq AI-agent-ready?
▼
Groq is currently connectable for AI agent use. API access is available but no dedicated MCP server has been published yet. For detailed connection guides, auth setup, and known pitfalls, use the KanseiLink MCP tool.
How does Groq compare to other AI & ML services?
▼
In the AI & ML category, Groq is rated BBB. KanseiLink evaluates services based on MCP availability, API quality, documentation, auth-guide clarity, and integration recipe availability (methodology published). Visit the full rankings at kansei-link.com to see how Groq compares.
How can I integrate Groq with an AI agent?
▼
The fastest way to integrate Groq with an AI agent is through KanseiLink MCP. Install it with: npx @kansei-link/mcp-server — then use the search_services and get_service_detail tools to get the current auth setup, endpoints, rate limits, and agent-specific tips. This data is kept fresh from registry checks, curated official-doc guides, and agent reports.
How do I authenticate with Groq?
▼
API key authentication. Generate key at console.groq.com. Pass as Bearer token in Authorization header. OpenAI SDK compatible — just change base_url. Setup: 1. Sign up at console.groq.com. 2. Create API key in dashboard. 3. Use OpenAI SDK with base_url='https://api.groq.com/openai/v1'. 4. Or use curl with Authorization: Bearer {key}.
What are Groq's API rate limits?
▼
Free tier: 30 req/min, 14,400 req/day. Paid: higher limits. Model-specific token limits (e.g., Llama 3.1 8B: 6,000 tokens/min free).