Groq
Free tierFast, low-cost AI inference powered by custom LPU silicon
Free tier available·All audiences·Powered by Groq (third-party models hosted on LPU infrastructure)·API available
Key strengths
Industry-leading inference speed via proprietary LPU chipOpenAI-compatible API for drop-in migrationSignificantly lower inference cost vs. GPU-based providersGlobal data center deployment for low-latency responsesFree API tier with access to top open models
Free tier + paid plans
San Jose, USA
Founded 2016
No ratings yet
Developer Setup
Authentication & Endpoint
Groq is OpenAI API-compatible. Point your existing OpenAI SDK to Groq's endpoint:
import os
import openai
client = openai.OpenAI(
base_url="https://api.groq.com/openai/v1",
api_key=os.environ.get("GROQ_API_KEY")
)
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.groq.com/openai/v1",
apiKey: process.env.GROQ_API_KEY,
});
Key Details
- API Base URL:
https://api.groq.com/openai/v1 - Auth: Bearer token via
GROQ_API_KEYenvironment variable - SDKs: Works with the official OpenAI Python and JavaScript/TypeScript SDKs, or any OpenAI-compatible HTTP client
- Free API Key: Available at no cost directly from groq.com/developers
- Model Selection: Pass the model name in the
modelparameter (e.g.,llama3-8b-8192,mixtral-8x7b-32768)
Supported Capabilities
- Chat completions (streaming and non-streaming)
- Multiple open & third-party model options
- MoE (Mixture of Experts) and large model support
- Global data center routing for low-latency inference
