Groq

Free tier

Fast, low-cost AI inference powered by custom LPU silicon

Free tier available·All audiences·Powered by Groq (third-party models hosted on LPU infrastructure)·API available

Key strengths

Industry-leading inference speed via proprietary LPU chipOpenAI-compatible API for drop-in migrationSignificantly lower inference cost vs. GPU-based providersGlobal data center deployment for low-latency responsesFree API tier with access to top open models

Free tier + paid plans

San Jose, USA

Founded 2016

No ratings yet

Developer Setup

Authentication & Endpoint

Groq is OpenAI API-compatible. Point your existing OpenAI SDK to Groq's endpoint:

import os
import openai

client = openai.OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ.get("GROQ_API_KEY")
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.groq.com/openai/v1",
  apiKey: process.env.GROQ_API_KEY,
});

Key Details

API Base URL: https://api.groq.com/openai/v1
Auth: Bearer token via GROQ_API_KEY environment variable
SDKs: Works with the official OpenAI Python and JavaScript/TypeScript SDKs, or any OpenAI-compatible HTTP client
Free API Key: Available at no cost directly from groq.com/developers
Model Selection: Pass the model name in the model parameter (e.g., llama3-8b-8192, mixtral-8x7b-32768)

Supported Capabilities

Chat completions (streaming and non-streaming)
Multiple open & third-party model options
MoE (Mixture of Experts) and large model support
Global data center routing for low-latency inference