Groq
Groq provides ultra-fast inference for open-source language models including Llama, Mixtral, and Gemma models. Braintrust integrates seamlessly with Groq through direct API access, wrapper functions for automatic tracing, and proxy support.
Setup
To use Groq models, configure your Groq API key in Braintrust.
- Get a Groq API key from Groq Console
- Add the Groq API key to your organization's AI providers
- Set the Groq API key and your Braintrust API key as environment variables
API keys are encrypted using 256-bit AES-GCM encryption and are not stored or logged by Braintrust.
Use Groq with Braintrust AI proxy
The Braintrust AI Proxy allows you to access Groq models through a unified OpenAI-compatible interface.
Install the braintrust
and openai
packages.
pnpm add braintrust openai
Then, initialize the client and make a request to a Groq model via the Braintrust AI Proxy.
Trace logs with Groq
Trace your Groq LLM calls for observability and monitoring.
When using the Braintrust AI Proxy, API calls are automatically logged to the specified project.
The Braintrust AI Proxy is not required to trace Groq API calls. For more control, learn how to customize traces.
Evaluate with Groq
Evaluations distill the non-deterministic outputs of Groq models into an effective feedback loop that enables you to ship more reliable, higher quality products. Braintrust Eval
is a simple function composed of a dataset of user inputs, a task, and a set of scorers. To learn more about evaluations, see the Experiments guide.
To learn more about tool use, multimodal support, attachments, and masking sensitive data with Groq, visit the customize traces guide.
Models and capabilities
Model | Multimodal | Reasoning | Max input | Max output | Input $/1M | Output $/1M |
---|---|---|---|---|---|---|
openai/gpt-oss-120b | ﹣ | ﹣ | $0.15 | $0.60 | ||
openai/gpt-oss-20b | ﹣ | ﹣ | $0.10 | $0.50 | ||
llama-3.3-70b-versatile | ﹣ | ﹣ | $0.59 | $0.79 | ||
llama-3.1-8b-instant | ﹣ | ﹣ | $0.05 | $0.08 | ||
llama3-70b-8192 | ﹣ | ﹣ | $0.64 | $0.80 | ||
llama3-8b-8192 | ﹣ | ﹣ | $0.10 | $0.10 | ||
llama-guard-3-8b | ﹣ | ﹣ | $0.20 | $0.20 | ||
gemma2-9b-it | ﹣ | ﹣ | $0.20 | $0.20 | ||
meta-llama/llama-4-maverick-17b-128e-instruct | ﹣ | ﹣ | $0.50 | $0.77 | ||
meta-llama/llama-4-scout-17b-16e-instruct | ﹣ | ﹣ | $0.11 | $0.34 | ||
llama-3.3-70b-specdec | ﹣ | ﹣ | $0.59 | $0.99 | ||
llama-3.2-90b-vision-preview | ﹣ | ﹣ | $0.90 | $0.90 | ||
llama-3.2-11b-vision-preview | ﹣ | ﹣ | $0.18 | $0.18 | ||
llama-3.2-3b-preview | ﹣ | ﹣ | $0.06 | $0.06 | ||
llama-3.2-1b-preview | ﹣ | ﹣ | $0.04 | $0.04 | ||
mistral-saba-24b | ﹣ | ﹣ | $0.79 | $0.79 | ||
deepseek-r1-distill-llama-70b | ﹣ | ﹣ | $0.23 | $0.69 | ||
deepseek-r1-distill-llama-70b-specdec | ﹣ | ﹣ | $0.23 | $0.69 | ||
deepseek-r1-distill-qwen-32b | ﹣ | ﹣ | $0.69 | $0.69 | ||
qwen-2.5-32b | ﹣ | ﹣ | $0.79 | $0.79 | ||
qwen-2.5-coder-32b | ﹣ | ﹣ | $0.79 | $0.79 | ||
qwen-qwq-32b | ﹣ | ﹣ | $0.29 | $0.39 | ||
gemma-7b-it | ﹣ | ﹣ | $0.10 | $0.10 | ||
llama-3.1-70b-versatile | ﹣ | ﹣ | ﹣ | ﹣ | ||
llama-3.1-405b-reasoning | ﹣ | ﹣ | ﹣ | ﹣ | ||
llama2-70b-4096 | ﹣ | ﹣ | $0.70 | $0.80 | ||
mixtral-8x7b-32768 | ﹣ | ﹣ | $0.27 | $0.27 |