yangmao.ai · daily-updated AI free tier database

Groq API Pricing and Free Trial

If you plan to integrate Groq into a product, the key questions are API credits, rate limits, pricing stability, and access options.

Quick verdict

  • Free tier: 6000 tokens/min (Llama 3.3 70B)
  • Free API trial: Free tier(永久免费)
  • Access: relay or proxy may be needed
  • Best for: chat / coding / reasoning

Models and limits

ModelContextLimitNotes
Llama 3.3 70B Versatile 128k 30 RPM / 6000 TPM World's fastest inference, 6000 tokens/min free, LPU chip accelerated
Llama 4 Scout 17B 128k 30 RPM / 6000 TPM Meta Llama 4 Scout, MoE architecture, free to use
Llama 4 Maverick 17B 128k 30 RPM / 6000 TPM Meta Llama 4 Maverick, MoE architecture, free to use
Mixtral 8x7B 32k 30 RPM / 5000 TPM MoE architecture, cost-effective
Gemma 2 9B 8k 30 RPM / 15000 TPM Google Gemma 2, ultra-fast small model
DeepSeek R1 Distill Llama 70B 128k 30 RPM / 6000 TPM DeepSeek R1 distilled, strong reasoning

Free API credits

Yes

Free tier(永久免费) · 30 RPM / 6000 TPM

Free API powered by custom LPU (Language Processing Unit) chip, 10x+ faster than GPU. API keys start with gsk_. OpenAI-compatible format. Free tier has rate limits but no total cap, very generous for personal development.

Access notes

Proxy may be needed

Requires proxy in China. API remains extremely fast even through proxy thanks to LPU chips. Use openllmapi.com as proxy.

If multiple provider keys become painful, compare official providers, API gateways, and alternatives.

Recommended path

1

Start from the main Groq profile and confirm whether it fits your task.

2

Create an API key after signup and use the Free tier(永久免费) first.

3

Run one or two real tasks before paying or switching providers.

4

If access is unstable, compare relay options or local alternatives.

Production decision checks

Production budget

Groq free credits are for validation. Before launch, record cost per 1K requests, retry-adjusted cost, and a monthly spend cap.

Compatibility check

If you already use the OpenAI SDK, verify Groq base URL, model names, streaming, tool calling, and JSON mode before migration.

Fallback rule

Keep one similar provider or OpenLLMAPI route ready so credit exhaustion, regional access issues, or model deprecations do not break users.

Source trail

Save this source snapshot plus the official console state so future quota, pricing, and rate-limit changes can be audited.

Related deals

Alternatives

Credit-change alerts

Get notified when Groq credits, pricing, or Mainland China access changes; compare official providers, API gateways, and alternatives before production.

Subscribe → Compare API gateways → Use OpenLLMAPI fallback →

Source snapshot

Generated from the yangmao.ai provider database plus public provider documentation. Free credits, prices, and rate limits can change; verify in the official console before production.

Data source
yangmao.ai provider tracker + official provider documentation review
Official source
https://groq.com
Last updated
2026-06-16
Free tier
6000 tokens/min (Llama 3.3 70B)
API credits
Free tier(永久免费)
Rate limit
30 RPM / 6000 TPM

FAQ

Does Groq have a free tier?

Yes. Current record: 6000 tokens/min (Llama 3.3 70B). Always confirm on the official site before relying on it.

Can I try the Groq API for free?

Yes. Current record: Free tier(永久免费), rate limit: 30 RPM / 6000 TPM.

Can I use Groq without a credit card?

You can usually try free product features first, but card requirements may change on signup.

What are good Groq alternatives?

Check the alternatives section below or open the Groq alternatives page.

🎁 Free Resource Pack

Get the Free AI Startup Toolkit

Free API credits list, AI business case studies, payment stack, risk checklist, and a monetization roadmap.

Get it free →
🐑 AI Assistant