Gemini 3.1 Pro Excels in Sycophancy & Hallucination Benchmark
A community user created a custom benchmark called HalBench to evaluate sycophancy and hallucination tendencies in AI models. The test covered four frontier models: Sonnet 4.6, Grok 4.3, GPT 5.4, and Gemini 3.1 Pro. Results show Gemini 3.1 Pro performing strongly across multiple metrics, providing valuable reference for developers selecting reliable models.
Should you claim it?
Worth checking, but confirm region, account, and payment requirements first.
Did you claim it? Help us verify:
Success rate: — · 0 votes
Get an email when credits, deadlines, or requirements change.
How to claim
- Open the official page or signup link for Gemini (Google).
- Requirement: Visit the Reddit post for detailed benchmark results and model comparisons
- Run one real task to confirm the credits work.
- If the deal expires or does not work, use the alternatives below.
Credits and limits
A community-built HalBench benchmark shows Gemini 3.1 Pro performing well in sycophancy and hallucination tests, compared against frontier models like Sonnet 4.6, Grok 4.3, and GPT 5.4.
Requirements
- Visit the Reddit post for detailed benchmark results and model comparisons
Alternatives if unavailable
Related deals
FAQ
Is Gemini 3.1 Pro Benchmark still available?
Current status: Ongoing. Always confirm on the official signup page.
What do I need to claim Gemini 3.1 Pro Excels in Sycophancy & Hallucination Benchmark?
Visit the Reddit post for detailed benchmark results and model comparisons
Can I access Gemini 3.1 Pro Excels in Sycophancy & Hallucination Benchmark from mainland China?
Current data says it is accessible or relatively friendly from mainland China.