Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
11
76
Ṁ253Ṁ1k
2027
66%
chance
1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).
If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.
There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.
Get Ṁ600 play money
Related questions
What will be true of OpenAI's next major LLM release (GPT-4.5 or GPT-5)?
Will OpenAI offer a higher-tier version of ChatGPT, priced above US$49, by 2025?
77% chance
Will "OpenAI" hit 50% of its previous all-time high search interest this week? (US Google Trends)
90% chance
Will there be a model that has a 75% win rate against the latest iteration of GPT-4 as of January 1st, 2025?
59% chance
Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?
71% chance
Will an AI model outperform 95% of Manifold users on accuracy before 2026?
67% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
60% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
39% chance
Will the "OpenAI hint at or claim to have AGI before 2025 end" market go above 60% before 2024 ends?
20% chance
Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
39% chance