Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?
25
205
Ṁ967Ṁ470
2030
71%
chance
1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation math-focused model achieves a score of 95% or higher on the MATH benchmark.
If the next generation of general models (e.g. GPT-4), code models (e.g. Codex), or any other models specialized for reasoning are released earlier than the math models and score 95% or higher, it will resolve this question to YES.
Benchmarking on a subset of MATH is acceptable.
Using tools(e.g. calculator) & code is allowed.
Get Ṁ600 play money
Related questions
Will an AI get gold on any International Math Olympiad by 2025?
20% chance
Will an AI win a Gold Medal on the International Math Olympiad by 2029?
65% chance
Will AIs be widely recognized as having developed a new, innovative, foundational mathematical theory before 2030?
33% chance
Will an AI win a Gold Medal on the International Math Olympiad by 2032?
72% chance
Will an AI be capable of achieving a perfect score on the Putnam exam before 2030?
33% chance
Will an AI model outperform 95% of Manifold users on accuracy before 2026?
67% chance
Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?
73% chance
Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
66% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
39% chance
Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
39% chance