Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark? | Manifold

GPT-5 #OpenAI #Technical AI Timelines #Q*

2030

71% chance

Resolves YES if OpenAI's next-generation math-focused model scores 95% or higher on the MATH benchmark.
If a next-generation general model (e.g. GPT-4), code model (e.g. Codex), or any other model specialized for reasoning is released before the math model and scores 95% or higher, this question also resolves YES.
Benchmarking on a subset of MATH is acceptable.
Using tools (e.g. a calculator) and code is allowed.
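
As a rough illustration of the threshold check described above, here is a minimal sketch in Python. The `Evaluation` record and the `resolves_yes` function are hypothetical names made up for this example, not part of Manifold or any OpenAI tooling, and the sketch assumes the score is reported as a count of correct answers over the problems attempted (which may be a subset of MATH).

```python
from dataclasses import dataclass


@dataclass
class Evaluation:
    """One reported MATH result (hypothetical record for illustration only)."""
    model: str
    correct: int              # problems answered correctly
    total: int                # problems attempted; a subset of MATH is acceptable
    used_tools: bool = False  # calculator or code use is allowed by the criteria


def resolves_yes(evaluation: Evaluation) -> bool:
    """Return True when the reported score is 95% or higher.

    Integer arithmetic avoids floating-point rounding exactly at the threshold.
    Tool use is recorded but does not affect the outcome.
    """
    if evaluation.total <= 0:
        return False
    return evaluation.correct * 100 >= 95 * evaluation.total
```

For example, 475 correct out of a 500-problem subset is exactly 95% and would meet the bar under this sketch, while 474 out of 500 would not.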

Related in OpenAI

Will Apple announce a partnership with OpenAI regarding Siri during WWDC 2024?

74% chance (+12% 1d)

Will I find GPT-4o more helpful than Claude 3 Opus for doing web development tomorrow?

48% chance (+13% 1d)

Related in Technical AI Timelines

By the end of 2026, will we have transparency into any useful internal pattern within a Large Language Model whose semantics would have been unfamiliar to AI and cognitive science in 2006?

Will a prompt that enables GPT-4 to solve easy Sudoku puzzles be found? (2023)

Related in Q*

Does OpenAI's Q* 'breakthrough' represent a significant advance in AI capabilities?

Is OpenAI's Q* real?

More related questions

Will an AI get gold on any International Math Olympiad by 2025?

Will AIs be widely recognized as having developed a new, innovative, foundational mathematical theory before 2030?

Will an AI be capable of achieving a perfect score on the Putnam exam before 2030?

Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?

Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?

39% chance (-20% 1d)

Will an AI win a Gold Medal on the International Math Olympiad by 2029?

Will an AI win a Gold Medal on the International Math Olympiad by 2032?

Will an AI model outperform 95% of Manifold users on accuracy before 2026?

Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?

Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
