The IMO Small Challenge: First IMO Dataset for LLMs

Lukasiewicz, Thomas; Simon, Frieder

The IMO Small Challenge: First IMO Dataset for LLMs

Simon Frieder‚ Mirek Olšák‚ Julius Berner and Thomas Lukasiewicz

Abstract

We introduce the IMO Small Challenge: A curated collection of the easiest possible IMO problems and other competitive mathematical problems. The goal is to bridge the existing gap in the range of available dataset difficulties in terms of testing problem-solving skills: Currently, datasets are predominantly either too easy (MATH or GSM8K), excessively challenging (solving arbitrary IMO problems, such as required by the IMO Grand Challenge, and embodied by the miniF2F dataset) or focus too little on problem-solving (GHOSTS). Our challenge interpolates this difficulty range and serves as a test bench for next-generation language models. We release a preliminary version of a dataset that accompanies this challenge. It is grounded in natural language, and problems are annotated with solutions and other metadata, such as the type of proof strategy used, in order to facilitate semi-automatic evaluation of LLMs' outputs beyond classical correct-incorrect keyword matching.

Book Title

Proceedings of the 12th International Conference on Learning Representations‚ ICLR 2024‚ Tiny Papers Track‚ Vienna‚ Austria‚ 7–11 May 2024

Month

May

Year

2024

The IMO Small Challenge: First IMO Dataset for LLMs

Abstract

Links

See Also