FrontierMath

FrontierMath is a test bed for benchmarking[1] artificial intelligence systems on 14 bespoke,[2] previously unexamined mathematical problems[3] (none of which is on the scale of the Millennium Problems). It was established by the non-profit research organization Epoch AI in November 2024.[4] The first of these open problems to be solved, one ranked "moderately interesting", was in hypergraph theory: "A Constant-Factor Lower Bound For H (n)", solved by GPT-5.4.[5] The methodology was novel enough that it inspired memes.[6]

See also

  • Longest proof
  • Millennium problems

References