News
However, they’re nowhere near as good at solving math problems, which tend to involve logical reasoning—something that’s beyond the capabilities of most current AI systems.
They can answer questions only after mathematicians translate the questions into Lean, a computer programming language designed for solving math problems.
A team led by the chief scientist, Ilya Sutskever, made a breakthrough earlier this year that allowed the company to build a model that could solve math problems, a report said.
FrontierMath, a new benchmark from Epoch AI, challenges advanced AI systems with complex math problems, revealing how far AI still has to go before achieving true human-level reasoning.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results