An organization developing math benchmarks for AI didn't disclose that it had received funding from OpenAI until relatively ...
OpenAI secretly funded and had access to a benchmarking dataset, raising questions about high scores achieved by its new o3 ...
The creators of a new test called “Humanity’s Last Exam” argue we may soon lose the ability to create tests hard enough for A ...
Contributors criticize FrontierMath’s undisclosed OpenAI funding, highlighting secrecy, transparency issues, and ethical ...
OpenAI just pulled a Theranos with o3 by claiming record-breaking performance on the FrontierMath benchmark while having ...
Epoch AI, a nonprofit primarily funded by Open Philanthropy, a research and grantmaking foundation, revealed on December 20 that OpenAI had supported the creation of FrontierMath. FrontierMath ...
FrontierMath is part of a larger effort to rethink how we measure intelligence. As machines get smarter, benchmarks must grow ...