UC Berkeley researchers just released an open-source AI reasoning model that’s as good as ChatGPT’s $20/month version.
Some of the world’s most prominent AI models have been accused of cheating on industry-standard benchmarking systems.
LAS VEGAS, Jan. 11, 2025 /PRNewswire/ -- Think Academy debuted its Thinkpal tablet at CES 2025 and has won a TechRadar Pro ...
Microsoft enhances the capabilities of small language models (SLMs) with rStar-Math. The technique boosts the capabilities of ...
Star-Math has achieved remarkable benchmarks in mathematical reasoning, showcasing how small AI models can rival larger ...
Think Academy will officially introduce its newest education technology product at CES 2025, the Thinkpal tablet. Designed to ...
OpenAI’s newest, most performant model, announced in December, has passed the ARC-AGI test, purportedly outperforming most ...
Techopedia explores a simple, new AI jailbreak technique, as demonstrated by Unit 42, that can trick popular AI models into ...
Red teaming has become the go-to technique for iteratively testing AI models to simulate diverse, lethal, unpredictable attacks.
There's not enough human-generated data to keep AI models improving at the same rate. 2025 will put a new solution to the ...
OpenAI's o3 AI model recently achieved 85% on the ARC-AGI benchmark, similar to human-level performance. Though impressive, experts caution that it does not necessarily mean true human-level ...
Coming to the ARC-AGI (Abstract Reasoning Corpus - Artificial General Intelligence) benchmark, it features a series of ...