In a new study, researchers show that the numerical precision of AI models is more important than previously thought. They conducted 465 training runs with language models that had between 3 and 16 ...
François Chollet, AI researcher and influential developer of the Keras AI framework, is leaving Google after a decade to start his own company. With more than two million users, his Keras framework ...
MIT researchers have found a way to significantly improve how AI language models solve problems using a technique called test-time training (TTT). The team has set a new record on a challenging AI ...
Anthropic has added a prompt optimizer and example management features to its AI development console. The prompt optimizer uses Claude to automatically refine existing prompts using prompt engineering ...
Google has released its own Gemini app for the iPhone. The free application gives users direct access to Google's AI chatbot via text, voice, or camera. As a new feature, the app offers 'Gemini Live' ...
At the conference, Baidu CEO Robin Li introduced I-RAG, a text-to-image system that aims to reduce inaccuracies in AI-generated images where the output doesn't match the text input or contains ...
In the MLPerf Training 4.1 benchmarks, the Nvidia Blackwell platform delivered 2.2 times more performance per GPU compared to Hopper in the LLM benchmark Llama 2 70B fine-tuning and 2 times more ...
Anthropic CEO Dario Amodei has confirmed that the company is working on a new version of its flagship AI model, Claude Opus, but he's staying tight-lipped about when it will actually drop. The update, ...
The Information reports that OpenAI's next major language model, codenamed "Orion," delivers much smaller performance gains than expected. The quality improvement between GPT-4 and Orion is notably ...
A new partnership between Anthropic, Amazon Web Services (AWS), and Palantir gives US intelligence agencies access to Anthropic's AI models - another step in the growing connection between AI ...
OpenAI will introduce an AI assistant called "Operator" in January that can perform computer tasks on its own, according to Bloomberg, citing two people familiar with the matter. The sources say ...
Leading AI companies are changing course. Instead of developing ever-larger language models, they are focusing on test-time compute, which uses more processing power during model execution rather than ...