site:the-decoder.com - Search News

Scaling laws for precision: AI researcher sees "perfect storm" for the end of scale

In a new study, researchers show that the numerical precision of AI models is more important than previously thought. They conducted 465 training runs with language models that had between 3 and 16 ...

the-decoder3d

AI researcher François Chollet leaves Google, Keras stays

François Chollet, AI researcher and influential developer of the Keras AI framework, is leaving Google after a decade to start his own company. With more than two million users, his Keras framework ...

the-decoder5d

MIT's test-time training makes AI language models better at solving hard puzzles

MIT researchers have found a way to significantly improve how AI language models solve problems using a technique called test-time training (TTT). The team has set a new record on a challenging AI ...

the-decoder3d

Anthropic adds Claude-powered prompt optimizer to its AI development console

Anthropic has added a prompt optimizer and example management features to its AI development console. The prompt optimizer uses Claude to automatically refine existing prompts using prompt engineering ...

the-decoder3d

Google's AI assistant ‘Gemini Live’ is now available for iPhone

Google has released its own Gemini app for the iPhone. The free application gives users direct access to Google's AI chatbot via text, voice, or camera. As a new feature, the app offers 'Gemini Live' ...

the-decoder5d

Baidu unveils LLM smart glasses and new image generator

At the conference, Baidu CEO Robin Li introduced I-RAG, a text-to-image system that aims to reduce inaccuracies in AI-generated images where the output doesn't match the text input or contains ...

the-decoder4d

Nvidia Blackwell doubles AI training performance in early benchmarks

In the MLPerf Training 4.1 benchmarks, the Nvidia Blackwell platform delivered 2.2 times more performance per GPU compared to Hopper in the LLM benchmark Llama 2 70B fine-tuning and 2 times more ...

the-decoder5d

Anthropic delays next-gen AI model Opus 3.5 with no new timeline

Anthropic CEO Dario Amodei has confirmed that the company is working on a new version of its flagship AI model, Claude Opus, but he's staying tight-lipped about when it will actually drop. The update, ...

the-decoder7d

OpenAI's new "Orion" model reportedly shows small gains over GPT-4

The Information reports that OpenAI's next major language model, codenamed "Orion," delivers much smaller performance gains than expected. The quality improvement between GPT-4 and Orion is notably ...

the-decoder10d

Anthropic and Palantir bring AI model Claude to US defense sector

A new partnership between Anthropic, Amazon Web Services (AWS), and Palantir gives US intelligence agencies access to Anthropic's AI models - another step in the growing connection between AI ...

the-decoder3d

OpenAI is reportedly planning AI agent "Operator" for January launch

OpenAI will introduce an AI assistant called "Operator" in January that can perform computer tasks on its own, according to Bloomberg, citing two people familiar with the matter. The sources say ...

the-decoder6d

OpenAI co-founder Sutskever predicts a new AI "age of discovery" as LLM scaling hits a wall

Leading AI companies are changing course. Instead of developing ever-larger language models, they are focusing on test-time compute, which uses more processing power during model execution rather than ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results