beat · 71 stories
Anthropic's London event produced a striking data point: roughly half the room had shipped Claude-written code without reading it first. The company's new 'dreaming' feature is its answer to what that future actually requires.
IBM Research and Hugging Face published a benchmark. Their choices about what to measure and how will become the industry's default definition of success — whether they say so or not.
A new paper proposes the right unit for agentic energy measurement. The problem: the infrastructure to actually use it doesn't exist yet.
The Zhejiang-UCL team's open-source academic knowledge graph is MIT-licensed and live on GitHub. Its architecture is a genuine answer to a real problem in AI research agents. But 'automated scientific research' it is not.