beat · 78 stories
A visa policy built to fight 'censorship' is cutting off the empirical evidence base for AI governance — and the chill is already visible in three specific research domains.
Hugging Face quantified a quarter of GPU compute time as pure idle wait in continuous batching. The fix lives in CUDA streams, not silicon — and it comes with real tradeoffs that vary by workload.
Google has embedded SynthID watermarking and C2PA Content Credentials across Search, Chrome, and Gemini — positioning itself as both the dominant content generator and the dominant content authenticity authority. Whether that bet holds is the real story.
A new Amazon study finds that small models solving math via chain-of-thought are mostly copying the last number they see, not computing the answer. The finding matters for anyone using CoT faithfulness as an oversight signal.