# The AI Benchmark Gap: 77% on Computing Tasks, 39% on Scientific Reasoning

- slug: the-ai-benchmark-gap-77-on-computing-tasks-39-on-scientific-reasoning
- date: 2026-04-16
- category: Artificial Intelligence

The Stanford HAI AI Index shows AI agents at 77.3% on real-world computing tasks, but the benchmark testing genuine scientific reasoning puts the same systems at 38.78% against an 83.5% PhD expert baseline. That is a roughly 45-point gap to the experts (83.5 − 38.78 = 44.72), and nobody is reporting it.

---