Breaking the Thermometer
Claude Opus 4.6 just broke the METR graph. The benchmark designed to track progress towards the singularity can’t keep up.
A year of AI milestones in a month, and progress is only accelerating.