OpenAI o3 | ToasterBotnet Shitpostblog

Better than humans on the ARC reasoning benchmark

o3 achieved a notable success on the reasoning benchmark “ARC-AGI”. In a “high-compute” configuration, o3 reached an accuracy of 87.5 percent, surpassing for the first time the human performance level of around 85 percent. This is an important step towards Artificial General Intelligence (AGI), but passing ARC-AGI does not mean that AGI has been achieved. In fact, o3 still fails at some very simple tasks, which points to fundamental differences from human intelligence, according to an ARC Prize article.


https://www.heise.de/en/news/OpenAI-s-new-o3-model-aims-to-outperform-humans-in-reasoning-benchmarks-10218087.html