Nature, Published online: 28 January 2026; doi:10.1038/d41586-025-04098-x

Conventional benchmarks are becoming less effective at assessing AI performance, but a multi-disciplinary test has set AI systems a fresh challenge.


From Nature via this RSS feed