Digital Headlines

Latest Tech News At your Fingertips

Thursday, January 23, 2025

Even some of the best AI can’t beat this new benchmark

The nonprofit Center for AI Safety (CAIS) and Scale AI, a company that provides a number of data labeling and AI development services, have released a challenging new benchmark for frontier AI systems. The benchmark, called Humanity’s Last Exam, includes thousands of crowdsourced questions touching on subjects like mathematics, humanities, and the natural sciences. To make […]

© 2024 TechCrunch. All rights reserved. For personal use only.

from TechCrunch https://ift.tt/2MlIFzi

No comments:

Post a Comment