A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores at the expense of rivals. According to the authors, LM Arena allowed some industry-leading AI companies like Meta, OpenAI, […] from TechCrunch https://ift.tt/UMngmJb
Wednesday, April 30, 2025
Study accuses LM Arena of helping top AI labs game its benchmark
Subscribe to:
Post Comments (Atom)



No comments:
Post a Comment