Toot

Written by pera on 2025-01-20 at 13:06

Oh wow what a shit show, people should start being more sceptical about all these LLM benchmarks:

"FrontierMath was funded by OpenAI"

https://www.lesswrong.com/posts/cu2E8wgmbdZbqeWqb/meemi-s-shortform

Goodhart's law never fails...

=> More informations about this toot | View the thread | More toots from trobador@mastodon.social

Mentions

Tags

Proxy Information

Original URL: gemini://mastogem.picasoft.net/toot/113860830722979049
Status Code: Success (20)
Meta: text/gemini
Capsule Response Time: 224.664213 milliseconds
Gemini-to-HTML Time: 0.301166 milliseconds

This content has been proxied by September (ba2dc).