Toot

Written by pera on 2025-01-20 at 13:06

Oh wow, what a shit show. People should start being more sceptical about all these LLM benchmarks:

"FrontierMath was funded by OpenAI"

https://www.lesswrong.com/posts/cu2E8wgmbdZbqeWqb/meemi-s-shortform

Goodhart's law never fails...
