LLMs generate vibes … but how do we evaluate vibes? Well if you had to name a pelican which model do you like better - A or B? Tools like LMSYS chatbot arena help us make sense of which one has the best vibes.
(Btw I am team Scooper. Like come on ever see a pelican try to eat a capybara?!)
[#]PyConUS
PS tryout pip install llm
a #Python tool for tinkering with LLMs
=> View attached media | View attached media | View attached media | View attached media
=> More informations about this toot | More toots from Lorenanicole@mastodon.social
text/gemini
This content has been proxied by September (ba2dc).