@simon @matt what’s a good way to reason about these efficiencies? Is there a normalized “token quality per watt” metric that one could look at, scored per-model?
=> More informations about this toot | View the thread | More toots from glyph@mastodon.social
=> View simon@simonwillison.net profile | View matt@toot.cafe profile
text/gemini
This content has been proxied by September (3851b).