Toot

Written by Frederic Jacobs on 2025-01-28 at 10:31

@a @gimulnautti They are reproducing the -R1 variant. But the training of the DeepSeek V3 Base Model is not reproducible.

It's a bit deceptive to say that “deepseek" is being reproduced.

=> More informations about this toot | View the thread | More toots from fj@mastodon.social

Mentions

=> View a@paperbay.org profile | View gimulnautti@mastodon.green profile

Tags

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113905517428175420
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
226.875485 milliseconds
Gemini-to-HTML Time
0.582041 milliseconds

This content has been proxied by September (3851b).