Ancestors

Toot

Written by Dan Goodman on 2025-01-29 at 10:07

From what I've seen DeepSeek is more efficient though engineering tricks rather than better ML? Doesn't this mean it's not as disruptive as thought? Big players can just use the same tricks now and still have advantage from bigger compute?

=> More informations about this toot | More toots from neuralreckoning@neuromatch.social

Descendants

Written by Tatjana Scheffler on 2025-01-29 at 10:46

@neuralreckoning yes

=> More informations about this toot | More toots from tschfflr@fediscience.org

Written by Ehproque on 2025-01-29 at 10:47

@neuralreckoning my take (not my field so take with copious amounts of salt) is that the big players held a choke point which was "vast amounts of computing power which can only be available in a data center", so didn't really care about the magic formula being in the open.

Those tricks make training models possible for everyone who would otherwise have been forced to buy them (as-a-service) from them.

Also makes running queries a lot cheaper, which is good for consumers such as meta

=> More informations about this toot | More toots from ehproque@paquita.masto.host

Written by Ehproque on 2025-01-29 at 10:48

@neuralreckoning this last point is the one you point out which is correct (to my limited knowledge)

=> More informations about this toot | More toots from ehproque@paquita.masto.host

Written by ma𝕏pool on 2025-01-29 at 14:35

@neuralreckoning

Internal Google memo from 1.5 years ago is relevant:

Google “We Have No Moat, And Neither Does OpenAI” Leaked Internal Google Document Claims Open Source AI Will Outcompete Google and OpenAI https://semianalysis.com/2023/05/04/google-we-have-no-moat-and-neither/

=> More informations about this toot | More toots from maxpool@mathstodon.xyz

Proxy Information

Original URL: gemini://mastogem.picasoft.net/thread/113911088779799888
Status Code: Success (20)
Meta: text/gemini
Capsule Response Time: 282.483722 milliseconds
Gemini-to-HTML Time: 2.015604 milliseconds

This content has been proxied by September (3851b).