Toot

Written by Steven Hugg on 2025-01-27 at 21:47

If you were wondering if assembly language optimization is still relevant, it was partially responsible for NVDA's stock price dropping by 17% today:

"DeepSeek actually programmed 20 of the 132 processing units on each H800 specifically to manage cross-chip communications. This is actually impossible to do in CUDA. DeepSeek engineers had to drop down to PTX, a low-level instruction set for Nvidia GPUs that is basically like assembly language."

=> More informations about this toot | View the thread | More toots from sehugg@infosec.exchange

Mentions

Tags

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113902516033990526
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
217.583305 milliseconds
Gemini-to-HTML Time
0.738125 milliseconds

This content has been proxied by September (3851b).