DeepSeek released a whole family of inference-scaling / "reasoning" models today, including distilled variants based on Llama and Qwen.
Here are my notes on the new models, plus how I ran DeepSeek-R1-Distill-Llama-8B on my Mac using Ollama and LLM.
https://simonwillison.net/2025/Jan/20/deepseek-r1/