“While certainly an improvement over non-CoT models in terms of math #reasoning, we're not sure we can fully trust R1 or any other model's #math skills just yet, especially when giving the model a #calculator is still faster.”
Great article on #TheRegister testing #R1.
#AI / #CoT / #DeepSeek 🧮 https://www.theregister.com/2025/01/26/deepseek_r1_ai_cot/?td=rt-3a
=> More information about this toot | View the thread | More toots from peterrenshaw@ioc.exchange