Toot

Written by Thomas E. Gladwin on 2025-01-01 at 19:19

Nice start to 2025 - finally bullied a toy reinforcement learning model I've been using to try stuff out into different behaviours for different parameters.

The difference only shows up in the speed of convergence, in the early learning stage. I was expecting a more qualitative difference, where the model simply fails within certain parameter ranges, but no luck there yet.
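To make that kind of effect concrete, here is a minimal sketch. It is not the model from this toot, which isn't shown. It assumes a one-state, two-armed task with deterministic rewards, a softmax actor and a scalar critic; the function names, parameter values and the 0.9 threshold are all made up for illustration. In this toy setup the actor learning rate changes how many steps it takes to prefer the better arm, not whether it gets there.

```python
import math
import random


def softmax(prefs):
    """Convert action preferences into probabilities."""
    m = max(prefs)
    exps = [math.exp(h - m) for h in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def steps_to_converge(actor_lr, critic_lr=0.1, threshold=0.9,
                      max_steps=5000, seed=0):
    """Steps until P(better arm) exceeds `threshold`, or None if never.

    Hypothetical toy task: arm 0 always pays 1.0, arm 1 always pays 0.0.
    """
    rng = random.Random(seed)
    prefs = [0.0, 0.0]  # actor: preferences behind the softmax policy
    value = 0.0         # critic: value estimate of the single state
    for step in range(1, max_steps + 1):
        probs = softmax(prefs)
        action = 0 if rng.random() < probs[0] else 1
        reward = 1.0 if action == 0 else 0.0
        # One-state task, so the TD error has no next-state term.
        td_error = reward - value
        value += critic_lr * td_error
        # Policy-gradient step: d log pi(action) / d prefs[a].
        for a in range(2):
            grad = (1.0 if a == action else 0.0) - probs[a]
            prefs[a] += actor_lr * td_error * grad
        if softmax(prefs)[0] > threshold:
            return step
    return None


if __name__ == "__main__":
    for lr in (0.02, 1.0):  # assumed values, picked only for contrast
        print(f"actor_lr={lr}: {steps_to_converge(lr)} steps")
```

Both learning rates end up favouring the same arm here; only the step count differs. A qualitative failure would show up as `None`.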

#actorcritic #AI

