Toot

Written by Thomas E. Gladwin on 2025-01-01 at 19:19

Nice start to 2025 - I finally bullied a toy reinforcement learning model, which I've been using to try stuff out, into showing different behaviours for different parameters.

The difference only shows up in the speed of convergence, in the early learning stage. I was expecting a more qualitative difference, where the model just fails within certain parameter ranges, but no luck there yet.

#actorcritic #AI

Proxy Information
Original URL
gemini://mastogem.picasoft.net/thread/113754712006282799