Toot

Written by Thomas E. Gladwin on 2025-01-01 at 19:19

Nice start to 2025 - finally bullied a toy reinforcement learning model I've been using to try stuff out into different behaviours for different parameters.

It only appears in the speed to converge, in the early learning stage. I was expecting to get a more qualitative difference where the model can just fail within certain parameter ranges but no luck there yet.

[#]actorcritic #AI

=> View attached media

=> More informations about this toot | View the thread | More toots from TEG@mastodon.online

Toot

Written by Thomas E. Gladwin on 2025-01-01 at 19:19

Mentions

Tags