Toot

Written by Thomas E. Gladwin on 2025-01-01 at 19:19

Nice start to 2025 - finally bullied a toy reinforcement learning model I've been using to try stuff out into different behaviours for different parameters.

It only appears in the speed to converge, in the early learning stage. I was expecting to get a more qualitative difference where the model can just fail within certain parameter ranges but no luck there yet.

[#]actorcritic #AI

=> View attached media

=> More informations about this toot | View the thread | More toots from TEG@mastodon.online

Mentions

Tags

=> View actorcritic tag | View ai tag

Proxy Information
Original URL
gemini://mastogem.picasoft.net/toot/113754712006282799
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
223.418584 milliseconds
Gemini-to-HTML Time
0.838157 milliseconds

This content has been proxied by September (ba2dc).