Toot

Written by Andrew W Swan on 2025-01-29 at 09:35

I think that reinforcement learning specifically on Go would improve its strength. Even so, I wouldn't expect it to get particularly strong; this is a very inefficient way to make a Go AI. It might be worthwhile anyway, because LLMs do have one big advantage over current Go AIs: every move comes with a natural-language explanation, with even more detail in the chain of thought.

=> More information about this toot | View the thread | More toots from aws@mathstodon.xyz
