I think that reinforcement learning specifically in Go would improve its strength. Even with this, I wouldn't expect it to get particularly strong: this is a very inefficient way to make a Go AI. It might be worthwhile anyway, because LLMs do have one big advantage over current Go AIs: every move comes with a natural-language explanation, with even more detail in the chain of thought.
=> More information about this toot | View the thread | More toots from aws@mathstodon.xyz