Dual head network like AlphaZero
b0n541 opened this issue · 3 comments
I would like to use KotlinDL to create an AlphaZero-like dual-head network for my game.
Unfortunately I haven't found any hints on how to accomplish that.
Is this already possible or would it be a new feature to be implemented?
Hi @b0n541, if you want to create an AlphaZero-like model, could you please share some notes about a Keras/PyTorch implementation of that model?
After that I can probably say more about whether it is possible.
I have found the following information:
https://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0
https://github.com/deepmind/open_spiel/blob/master/docs/alpha_zero.md#model
https://github.com/deepmind/open_spiel/blob/master/open_spiel/python/algorithms/alpha_zero/model.py
What we would need is a way to split the network after the first couple of shared layers into two stacks of subsequent layers, with dedicated outputs for the game-value prediction and the move prediction.
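To make the shape of the request concrete, here is a minimal PyTorch sketch of such a dual-head layout (not the actual OpenSpiel model; the board size, move count, and hidden sizes are placeholder assumptions):

```python
import torch
import torch.nn as nn

class DualHeadNet(nn.Module):
    """AlphaZero-style network: a shared trunk feeding two separate heads."""

    def __init__(self, board_size: int = 64, num_moves: int = 100, hidden: int = 128):
        super().__init__()
        # Shared trunk: the "first couple of layers" common to both heads.
        self.trunk = nn.Sequential(
            nn.Linear(board_size, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # Policy head: logits over candidate moves.
        self.policy_head = nn.Linear(hidden, num_moves)
        # Value head: scalar game-value estimate squashed into [-1, 1].
        self.value_head = nn.Sequential(nn.Linear(hidden, 1), nn.Tanh())

    def forward(self, x: torch.Tensor):
        shared = self.trunk(x)
        return self.policy_head(shared), self.value_head(shared)

net = DualHeadNet()
policy, value = net(torch.zeros(2, 64))  # batch of 2 flattened board states
print(policy.shape, value.shape)  # torch.Size([2, 100]) torch.Size([2, 1])
```

The key point is that both heads consume the same trunk output, so one forward pass yields both predictions and both losses can be summed for a single backward pass. The equivalent in KotlinDL would need a functional (non-sequential) model API that allows one layer's output to branch into two output layers.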
Do you mean reinforcement learning?
Can I implement reinforcement learning in Kotlin DL to play chess or any other 1v1 board game where the model will play against itself?