license: mit | |
I used PGX and MCTX to train AlphaZero on Kuhn Poker. It ran on a TPU v2-8(courtesy of the TPU Research Cloud Program) for ~3.5 days. | |
Code can be found [here](https://github.com/sr5434/MuZero). |
license: mit | |
I used PGX and MCTX to train AlphaZero on Kuhn Poker. It ran on a TPU v2-8(courtesy of the TPU Research Cloud Program) for ~3.5 days. | |
Code can be found [here](https://github.com/sr5434/MuZero). |