metadata
license: mit
I trained an 11M parameter Mamba LM to play chess, starting from code&data by @a_karvonen. After seeing 18.8M games, it has 37.7% win rate vs Stockfish level 0 - not apples^2, but 25M parameter was <20% after 20M games.
license: mit
I trained an 11M parameter Mamba LM to play chess, starting from code&data by @a_karvonen. After seeing 18.8M games, it has 37.7% win rate vs Stockfish level 0 - not apples^2, but 25M parameter was <20% after 20M games.