HaileyStorm's picture
Create README.md
90ad378 verified
|
raw
history blame
No virus
642 Bytes
metadata
license: mit

For an explanation of this project and the models trained for it, please see the Report.

The root folder contains scripts for dataset preprocessing. chess-mamba-vs-xformer contains the training scripts. Config files, used to set model configuration and training hyperameters, are in chess-mamba-vs-xformer/config. Model checkpoints are in chess-mamba-vs-xformer/out. The last checkpoint for completed models (e.g. Mamba and Transformer 50M) are .../anneal/anneal_complete.pt. chess-gpt-eval