nev's picture
Upload it-l12-residual-test-run-sweep-0-4.33E-05//cfg.json with huggingface_hub
c899d1d verified
raw
history blame contribute delete
196 Bytes
{"d_in": 2048, "dtype": "bfloat16", "expansion_factor": 16, "l1_coefficient": 4.333333333333334e-05, "train_batch_size": 2048, "dead_feature_window": 1000, "use_ghost_grads": true, "d_sae": 32768}