tim-lawson
/

mlsae-pythia-70m-deduped-x1-k16

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

tim-lawson commited on Sep 9

Commit

e0392be

•

1 Parent(s): a34dc66

Update README.md

Files changed (1) hide show

README.md +13 -7

README.md CHANGED Viewed

@@ -3,15 +3,21 @@ language: en
 library_name: mlsae
 license: mit
 tags:
-- model_hub_mixin
-- pytorch_model_hub_mixin
 datasets:
-- monology/pile-uncopyrighted
 ---
-A Multi-Layer Sparse Autoencoder (MLSAE) trained on [EleutherAI/pythia-70m-deduped](https://huggingface.co/EleutherAI/pythia-70m-deduped) and [monology/pile-uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted), with an expansion factor of 1 and $k = 16$.
-For more details, see the [paper](https://arxiv.org/submit/5837813) and the [Weights & Biases project](https://wandb.ai/timlawson-/mlsae).
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
-- Library: <https://github.com/tim-lawson/mlsae>

 library_name: mlsae
 license: mit
 tags:
+  - model_hub_mixin
+  - pytorch_model_hub_mixin
 datasets:
+  - monology/pile-uncopyrighted
 ---
+# mlsae-pythia-70m-deduped-x1-k16
+A Multi-Layer Sparse Autoencoder (MLSAE) trained on the residual stream
+activation vectors from every layer of
+[EleutherAI/pythia-70m-deduped](https://huggingface.co/EleutherAI/pythia-70m-deduped)
+with an expansion factor of 1 and k = 16, over 1 billion tokens from
+[monology/pile-uncopyrighted](https://huggingface.co/datasets/monology/pile-uncopyrighted).
+For more details, see:
+- Paper: <https://arxiv.org/abs/2409.04185>,
+- GitHub repository: <https://github.com/tim-lawson/mlsae>
+- Weights & Biases project: <https://wandb.ai/timlawson-/mlsae>