Juliushanhanhan
/

llama-3-8b-it-res

Model card Files Files and versions Community

Juliushanhanhan commited on Aug 16

Commit

868cb67

•

1 Parent(s): 53425c3

Update README.md

Files changed (1) hide show

README.md +7 -17

README.md CHANGED Viewed

@@ -31,22 +31,12 @@ sae, cfg_dict, sparsity = SAE.from_pretrained("Juliushanhanhan/llama-3-8b-it-res
 ## Citation
 ```
-@misc{saelens2024llama38b,
-  author = {SAELens, Jiatong Han},
-  title = {Llama-3-8B SAEs (layer 25, Post-MLP Residual Stream)},
-  year = {2024},
-  publisher = {HuggingFace},
-  url = {https://huggingface.co/Juliushanhanhan/llama-3-8b-it-res},
-  note = {Model trained on the post-MLP residual stream of the 25th layer of Llama-3-8B. Feature visualizations are available at \url{https://www.neuronpedia.org/llama3-8b-it}. The wandb run is recorded at \url{https://wandb.ai/jiatongg/sae_semantic_entropy/runs/ruuu0izg?nw=nwuserjiatongg}.},
 }
-@misc{juliushanhanhan2024openwebtext,
-  author = {Juliushanhanhan},
-  title = {OpenWebText-1B Llama3 Tokenized CXT 1024},
-  year = {2024},
-  publisher = {HuggingFace},
-  url = {https://huggingface.co/datasets/Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024},
-  note = {Dataset used for training the Llama-3-8B SAEs.},
-}
 ```

 ## Citation
 ```
+@misc {jiatong_han_2024,
+	author       = { {Jiatong Han} },
+	title        = { llama-3-8b-it-res (Revision 53425c3) },
+	year         = 2024,
+	url          = { https://huggingface.co/Juliushanhanhan/llama-3-8b-it-res },
+	doi          = { 10.57967/hf/2889 },
+	publisher    = { Hugging Face }
 }
 ```