nbeerbower
/

Llama3.1-Allades-8B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nbeerbower commited on Sep 28, 2024

Commit

f5d5149

·

verified ·

1 Parent(s): e4dd938

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -11,4 +11,20 @@ base_model:
 - mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated
 ---
-# Llama3.1-Allades-8B

 - mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated
 ---
+# Llama3.1-Allades-8B
+Allades finetunes abliterated Llama 3.1 with 5 datasets to improve creative writing, reasoning, and roleplay.
+## Datasets
+- [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1)
+- [nbeerbower/gutenberg2-dpo](https://huggingface.co/datasets/nbeerbower/gutenberg2-dpo)
+- [jondurbin/truthy-dpo-v0.1](https://huggingface.co/datasets/jondurbin/truthy-dpo-v0.1)
+- [kyujinpy/orca_math_dpo](https://huggingface.co/datasets/kyujinpy/orca_math_dpo)
+- [antiven0m/physical-reasoning-dpo](https://huggingface.co/datasets/antiven0m/physical-reasoning-dpo)
+## Training
+ORPO tuned for 1 epoch with 2x RTX 3090 (sponsored by [Schneewolf Labs](https://schneewolflabs.com)).
+Data was prepared with [Llama 3.1 Instruct](https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_1/).