raincandy-u commited on
Commit
b7c0990
1 Parent(s): 31d0764

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - facebook
7
+ - meta
8
+ - pytorch
9
+ - llama
10
+ - llama-3
11
+ license: other
12
+ license_name: llama3
13
+ license_link: LICENSE
14
+ datasets:
15
+ - unalignment/toxic-dpo-v0.2
16
+
17
+ ---
18
+
19
+
20
+
21
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/657eb5b256c9c67605a6e8b5/6X2l7zKi9ttuxfRGLucUY.png)
22
+
23
+ **Llama-3-5B-Sheard**
24
+
25
+ Pruned version of Llama-3-8b.
26
+
27
+ Tool used: [PrunMe](https://github.com/arcee-ai/PruneMe), Mergekit.
28
+
29
+ **Meta Llama 3 is licensed under the Meta Llama 3 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.**
30
+
31
+ ## Training
32
+
33
+ After sliced by mergekit, the model is continue-pretrained on minipile for 1 epoch and ~100k samples. Then we trained it using ORPO on Llama-3-70b generated DPO pairs.
34
+
35
+ ## Disclaimer
36
+
37
+ This model is for testing purposes only, and when the system prompt is not empty, the output may repeat and not stop!
38
+
39
+ <h1><a href="https://discord.gg/Dw8SkfBm">Join our discord</a></h1>