Awan LLM committed on
Commit 91e262b
1 Parent(s): 8fd63e4

Update README.md

  1. README.md +44 -3
README.md CHANGED
@@ -1,3 +1,44 @@
- ---
- license: llama3
- ---
+ ---
+ license: llama3
+ ---
+ Based on Meta-Llama-3-8B-Instruct and governed by the Meta Llama 3 License agreement:
+ https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
+
+
+ This v0.3 version is even more uncensored because it uses https://huggingface.co/AwanLLM/Awanllm-Llama-3-8B-Dolfin-v0.6-Abliterated as the base model.
+
+
+ In terms of reasoning and intelligence, this model is probably a bit worse than the original model because of the decensoring. However, it is better at long back-and-forth chats and refuses less often.
+
+
+ We have it up on our site, https://awanllm.com, for anyone to try.
+
+
+ OpenLLM Benchmark:
+
+
+ Training:
+ - Full 8192 sequence length.
+ - Training took around 2 days on an RTX 4090, using 4-bit loading and QLoRA with rank 64 and alpha 64, resulting in ~2% trainable weights.
+
+
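Review note: the "~2% trainable weights" figure can be sanity-checked with a quick parameter count. The sketch below is not taken from the commit. It assumes LoRA adapters on all seven linear projections of every decoder layer (the README does not say which modules were targeted) and uses the published Llama 3 8B dimensions. Alpha only scales the adapter output, so it does not affect the count.

```python
# Llama 3 8B architecture dimensions (from the published model config).
HIDDEN = 4096
INTERMEDIATE = 14336
LAYERS = 32
KV_DIM = 1024   # 8 key/value heads x 128 head dim (grouped-query attention)
VOCAB = 128256
RANK = 64       # LoRA rank stated in the README

# (in_features, out_features) of each linear projection in one decoder layer.
# Assumption: adapters are attached to all seven of them.
PROJECTIONS = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_params(rank: int) -> int:
    """Each adapter adds two matrices: A (rank x in) and B (out x rank)."""
    per_layer = sum(rank * (i + o) for i, o in PROJECTIONS.values())
    return per_layer * LAYERS


def base_params() -> int:
    """Parameter count of the frozen base model."""
    # Linear projections plus the two RMSNorm weight vectors in each layer.
    per_layer = sum(i * o for i, o in PROJECTIONS.values()) + 2 * HIDDEN
    # Input embeddings plus the untied output head.
    embeddings = 2 * VOCAB * HIDDEN
    # The trailing HIDDEN term is the final RMSNorm weight vector.
    return per_layer * LAYERS + embeddings + HIDDEN


trainable = lora_params(RANK)
total = base_params()
print(f"{trainable:,} / {total:,} = {trainable / total:.2%}")
```

Under these assumptions the ratio comes out at roughly 2.1%, which is consistent with the figure in the README. Targeting fewer modules would give a smaller percentage.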
+ Instruct format:
+ ```
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
+
+ {{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+
+ {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+
+ {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+
+ {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+ ```
+
+
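Review note: for readers who assemble prompts by hand, the template above can be sketched as a small helper. This is an illustration only and not part of the commit. The function name `build_llama3_prompt` and the message-dict shape are hypothetical. The blank line after each header follows the template, and the blank line after the final assistant header follows the standard Llama 3 convention.

```python
from typing import Dict, List


def build_llama3_prompt(messages: List[Dict[str, str]],
                        add_generation_prompt: bool = True) -> str:
    """Render a list of {"role", "content"} dicts into the Llama 3 instruct format.

    Hypothetical helper: in practice a tokenizer's chat template would
    normally do this for you.
    """
    prompt = "<|begin_of_text|>"
    for message in messages:
        # Header, blank line, content, end-of-turn marker.
        prompt += (
            f"<|start_header_id|>{message['role']}<|end_header_id|>\n\n"
            f"{message['content']}<|eot_id|>"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt


example = build_llama3_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(example)
```

When generating, treat `<|eot_id|>` as a stop token so the model ends its turn instead of continuing into a new user header.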
+ Quants:
+
+ FP16: https://huggingface.co/AwanLLM/Awanllm-Llama-3-8B-Cumulus-v0.3
+
+ GGUF: https://huggingface.co/AwanLLM/Awanllm-Llama-3-8B-Cumulus-v0.3-GGUF
+