Commit 458a22f by carsenk (verified)
1 Parent(s): 65ac492

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +55 -55
README.md CHANGED
@@ -1,56 +1,56 @@
- ---
- base_model: meta-llama/Llama-3.2-3b
- datasets:
- - mlabonne/FineTome-100k
- - microsoft/orca-math-word-problems-200k
- - m-a-p/CodeFeedback-Filtered-Instruction
- - cognitivecomputations/dolphin-coder
- - PawanKrd/math-gpt-4o-200k
- - V3N0M/Jenna-50K-Alpaca-Uncensored
- language:
- - en
- library_name: transformers
- license: llama3.2
- pipeline_tag: text-generation
- tags:
- - llama
- - unsloth
- - uncensored
- - llama-3.2
- - llama.cpp
- - gguf
- - inference
- ---
-
- # Llama 3.2 3B Uncensored
-
- This model is a fine-tuned version of Llama 3.2 3B trained by Carsen Klock, using multiple combined datasets and processed for uncensored responses. This was a test run at a very low amount of steps, do not expect the model to behave fully uncensored.
-
- ## Training Details
- - Base Model: Llama 3.2 3B
- - Training Framework: Unsloth
- - Training Type: LoRA Fine-tuning
- - Training Steps: 1000
- - Batch Size: 2
- - Learning Rate: 5e-6
- - Gradient Accumulation Steps: 16
- - Trained on 1 x NVIDIA RX 4080 SUPER
-
- ## Available Formats
- - GGUF (f16, q8_0, q5_k_m, q4_k_m)
- - Merged 16-bit
-
- ## Use Cases
- - Mathematics problem solving
- - Code generation and feedback
- - General instruction following
- - Uncensored responses
-
- ## Training Data
- Combined and processed datasets including:
- - [FineTome-100k](https://huggingface.co/datasets/mlabonne/FineTome-100k) - General instruction following
- - [Orca Math Word Problems](https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k) - Mathematics
- - [CodeFeedback](https://huggingface.co/datasets/m-a-p/CodeFeedback-Filtered-Instruction) - Code feedback
- - [Dolphin Coder](https://huggingface.co/datasets/cognitivecomputations/dolphin-coder) - Code generation
- - [Math GPT-4o](https://huggingface.co/datasets/PawanKrd/math-gpt-4o-200k) - Advanced mathematics
 
+ ---
+ base_model: meta-llama/Llama-3.2-3b
+ datasets:
+ - mlabonne/FineTome-100k
+ - microsoft/orca-math-word-problems-200k
+ - m-a-p/CodeFeedback-Filtered-Instruction
+ - cognitivecomputations/dolphin-coder
+ - PawanKrd/math-gpt-4o-200k
+ - V3N0M/Jenna-50K-Alpaca-Uncensored
+ language:
+ - en
+ library_name: transformers
+ license: llama3.2
+ pipeline_tag: text-generation
+ tags:
+ - llama
+ - unsloth
+ - uncensored
+ - llama-3.2
+ - llama.cpp
+ - gguf
+ - inference
+ ---
+
+ # Llama 3.2 3B Uncensored
+
+ This model is a fine-tuned version of Llama 3.2 3B trained by Carsen Klock, using multiple combined datasets and processed for uncensored responses.
+
+ ## Training Details
+ - Base Model: Llama 3.2 3B
+ - Training Framework: Unsloth
+ - Training Type: LoRA Fine-tuning
+ - Training Steps: 2000
+ - Batch Size: 2
+ - Learning Rate: 5e-6
+ - Gradient Accumulation Steps: 16
+ - Trained on 1 x NVIDIA RX 4080 SUPER
+
+ ## Available Formats
+ - GGUF (f16, q8_0, q5_k_m, q4_k_m)
+ - Merged 16-bit
+
+ ## Use Cases
+ - Mathematics problem solving
+ - Code generation and feedback
+ - General instruction following
+ - Uncensored responses
+
+ ## Training Data
+ Combined and processed datasets including:
+ - [FineTome-100k](https://huggingface.co/datasets/mlabonne/FineTome-100k) - General instruction following
+ - [Orca Math Word Problems](https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k) - Mathematics
+ - [CodeFeedback](https://huggingface.co/datasets/m-a-p/CodeFeedback-Filtered-Instruction) - Code feedback
+ - [Dolphin Coder](https://huggingface.co/datasets/cognitivecomputations/dolphin-coder) - Code generation
+ - [Math GPT-4o](https://huggingface.co/datasets/PawanKrd/math-gpt-4o-200k) - Advanced mathematics
  - [Jenna Uncensored](https://huggingface.co/datasets/V3N0M/Jenna-50K-Alpaca-Uncensored) - Uncensored conversations
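
The only substantive change to the Training Details in this diff is the step count (1000 to 2000). A rough way to see what that means is to work out the effective batch size and the approximate number of training examples seen. The sketch below is illustrative only: the `RunConfig` class and its field names are hypothetical, and it assumes that "Batch Size" is the per-device batch size, that each training step is one optimizer step after gradient accumulation, and that a single GPU was used as the README states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunConfig:
    """Hypothetical container for the values listed under Training Details."""

    per_device_batch_size: int
    gradient_accumulation_steps: int
    training_steps: int
    num_devices: int = 1  # the README lists a single GPU

    @property
    def effective_batch_size(self) -> int:
        # Examples contributing to one optimizer step (assumed interpretation).
        return (
            self.per_device_batch_size
            * self.gradient_accumulation_steps
            * self.num_devices
        )

    @property
    def examples_seen(self) -> int:
        # Upper bound on examples processed; repeats are counted if the
        # dataset is smaller than this number.
        return self.effective_batch_size * self.training_steps


# Values from the removed (-) and added (+) sides of the diff.
before = RunConfig(per_device_batch_size=2, gradient_accumulation_steps=16, training_steps=1000)
after = RunConfig(per_device_batch_size=2, gradient_accumulation_steps=16, training_steps=2000)

for label, run in (("before", before), ("after", after)):
    print(
        f"{label}: effective batch size {run.effective_batch_size}, "
        f"about {run.examples_seen:,} examples seen"
    )
```

Under these assumptions the effective batch size is unchanged between the two revisions, so doubling the step count simply doubles the number of examples processed.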