flozi00 commited on
Commit
1d73c08
1 Parent(s): 92f88d6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -79
README.md CHANGED
@@ -1,86 +1,17 @@
1
  ---
2
- library_name: peft
 
 
 
3
  ---
4
- ## Training procedure
5
 
 
6
 
7
- The following `bitsandbytes` quantization config was used during training:
8
- - quant_method: bitsandbytes
9
- - load_in_8bit: False
10
- - load_in_4bit: True
11
- - llm_int8_threshold: 6.0
12
- - llm_int8_skip_modules: None
13
- - llm_int8_enable_fp32_cpu_offload: False
14
- - llm_int8_has_fp16_weight: False
15
- - bnb_4bit_quant_type: fp4
16
- - bnb_4bit_use_double_quant: False
17
- - bnb_4bit_compute_dtype: float16
18
 
19
- The following `bitsandbytes` quantization config was used during training:
20
- - quant_method: bitsandbytes
21
- - load_in_8bit: False
22
- - load_in_4bit: True
23
- - llm_int8_threshold: 6.0
24
- - llm_int8_skip_modules: None
25
- - llm_int8_enable_fp32_cpu_offload: False
26
- - llm_int8_has_fp16_weight: False
27
- - bnb_4bit_quant_type: fp4
28
- - bnb_4bit_use_double_quant: False
29
- - bnb_4bit_compute_dtype: float16
30
 
31
- The following `bitsandbytes` quantization config was used during training:
32
- - quant_method: bitsandbytes
33
- - load_in_8bit: False
34
- - load_in_4bit: True
35
- - llm_int8_threshold: 6.0
36
- - llm_int8_skip_modules: None
37
- - llm_int8_enable_fp32_cpu_offload: False
38
- - llm_int8_has_fp16_weight: False
39
- - bnb_4bit_quant_type: fp4
40
- - bnb_4bit_use_double_quant: False
41
- - bnb_4bit_compute_dtype: float16
42
 
43
- The following `bitsandbytes` quantization config was used during training:
44
- - quant_method: bitsandbytes
45
- - load_in_8bit: False
46
- - load_in_4bit: True
47
- - llm_int8_threshold: 6.0
48
- - llm_int8_skip_modules: None
49
- - llm_int8_enable_fp32_cpu_offload: False
50
- - llm_int8_has_fp16_weight: False
51
- - bnb_4bit_quant_type: fp4
52
- - bnb_4bit_use_double_quant: False
53
- - bnb_4bit_compute_dtype: float16
54
-
55
- The following `bitsandbytes` quantization config was used during training:
56
- - quant_method: bitsandbytes
57
- - load_in_8bit: False
58
- - load_in_4bit: True
59
- - llm_int8_threshold: 6.0
60
- - llm_int8_skip_modules: None
61
- - llm_int8_enable_fp32_cpu_offload: False
62
- - llm_int8_has_fp16_weight: False
63
- - bnb_4bit_quant_type: fp4
64
- - bnb_4bit_use_double_quant: False
65
- - bnb_4bit_compute_dtype: float16
66
-
67
- The following `bitsandbytes` quantization config was used during training:
68
- - quant_method: bitsandbytes
69
- - load_in_8bit: False
70
- - load_in_4bit: True
71
- - llm_int8_threshold: 6.0
72
- - llm_int8_skip_modules: None
73
- - llm_int8_enable_fp32_cpu_offload: False
74
- - llm_int8_has_fp16_weight: False
75
- - bnb_4bit_quant_type: fp4
76
- - bnb_4bit_use_double_quant: False
77
- - bnb_4bit_compute_dtype: float16
78
- ### Framework versions
79
-
80
- - PEFT 0.6.0.dev0
81
- - PEFT 0.6.0.dev0
82
- - PEFT 0.6.0.dev0
83
- - PEFT 0.6.0.dev0
84
- - PEFT 0.6.0.dev0
85
-
86
- - PEFT 0.6.0.dev0
 
1
  ---
2
+ datasets:
3
+ - flozi00/conversations
4
+ language:
5
+ - de
6
  ---
 
7
 
8
+ ## This project is sponsored by [ ![PrimeLine](https://www.primeline-solutions.com/skin/frontend/default/theme566/images/primeline-solutions-logo.png) ](https://www.primeline-solutions.com/de/server/nach-einsatzzweck/gpu-rendering-hpc/)
9
 
10
+ # Model Card
 
 
 
 
 
 
 
 
 
 
11
 
12
+ This model is an finetuned version for german instructions and conversations in style of Alpaca. "### Assistant:" "### User:", trained with a context length of 8k tokens.
13
+ The dataset used is deduplicated and cleaned, with no codes inside. The focus is on instruction following and conversational tasks.
 
 
 
 
 
 
 
 
 
14
 
15
+ The model archictecture is based on Mistral v0.1 with 7B parameters, trained on 100% renewable energy powered hardware.
 
 
 
 
 
 
 
 
 
 
16
 
17
+ This work is contributed by private research of [flozi00](https://huggingface.co/flozi00)