Tirendaz commited on
Commit
a49a9cd
1 Parent(s): 483833a

Upload model

Browse files
Files changed (3) hide show
  1. README.md +22 -2
  2. adapter_config.json +3 -3
  3. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -18,6 +18,7 @@ base_model: mistralai/Mistral-7B-v0.1
18
 
19
 
20
  - **Developed by:** [More Information Needed]
 
21
  - **Shared by [optional]:** [More Information Needed]
22
  - **Model type:** [More Information Needed]
23
  - **Language(s) (NLP):** [More Information Needed]
@@ -76,7 +77,7 @@ Use the code below to get started with the model.
76
 
77
  ### Training Data
78
 
79
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
80
 
81
  [More Information Needed]
82
 
@@ -107,7 +108,7 @@ Use the code below to get started with the model.
107
 
108
  #### Testing Data
109
 
110
- <!-- This should link to a Data Card if possible. -->
111
 
112
  [More Information Needed]
113
 
@@ -235,4 +236,23 @@ The following `bitsandbytes` quantization config was used during training:
235
  ### Framework versions
236
 
237
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
238
  - PEFT 0.6.2
 
18
 
19
 
20
  - **Developed by:** [More Information Needed]
21
+ - **Funded by [optional]:** [More Information Needed]
22
  - **Shared by [optional]:** [More Information Needed]
23
  - **Model type:** [More Information Needed]
24
  - **Language(s) (NLP):** [More Information Needed]
 
77
 
78
  ### Training Data
79
 
80
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
81
 
82
  [More Information Needed]
83
 
 
108
 
109
  #### Testing Data
110
 
111
+ <!-- This should link to a Dataset Card if possible. -->
112
 
113
  [More Information Needed]
114
 
 
236
  ### Framework versions
237
 
238
 
239
+ - PEFT 0.6.2
240
+ ## Training procedure
241
+
242
+
243
+ The following `bitsandbytes` quantization config was used during training:
244
+ - quant_method: bitsandbytes
245
+ - load_in_8bit: False
246
+ - load_in_4bit: True
247
+ - llm_int8_threshold: 6.0
248
+ - llm_int8_skip_modules: None
249
+ - llm_int8_enable_fp32_cpu_offload: False
250
+ - llm_int8_has_fp16_weight: False
251
+ - bnb_4bit_quant_type: nf4
252
+ - bnb_4bit_use_double_quant: False
253
+ - bnb_4bit_compute_dtype: bfloat16
254
+
255
+ ### Framework versions
256
+
257
+
258
  - PEFT 0.6.2
adapter_config.json CHANGED
@@ -16,11 +16,11 @@
16
  "rank_pattern": {},
17
  "revision": null,
18
  "target_modules": [
19
- "q_proj",
20
- "gate_proj",
21
  "v_proj",
22
  "o_proj",
23
- "k_proj"
 
24
  ],
25
  "task_type": "CAUSAL_LM"
26
  }
 
16
  "rank_pattern": {},
17
  "revision": null,
18
  "target_modules": [
19
+ "k_proj",
 
20
  "v_proj",
21
  "o_proj",
22
+ "gate_proj",
23
+ "q_proj"
24
  ],
25
  "task_type": "CAUSAL_LM"
26
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e959eb686bfc2fdfe71bfa8cabece08a711db9adde9262c3fd7f51c4bbd5e4b7
3
  size 369142184
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:edeacf260f8a14d9b56ad699f051606977247af6bf8aa2caa6e692b441cd8ed0
3
  size 369142184