wilmerhenao committed
Commit fb7211b
1 Parent(s): 2bb239d

Introducing Olinguito: A Language Model Fine-Tuned with the LoRA Algorithm on Alpaca-Cleaned Data


This commit adds Olinguito, a new language model derived from Dolly and fine-tuned with LoRA (Low-Rank Adaptation of Large Language Models). Olinguito is trained on the alpaca-cleaned dataset, a meticulously cleaned version of the Alpaca instruction data. By applying LoRA, Olinguito aims to deliver improved performance, accuracy, and robustness on natural language processing tasks. This commit lays the foundation for hosting Olinguito in our Hugging Face repository, so users can access this refined language model for a range of applications.
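For anyone wanting to try the adapter, the sketch below shows one way to attach it to its base model with `peft`. The repo id `wilmerhenao/Olinguito` is a hypothetical placeholder for wherever these files end up published; the base model and 8-bit loading follow the `adapter_config.json` and training config in this commit.

```python
# Minimal sketch: load the base model in 8-bit and attach the LoRA adapter.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "EleutherAI/gpt-j-6B"        # base model from adapter_config.json
adapter_id = "wilmerhenao/Olinguito"   # hypothetical adapter repo id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id,
    load_in_8bit=True,   # matches the training quantization config
    device_map="auto",
)
model = PeftModel.from_pretrained(base, adapter_id)  # attaches the LoRA weights

prompt = "Explain low-rank adaptation in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```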

Files changed (3)
  1. README.md +42 -1
  2. adapter_config.json +20 -0
  3. adapter_model.bin +3 -0
README.md CHANGED
@@ -1,3 +1,19 @@
  ---
- license: apache-2.0
+ library_name: peft
  ---
+ ## Training procedure
+
+ The following `bitsandbytes` quantization config was used during training:
+ - load_in_8bit: True
+ - load_in_4bit: False
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: fp4
+ - bnb_4bit_use_double_quant: False
+ - bnb_4bit_compute_dtype: float32
+
+ ### Framework versions
+
+ - PEFT 0.4.0.dev0
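The flat key list in the README maps one-to-one onto a `transformers` `BitsAndBytesConfig`. A minimal sketch of reconstructing that quantization config (only `load_in_8bit` deviates from the defaults, but all keys are spelled out to mirror the README):

```python
# Sketch: the quantization config above, expressed as a BitsAndBytesConfig.
import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_8bit=True,
    load_in_4bit=False,
    llm_int8_threshold=6.0,
    llm_int8_skip_modules=None,
    llm_int8_enable_fp32_cpu_offload=False,
    llm_int8_has_fp16_weight=False,
    bnb_4bit_quant_type="fp4",
    bnb_4bit_use_double_quant=False,
    bnb_4bit_compute_dtype=torch.float32,
)
```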
adapter_config.json ADDED
@@ -0,0 +1,20 @@
+ {
+   "base_model_name_or_path": "EleutherAI/gpt-j-6B",
+   "bias": "none",
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "lora_alpha": 16,
+   "lora_dropout": 0.05,
+   "modules_to_save": null,
+   "peft_type": "LORA",
+   "r": 4,
+   "revision": null,
+   "target_modules": [
+     "q_proj",
+     "v_proj"
+   ],
+   "task_type": "CAUSAL_LM"
+ }
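For reference, the JSON above corresponds to a `peft.LoraConfig`. A sketch of rebuilding it in Python, e.g. to reproduce the fine-tuning setup on the same base model:

```python
# Sketch: adapter_config.json expressed as a LoraConfig.
from peft import LoraConfig

lora_config = LoraConfig(
    r=4,                                   # rank of the low-rank update matrices
    lora_alpha=16,                         # scaling factor (alpha / r scales the update)
    lora_dropout=0.05,
    bias="none",
    target_modules=["q_proj", "v_proj"],   # GPT-J attention projections
    task_type="CAUSAL_LM",
)
```

Targeting only the query and value projections with a small rank (r=4) keeps the adapter tiny, which is why `adapter_model.bin` below weighs in at roughly 7 MB against a 6B-parameter base model.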
adapter_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2f486b4780c854cb07b402669a80fff69bf52d382be990154f4019ae53a13a25
+ size 7379597
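Since `adapter_model.bin` is stored via Git LFS, the diff shows only a pointer file carrying the blob's sha256 and size. A sketch of downloading the real file and checking it against that digest (the repo id is again a hypothetical placeholder):

```python
# Sketch: fetch the adapter weights and verify them against the LFS pointer.
import hashlib
from huggingface_hub import hf_hub_download

path = hf_hub_download("wilmerhenao/Olinguito", "adapter_model.bin")
digest = hashlib.sha256(open(path, "rb").read()).hexdigest()
assert digest == "2f486b4780c854cb07b402669a80fff69bf52d382be990154f4019ae53a13a25"
```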