avinashhm/dolly-3b-lora

instruction-tuning


avinashhm committed on Jun 23

Commit 73e0f08 · verified · 1 Parent(s): 7cdcb9e

Update README.md

Files changed (1)

README.md +14 -8

README.md CHANGED

@@ -1,13 +1,20 @@
 ---
 library_name: transformers
 tags:
-  - dolly-v2
-  - instruction-tuning
-  - peft
-  - lora
 ---
-# Model Card for dolly-3b-lora
 This model is a fine-tuned version of the Dolly V2 3B language model, enhanced with Parameter-Efficient Fine-Tuning (PEFT) using Low-Rank Adaptation (LoRA). It was fine-tuned on the LaMini-instruction dataset to improve its ability to follow instructions and generate coherent responses for various tasks.
@@ -18,12 +25,11 @@ This model is a fine-tuned version of the Dolly V2 3B language model, enhanced w
 This is a fine-tuned version of the `databricks/dolly-v2-3b` model, adapted using LoRA on the LaMini-instruction dataset. The model is designed for instruction-following tasks, leveraging the efficiency of LoRA to fine-tune approximately 2.93% of the total parameters while maintaining performance. It supports text generation tasks and has been optimized for use on GPU hardware with 8-bit quantization, with a fallback to CPU if needed.
 - **Developed by:** avinashhm
-- **Funded by [optional]:** Not specified
-- **Shared by [optional]:** avinashhm
 - **Model type:** Causal Language Model
 - **Language(s) (NLP):** English
 - **License:** Apache-2.0
-- **Finetuned from model [optional]:** databricks/dolly-v2-3b
 ### Model Sources

 ---
 library_name: transformers
 tags:
+- dolly-v2
+- instruction-tuning
+- peft
+- lora
+license: apache-2.0
+datasets:
+- MBZUAI/LaMini-instruction
+language:
+- en
+base_model:
+- databricks/dolly-v2-3b
 ---
+# dolly-3b-lora(Finetuned)
 This model is a fine-tuned version of the Dolly V2 3B language model, enhanced with Parameter-Efficient Fine-Tuning (PEFT) using Low-Rank Adaptation (LoRA). It was fine-tuned on the LaMini-instruction dataset to improve its ability to follow instructions and generate coherent responses for various tasks.
 This is a fine-tuned version of the `databricks/dolly-v2-3b` model, adapted using LoRA on the LaMini-instruction dataset. The model is designed for instruction-following tasks, leveraging the efficiency of LoRA to fine-tune approximately 2.93% of the total parameters while maintaining performance. It supports text generation tasks and has been optimized for use on GPU hardware with 8-bit quantization, with a fallback to CPU if needed.
 - **Developed by:** avinashhm
+- **Shared by :** avinashhm
 - **Model type:** Causal Language Model
 - **Language(s) (NLP):** English
 - **License:** Apache-2.0
+- **Finetuned from model :** databricks/dolly-v2-3b
 ### Model Sources