Librarian Bot: Add base_model information to model
#1
by
librarian-bot
- opened
README.md
CHANGED
@@ -1,8 +1,5 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
-
datasets:
|
4 |
-
- vicgalle/alpaca-gpt4
|
5 |
-
pipeline_tag: conversational
|
6 |
tags:
|
7 |
- alpaca
|
8 |
- gpt4
|
@@ -11,6 +8,10 @@ tags:
|
|
11 |
- finetuning
|
12 |
- lora
|
13 |
- peft
|
|
|
|
|
|
|
|
|
14 |
---
|
15 |
GPT-J 6B model was finetuned on GPT-4 generations of the Alpaca prompts on [MonsterAPI](https://monsterapi.ai)'s no-code LLM finetuner, using LoRA for ~ 65,000 steps, auto-optmised to run on 1 A6000 GPU with no out of memory issues and without needing me to write any code or setup a GPU server with libraries to run this experiment. The finetuner does it all for us by itself.
|
16 |
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
|
|
3 |
tags:
|
4 |
- alpaca
|
5 |
- gpt4
|
|
|
8 |
- finetuning
|
9 |
- lora
|
10 |
- peft
|
11 |
+
datasets:
|
12 |
+
- vicgalle/alpaca-gpt4
|
13 |
+
pipeline_tag: conversational
|
14 |
+
base_model: EleutherAI/gpt-j-6b
|
15 |
---
|
16 |
GPT-J 6B model was finetuned on GPT-4 generations of the Alpaca prompts on [MonsterAPI](https://monsterapi.ai)'s no-code LLM finetuner, using LoRA for ~ 65,000 steps, auto-optmised to run on 1 A6000 GPU with no out of memory issues and without needing me to write any code or setup a GPU server with libraries to run this experiment. The finetuner does it all for us by itself.
|
17 |
|