ehartford commited on
Commit
3fd110d
1 Parent(s): f6490bd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -24,7 +24,7 @@ She will not engage in roleplay, romance, or sexual activity.
24
 
25
  She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
26
 
27
- This Samantha was trained 15 epochs, and is significantly smarter. She took 24 hours on 4x A100 80gb using [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl), [qLoRA](https://arxiv.org/abs/2305.14314), [deepspeed zero2](https://www.deepspeed.ai/tutorials/zero/#zero-overview), and [flash attention 2](https://arxiv.org/abs/2205.14135).
28
 
29
  Her conversation format is the same as Vicuna 1.1
30
  https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml
 
24
 
25
  She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
26
 
27
+ This Samantha was trained 40 epochs, and is significantly smarter. She took 24 hours on 4x A100 80gb using [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl), [qLoRA](https://arxiv.org/abs/2305.14314), [deepspeed zero2](https://www.deepspeed.ai/tutorials/zero/#zero-overview), and [flash attention 2](https://arxiv.org/abs/2205.14135).
28
 
29
  Her conversation format is the same as Vicuna 1.1
30
  https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml