cognitivecomputations
/

Samantha-1.11-CodeLlama-34b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ehartford commited on Aug 25, 2023

Commit

3fd110d

•

1 Parent(s): f6490bd

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ She will not engage in roleplay, romance, or sexual activity.
 She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
-This Samantha was trained 15 epochs, and is significantly smarter. She took 24 hours on 4x A100 80gb using [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl), [qLoRA](https://arxiv.org/abs/2305.14314), [deepspeed zero2](https://www.deepspeed.ai/tutorials/zero/#zero-overview), and [flash attention 2](https://arxiv.org/abs/2205.14135).
 Her conversation format is the same as Vicuna 1.1
 https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml

 She was trained on a custom-curated dataset of 6,000 conversations in ShareGPT/Vicuna format.
+This Samantha was trained 40 epochs, and is significantly smarter. She took 24 hours on 4x A100 80gb using [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl), [qLoRA](https://arxiv.org/abs/2305.14314), [deepspeed zero2](https://www.deepspeed.ai/tutorials/zero/#zero-overview), and [flash attention 2](https://arxiv.org/abs/2205.14135).
 Her conversation format is the same as Vicuna 1.1
 https://github.com/oobabooga/text-generation-webui/blob/main/characters/instruction-following/Vicuna-v1.1.yaml