emozilla
/

scifi-fantasy-author-7b-8k_delta

Text Generation

text-generation-inference

Model card Files Files and versions Community

emozilla commited on May 11, 2023

Commit

6e46e7d

•

1 Parent(s): cc5ac58

Create README.md

Files changed (1) hide show

README.md +22 -0

README.md ADDED Viewed

	@@ -0,0 +1,22 @@

+---
+license: apache-2.0
+inference: false
+---
+**NOTE: This "delta model" cannot be used directly.**
+Users have to apply it on top of the original LLaMA weights.
+See https://github.com/lm-sys/FastChat#vicuna-weights for instructions.
+<br>
+<br>
+# scifi-fantasy-author Model Card
+`scifi-fantasy-author` is a finetuned LLaMA model to generate narrative fiction,
+paricularly in the Science Fiction and Fantasy genres.
+The following hyperparameters were used
+|Batch Size|Epochs|Context length|Learning rate|Scheduler|Weight decay|Warmup ratio|
+|----------|------|--------------|-------------|---------|------------|------------|
+|       64 |    3 |         8192 |        2e-5 |  Cosine |         0. |       0.03 |
+The model reached a training loss of 2.008 and took approximately 8 hours on 8x A100 80 GB GPUs.