NOTE: This "delta model" cannot be used directly.
Users have to apply it on top of the original LLaMA weights.
See https://github.com/lm-sys/FastChat#vicuna-weights for instructions.

scifi-fantasy-author Model Card

scifi-fantasy-author is a finetuned LLaMA-7B model to generate narrative fiction, paricularly in the Science Fiction and Fantasy genres.

The following hyperparameters were used

Batch Size Epochs Context length Learning rate Scheduler Weight decay Warmup ratio
128 3 8192 2e-5 Cosine 0. 0.03

The model reached a training loss of 2.008 and took approximately 8 hours on 8x A100 80 GB GPUs.

The specific training script can be found here.

Downloads last month
12
Inference Examples
Inference API (serverless) has been turned off for this model.