NOTE: This "delta model" cannot be used directly.
Users have to apply it on top of the original LLaMA weights.
See https://github.com/lm-sys/FastChat#vicuna-weights for instructions.
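Conceptually, a delta checkpoint holds the difference between the fine-tuned weights and the base weights, so recovering the usable model means adding the two together parameter by parameter. The toy sketch below illustrates that idea with plain Python lists; the function name `apply_delta` and the parameter names are hypothetical, and this is not the FastChat tool itself. For the real conversion, follow the linked FastChat instructions.

```python
def apply_delta(base, delta):
    """Return target weights computed as base[name] + delta[name], element-wise.

    `base` and `delta` are toy stand-ins for model state dicts:
    each maps a parameter name to a flat list of floats.
    """
    if base.keys() != delta.keys():
        raise ValueError("base and delta must contain the same parameter names")

    target = {}
    for name, base_values in base.items():
        delta_values = delta[name]
        if len(base_values) != len(delta_values):
            raise ValueError(f"shape mismatch for {name!r}")
        # The released delta is added on top of the original weights.
        target[name] = [b + d for b, d in zip(base_values, delta_values)]
    return target


# Hypothetical two-parameter "model" used only for illustration.
base = {"layer.weight": [1.0, 2.0], "layer.bias": [0.5]}
delta = {"layer.weight": [0.25, -0.5], "layer.bias": [0.0]}

merged = apply_delta(base, delta)
print(merged)
```

The delta alone is not a working model: without the matching base weights, the sums above cannot be formed.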
scifi-fantasy-author Model Card
scifi-fantasy-author is a LLaMA-7B model fine-tuned to generate narrative fiction,
particularly in the Science Fiction and Fantasy genres.
The following hyperparameters were used:
| Batch Size | Epochs | Context length | Learning rate | Scheduler | Weight decay | Warmup ratio |
The model reached a training loss of 2.008 and took approximately 8 hours on 8x A100 80 GB GPUs.
The specific training script can be found here.
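As a rough way to read the reported numbers, the short calculation below converts the training loss into a perplexity and the wall-clock time into GPU-hours. It assumes the loss is a mean per-token cross-entropy in nats, which the card does not state explicitly, so treat the perplexity as an estimate under that assumption.

```python
import math

train_loss = 2.008      # final training loss reported in the card
wall_clock_hours = 8    # approximate training time reported in the card
num_gpus = 8            # 8x A100 80 GB

# Assumption: the loss is mean per-token cross-entropy measured in nats,
# in which case perplexity is simply exp(loss).
perplexity = math.exp(train_loss)

# Total compute budget expressed in GPU-hours.
gpu_hours = wall_clock_hours * num_gpus

print(f"perplexity ~ {perplexity:.2f}")
print(f"GPU-hours  ~ {gpu_hours}")
```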