Edit model card

Generic badge

Prompt Extend

Text generation model for generating suitable style cues given the main idea for a prompt.

It is a GPT-2 model trained on dataset of stable diffusion prompts.

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 128
  • eval_batch_size: 256
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 5
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss
3.7436 1.0 12796 2.5429
2.3292 2.0 25592 2.0711
1.9439 3.0 38388 1.8447
1.7059 4.0 51184 1.7325
1.5775 5.0 63980 1.7110

Framework versions

  • Transformers 4.24.0
  • Pytorch 1.13.0+cu117
  • Datasets 2.7.1
  • Tokenizers 0.13.2
Downloads last month
2,589
Hosted inference API
Text Generation
Examples
Examples
This model can be loaded on the Inference API on-demand.

Spaces using daspartho/prompt-extend 10