Edit model card

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Hyperparameters Value
name AdamW
learning_rate.class_name WarmUpCosine
learning_rate.config.lr_start 1e-05
learning_rate.config.lr_max 0.001
learning_rate.config.total_steps 15625
learning_rate.config.warmup_steps 2343
decay 0.0
beta_1 0.8999999761581421
beta_2 0.9990000128746033
epsilon 1e-07
amsgrad False
weight_decay 9.999999747378752e-05
exclude_from_weight_decay None
training_precision float32

Model Plot

View Model Plot

Model Image

Downloads last month
2
Unable to determine this model’s pipeline type. Check the docs .