flash-attention / cosine-warmup.yaml
theonlyengine's picture
Upload 421 files
3f9c425 verified
raw
history blame
82 Bytes
# @package train.scheduler
_target_: transformers.get_cosine_schedule_with_warmup