metadata
library_name: transformers
license: mit
datasets:
  - klue
  - dkoterwa/kor-sts
language:
  - ko
pipeline_tag: sentence-similarity

KF-DeBERTa-base-cross-STS

Base model: KF-DeBERTa-base-cross-NLI (https://huggingface.co/deliciouscat/kf-deberta-base-cross-nli)

Training data:

  • klue/sts: 1 epoch
  • dkoterwa/kor-sts: 2 epochs

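Label scaling: 0 to 5 -> -1 to 1

This model was trained with dataset augmentation for bi-encoder STS training in mind (https://arxiv.org/abs/2010.08240). Its output is scaled to the -1 to 1 range so that the predicted scores can be used directly as targets when training with cosine similarity.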
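The label scaling can be sketched as follows. This is a minimal illustration that assumes the mapping from the 0 to 5 range to the -1 to 1 range is linear; the model card does not state the exact formula, and the function names `scale_label` and `unscale_output` are hypothetical helpers, not part of this repository.

```python
def scale_label(score: float) -> float:
    """Map an STS score in [0, 5] onto [-1, 1] (assumed linear mapping)."""
    if not 0.0 <= score <= 5.0:
        raise ValueError(f"STS score must be in [0, 5], got {score}")
    return score / 2.5 - 1.0


def unscale_output(value: float) -> float:
    """Inverse of scale_label: map a value in [-1, 1] back to the 0-5 scale."""
    return (value + 1.0) * 2.5


if __name__ == "__main__":
    # Show the two endpoints and the midpoint of the original scale.
    for raw in (0.0, 2.5, 5.0):
        print(raw, "->", scale_label(raw))
```

Under this assumption, a model output can be converted back to the familiar 0 to 5 STS scale with `unscale_output` when a human-readable score is needed.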