bongsoo commited on
Commit
f2ddf3c
1 Parent(s): 277c09b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md CHANGED
@@ -31,6 +31,30 @@ jinmang2/kpfbert 모델을 sentencebert로 파인듀닝한 모델
31
 
32
  For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name={MODEL_NAME})
33
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
34
  ## Citing & Authors
35
 
36
  bongsoo
 
31
 
32
  For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name={MODEL_NAME})
33
 
34
+ ## Training
35
+ - [jinmang2/kpfbert](https://huggingface.co/jinmang2/kpfbert) 모델을 sts(10)-distil(10)-nli(3)-sts(10) 훈련 시킴
36
+
37
+ The model was trained with the parameters:
38
+
39
+ **공통**
40
+ - **do_lower_case=1, correct_bios=0, polling_mode=cls**
41
+
42
+ **1.STS**
43
+ - 말뭉치 : korsts(5,749) + kluestsV1.1(11,668) + stsb_multi_mt(5,749) + mteb/sickr-sts(9,927) + glue stsb(5,749) (총:38,842)
44
+ - Param : **lr: 1e-4, eps: 1e-6, warm_step=10%, epochs: 10, train_batch: 128, eval_batch: 64, max_token_len: 72**
45
+ - 훈련코드 [여기](https://github.com/kobongsoo/BERT/blob/master/sbert/sentece-bert-sts.ipynb) 참조
46
+
47
+ **2.distilation**
48
+ - 교사 모델 : paraphrase-multilingual-mpnet-base-v2(max_token_len:128)
49
+ - 말뭉치 : news_talk_en_ko_train.tsv (영어-한국어 대화-뉴스 병렬 말뭉치 : 1.38M)
50
+ - Param : **lr: 5e-5, epochs: 10, train_batch: 128, eval/test_batch: 64, max_token_len: 128(교사모델이 128이므로 맟춰줌)**
51
+ - 훈련코드 [여기](https://github.com/kobongsoo/BERT/blob/master/sbert/sbert-distillaton.ipynb) 참조
52
+
53
+ **3.NLI**
54
+ - 말뭉치 : 훈련(967,852) : kornli(550,152), kluenli(24,998), glue-mnli(392,702) / 평가(3,519) : korsts(1,500), kluests(519), gluests(1,500) ()
55
+ - HyperParameter : **lr: 3e-5, eps: 1e-8, warm_step=10%, epochs: 3, train/eval_batch: 64, max_token_len: 128**
56
+ - 훈련코드 [여기](https://github.com/kobongsoo/BERT/blob/master/sbert/sentence-bert-nli.ipynb) 참조
57
+ -
58
  ## Citing & Authors
59
 
60
  bongsoo