huseinzol05 committed on
Commit
41e211e
1 Parent(s): ddf12bb

Create README.md

Files changed (1)
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
+ ---
+ language:
+ - ms
+ tags:
+ - paraphrase
+ metrics:
+ - sacrebleu
+ ---
+
+ # finetune-paraphrase-t5-small-standard-bahasa-cased
+
+ T5 small fine-tuned on Malay (MS) paraphrase tasks.
+
+ ## Dataset
+
+ 1. Translated PAWS, https://huggingface.co/datasets/mesolitica/translated-PAWS
+ 2. Translated MRPC, https://huggingface.co/datasets/mesolitica/translated-MRPC
+
+ ## Finetune details
+
+ 1. Fine-tuned on a single RTX 3090 Ti.
+
+ Scripts at https://github.com/huseinzol05/malaya/tree/master/session/paraphrase/hf-t5
+
+ ## Supported prefix
+
+ 1. `parafrasa: {string}`, for MS paraphrase.
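A minimal sketch of how the `parafrasa: ` prefix could be applied at inference time. Several things here are assumptions, not statements from the README: the Hub ID `mesolitica/finetune-paraphrase-t5-small-standard-bahasa-cased` (the README gives only the model name, so the `mesolitica/` namespace is a guess), that the checkpoint loads with the standard `transformers` T5 classes, and the `max_length` value. The helper names `build_input` and `paraphrase` are hypothetical.

```python
PREFIX = "parafrasa: "  # prefix documented in the README

# Assumed Hub ID: the README gives only the model name; the namespace is a guess.
MODEL_ID = "mesolitica/finetune-paraphrase-t5-small-standard-bahasa-cased"


def build_input(text: str) -> str:
    """Prepend the paraphrase prefix to a Malay sentence."""
    return PREFIX + text.strip()


def paraphrase(text: str, max_length: int = 256) -> str:
    """Generate one paraphrase; downloads the model on first call."""
    # Imported lazily so build_input works without transformers installed.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = T5ForConditionalGeneration.from_pretrained(MODEL_ID)
    inputs = tokenizer(build_input(text), return_tensors="pt")
    # max_length is an illustrative setting, not one taken from the README.
    output_ids = model.generate(**inputs, max_length=max_length)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `paraphrase("Saya suka makan nasi lemak setiap pagi.")` would fetch the weights and return one generated paraphrase, provided the assumed Hub ID is correct. The example sentence is illustrative only.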
+
+ ## Evaluation
+
+ Evaluated on the MRPC validation set and the PAWS test set.
+
+ ```
+ {'name': 'BLEU',
+ 'score': 61.559202822392486,
+ '_mean': -1.0,
+ '_ci': -1.0,
+ '_verbose': '86.0/68.4/56.1/46.3 (BP = 0.984 ratio = 0.984 hyp_len = 138806 ref_len = 141004)',
+ 'bp': 0.9842896648933511,
+ 'counts': [119394, 90149, 69905, 54500],
+ 'totals': [138806, 131732, 124658, 117584],
+ 'sys_len': 138806,
+ 'ref_len': 141004,
+ 'precisions': [86.0150137602121,
+ 68.43363799228737,
+ 56.07742784257729,
+ 46.34984351612464],
+ 'prec_str': '86.0/68.4/56.1/46.3',
+ 'ratio': 0.9844117897364614}
+ ```
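To make the reported fields easier to read, here is a small sketch that recomputes the headline score from the `counts`, `totals`, `sys_len`, and `ref_len` values above. It uses the standard corpus BLEU definition (brevity penalty times the geometric mean of the 1- to 4-gram precisions, no smoothing) and does not call sacreBLEU itself. The function name `corpus_bleu_from_stats` is hypothetical.

```python
import math

# Values copied from the reported evaluation output.
counts = [119394, 90149, 69905, 54500]      # matched n-grams, n = 1..4
totals = [138806, 131732, 124658, 117584]   # hypothesis n-grams, n = 1..4
sys_len, ref_len = 138806, 141004


def corpus_bleu_from_stats(counts, totals, sys_len, ref_len):
    """Recompute corpus BLEU (0-100 scale) from n-gram statistics, no smoothing."""
    precisions = [100.0 * c / t for c, t in zip(counts, totals)]
    # Brevity penalty: penalises hypotheses shorter than the references.
    bp = 1.0 if sys_len >= ref_len else math.exp(1.0 - ref_len / sys_len)
    # Geometric mean of the n-gram precisions.
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / len(precisions))
    return bp * geo_mean, bp, precisions


score, bp, precisions = corpus_bleu_from_stats(counts, totals, sys_len, ref_len)
print(f"BLEU = {score:.2f}, BP = {bp:.3f}")  # BLEU = 61.56, BP = 0.984
```

The recomputed values agree with the reported `score`, `bp`, and `precisions` fields. The `ratio` field is `sys_len / ref_len`; because it is below 1, the brevity penalty applies.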