huseinzol05 commited on
Commit
9f0f5a5
1 Parent(s): b6bb47c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ms
4
+ tags:
5
+ - paraphrase
6
+ metrics:
7
+ - sacrebleu
8
+ ---
9
+
10
+ # finetune-paraphrase-t5-tiny-standard-bahasa-cased
11
+
12
+ Finetuned T5 tiny on MS paraphrase tasks.
13
+
14
+ ## Dataset
15
+
16
+ 1. translated PAWS, https://huggingface.co/datasets/mesolitica/translated-PAWS
17
+ 2. translated MRPC, https://huggingface.co/datasets/mesolitica/translated-MRPC
18
+
19
+ ## Finetune details
20
+
21
+ 1. Finetune using single RTX 3090 Ti.
22
+
23
+ Scripts at https://github.com/huseinzol05/malaya/tree/master/session/paraphrase/hf-t5
24
+
25
+ ## Supported prefix
26
+
27
+ 1. `parafrasa: {string}`, for MS paraphrase.
28
+
29
+ ## Evaluation
30
+
31
+ Evaluated on MRPC validation set and PAWS test set.
32
+
33
+ ```
34
+ {'name': 'BLEU',
35
+ 'score': 61.06784273649806,
36
+ '_mean': -1.0,
37
+ '_ci': -1.0,
38
+ '_verbose': '86.1/68.4/55.8/45.9 (BP = 0.980 ratio = 0.980 hyp_len = 138209 ref_len = 141004)',
39
+ 'bp': 0.9799801176769202,
40
+ 'counts': [119035, 89737, 69210, 53653],
41
+ 'totals': [138209, 131135, 124061, 116987],
42
+ 'sys_len': 138209,
43
+ 'ref_len': 141004,
44
+ 'precisions': [86.1268079502782,
45
+ 68.4310062149693,
46
+ 55.787072488533866,
47
+ 45.86236077512886],
48
+ 'prec_str': '86.1/68.4/55.8/45.9',
49
+ 'ratio': 0.9801778672945448}
50
+ ```