neverLife committed on
Commit
cf0d769
1 Parent(s): aae38c5

Update README.md

Files changed (1)
  1. README.md +47 -0
README.md CHANGED
@@ -1,3 +1,50 @@
1
  ---
2
  license: openrail
3
+ language:
4
+ - ja
5
+ - zh
6
+ metrics:
7
+ - bleu
8
+ pipeline_tag: translation
9
  ---
10
+
11
+ Results after 1 epoch of training on the Tatoeba-Challenge dataset.
12
+
13
+
14
+
15
+ ## Results
16
+
17
+ The model achieves the following results on the evaluation set:
18
+ - Loss: 1.3042
19
+ - BLEU: 55.834
20
+ - Gen Len: 17.2465
21
+
22
+
23
+
24
+ ## Usage demo
25
+
26
+ ```python
27
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
28
+
29
+ model_path = "neverLife/nllb-200-distilled-600M-ja-zh"
30
+ model = AutoModelForSeq2SeqLM.from_pretrained(model_path)
31
+ ja = "ぜんぜん田舎に来た気がしないんだが……。"
32
+ tokenizer = AutoTokenizer.from_pretrained(model_path, src_lang="jpn_Jpan", tgt_lang="zho_Hans")
33
+
34
+ input_ids = tokenizer.encode(ja, max_length=128, padding=True, truncation=True, return_tensors='pt')
35
+ outputs = model.generate(input_ids, num_beams=4, max_new_tokens=128)
36
+
37
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
38
+
39
+ ```
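
The demo above translates a single sentence. For several sentences at once, a minimal sketch could look like the following. The helper names `translate_batch` and `load` are hypothetical and not part of this repository. Passing `forced_bos_token_id` explicitly follows the usual pattern for base NLLB checkpoints; whether this fine-tuned checkpoint already stores that setting in its configuration is not stated in the README, so treat it as a safeguard under that assumption rather than a requirement.

```python
from typing import List


def translate_batch(texts: List[str], tokenizer, model,
                    tgt_lang: str = "zho_Hans", max_length: int = 128) -> List[str]:
    """Translate a list of sentences with an NLLB-style tokenizer and model."""
    # Tokenize the whole batch at once: padding aligns sentence lengths,
    # truncation enforces max_length.
    inputs = tokenizer(texts, return_tensors="pt", padding=True,
                       truncation=True, max_length=max_length)
    # NLLB picks the output language from the first generated token.
    # Assumption: setting it explicitly is harmless even if the checkpoint
    # already carries it in its generation config.
    outputs = model.generate(
        **inputs,
        forced_bos_token_id=tokenizer.convert_tokens_to_ids(tgt_lang),
        num_beams=4,
        max_new_tokens=max_length,
    )
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)


def load(model_path: str = "neverLife/nllb-200-distilled-600M-ja-zh"):
    """Load tokenizer and model (downloads the weights on first call)."""
    # Imported lazily so translate_batch can be used or tested on its own.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained(
        model_path, src_lang="jpn_Jpan", tgt_lang="zho_Hans")
    model = AutoModelForSeq2SeqLM.from_pretrained(model_path)
    return tokenizer, model
```

Typical use would be `tokenizer, model = load()` followed by `translate_batch([ja], tokenizer, model)`, which returns one translated string per input sentence.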
40
+
41
+
42
+
43
+
44
+
45
+ ## Framework versions
46
+
47
+ - Transformers 4.28.1
48
+ - PyTorch 2.0.0+cu117
49
+ - Datasets 2.11.0
50
+ - Tokenizers 0.13.3