dongxq commited on
Commit
6ad3e77
1 Parent(s): 8743bf1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -29,7 +29,7 @@ Good at solving text summarization tasks, after fine-tuning on multiple Chinese
29
 
30
  基于[Randeng-Pegasus-523M-Chinese](https://huggingface.co/IDEA-CCNL/Randeng-Pegasus-523M-Chinese),我们在收集的7个中文领域的文本摘要数据集(约4M个样本),使用实体过滤后数据集(约1.8M)重新微调,在不损伤下游指标的情况下提升了摘要对原文的忠实度,得到了summary-v1版本。这7个数据集为:education, new2016zh, nlpcc, shence, sohu, thucnews和weibo。
31
 
32
- Based on [Randeng-Pegasus-523M-Chinese](https://huggingface.co/IDEA-CCNL/Randeng-Pegasus-523M-Chinese), we fine-tuned a text summarization version (summary-v1) on a filted dataset(1.8M), which we use entitys to filter these 7 Chinese text summarization datasets, with totaling around 4M samples. We can improve the faithfulness of summaries without damage to the downstream task, eg Rouge-L on lcsts. The datasets include: education, new2016zh, nlpcc, shence, sohu, thucnews and weibo.
33
 
34
  ### 下游效果 Performance
35
 
 
29
 
30
  基于[Randeng-Pegasus-523M-Chinese](https://huggingface.co/IDEA-CCNL/Randeng-Pegasus-523M-Chinese),我们在收集的7个中文领域的文本摘要数据集(约4M个样本),使用实体过滤后数据集(约1.8M)重新微调,在不损伤下游指标的情况下提升了摘要对原文的忠实度,得到了summary-v1版本。这7个数据集为:education, new2016zh, nlpcc, shence, sohu, thucnews和weibo。
31
 
32
+ Based on [Randeng-Pegasus-523M-Chinese](https://huggingface.co/IDEA-CCNL/Randeng-Pegasus-523M-Chinese), we fine-tuned a text summarization version (summary-v1) on a filted dataset(1.8M), which we use entitys to filter these 7 Chinese text summarization datasets, with totaling around 4M samples. We can improve the faithfulness of summaries without damage to the downstream task, eg. Rouge-L on lcsts. The datasets include: education, new2016zh, nlpcc, shence, sohu, thucnews and weibo.
33
 
34
  ### 下游效果 Performance
35