kobart / README.md
hyunwoongko's picture
Update README.md
039ea66
metadata
language: ko
tags:
  - bart
license: mit

KoBART-base-v2

With the addition of chatting data, the model is trained to handle the semantics of sequences longer than KoBART.

from transformers import PreTrainedTokenizerFast, BartModel

tokenizer = PreTrainedTokenizerFast.from_pretrained('hyunwoongko/kobart')
model = BartModel.from_pretrained('hyunwoongko/kobart')

Performance

NSMC

  • acc. : 0.901

hyunwoongko/kobart

  • Added bos/eos post processor
  • Removed token_type_ids