Edit model card

mt5-cpe-kmutt-thai-sentence-sum

This repository contains the finetuned mT5-base model for Thai sentence summarization. The architecture of the model is based on mT5 model and fine-tuned on text-summarization pairs in Thai. Also, this project is a Senior Project of Computer Engineering Student at King Mongkut’s University of Technology Thonburi.

Usage on SimpleTransformer (Tested on version 0.63.4)

from simpletransformers.t5 import T5Model, T5Args
from torch import cuda

model = T5Model("t5", "thanathorn/mt5-cpe-kmutt-thai-sentence-sum", use_cuda=cuda.is_available())

sentence = "simplify: ถ้าพูดถึงขนมหวานในตำนานที่ชื่นใจที่สุดแล้วละก็ต้องไม่พ้น น้ำแข็งใส แน่เพราะว่าเป็นอะไรที่ชื่นใจสุด"
prediction = model.predict([sentence])
print(prediction[0])

(See the example on Google Colab)

Score

  • ROUGE-1: 61.7805
  • ROUGE-2: 45.9689
  • ROUGE-L: 59.3542

Intended uses & limitations

  • You can use this model for Thai sentence text summarization.
  • Not intended to use with paragraph text.
Downloads last month
21,628
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.