Tamil-Summarization / README.md
suriya7's picture
Update README.md
3b47336 verified
|
raw
history blame
No virus
2.52 kB
metadata
license: mit
datasets:
  - HariprasathSB/tamil_summarization
language:
  - en
  - ta
tags:
  - summarization
  - translation

Tamil Summarization and English-to-Tamil Translation Model

Overview

This repository contains a fine-tuned model for both Tamil summarization and English-to-Tamil translation. The model was fine-tuned using the Hugging Face Transformers library. This README provides information on how to use the model and its capabilities.

Model Details

Model Performance

W&B Chart 23_3_2024, 11_46_59 pm.png

Usage

Installation

You can install the necessary dependencies using pip:

pip install transformers

Inference

Below is an example of how to use the model for both summarization and translation tasks:

# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("suriya7/Tamil-Summarization")
model = AutoModelForSeq2SeqLM.from_pretrained("suriya7/Tamil-Summarization")

# Example English-to-Tamil Translation:

input_text = "This is an example English sentence."
input_ids = tokenizer.encode(input_text, return_tensors="pt").input_ids
outputs = model.generate(input_ids,max_length=128)
translated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print("Translated Tamil Sentence:", translated_text)

# Example Tamil Summarization:

tamil_article = "தமிழ் உரையினை சுருக்கமாக சுருக்கமாக உரையிடுவது எப்படி?"
tamil_input_ids = tokenizer.encode(tamil_article, return_tensors="pt",truncation=True).input_ids
summary_ids = model.generate(tamil_input_ids, max_length=128)
summary = tokenizer.decode(summary_ids[0], skip_special_tokens=True)
print("Summarized Tamil Text:", summary)

Model Output

  • For translation tasks, the model outputs translated text in Tamil.
  • For summarization tasks, the model outputs a summarized version of the input Tamil text.