Kirili4ik/mbart_ruDialogSum


	---
	language:
	- ru
	tags:
	- mbart
	inference:
	  parameters:
	    no_repeat_ngram_size: 4
	    num_beams: 5
	datasets:
	- IlyaGusev/gazeta
	- samsum
	- samsum_(translated_into_Russian)
	widget:
	- text: >
	    Джефф: Могу ли я обучить модель 🤗 Transformers на Amazon SageMaker?

	    Филипп: Конечно, вы можете использовать новый контейнер для глубокого
	    обучения HuggingFace.

	    Джефф: Хорошо.

	    Джефф: и как я могу начать?

	    Джефф: где я могу найти документацию?

	    Филипп: ок, ок, здесь можно найти все:
	    https://huggingface.co/blog/the-partnership-amazon-sagemaker-and-hugging-face
	model-index:
	- name: mbart_ruDialogSum
	  results:
	  - task:
	      name: Abstractive Dialogue Summarization
	      type: abstractive-text-summarization
	    dataset:
	      name: SAMSum Corpus (translated to Russian)
	      type: samsum
	    metrics:
	    - name: Validation ROUGE-1
	      type: rouge-1
	      value: 34.5
	    - name: Validation ROUGE-L
	      type: rouge-l
	      value: 33
	    - name: Test ROUGE-1
	      type: rouge-1
	      value: 31
	    - name: Test ROUGE-L
	      type: rouge-l
	      value: 28
	license: cc
	---
	### 📝 Description

	mBART for Russian summarization, fine-tuned for dialogue summarization.


	This model was first fine-tuned by [Ilya Gusev](https://hf.co/IlyaGusev) on the [Gazeta dataset](https://huggingface.co/datasets/IlyaGusev/gazeta). We then fine-tuned that model on the [SAMSum dataset](https://huggingface.co/datasets/samsum), translated into Russian with the Google Translate API.

	🤗 We have also built a Telegram bot, [@summarization_bot](https://t.me/summarization_bot), that runs inference with this model. Add it to a chat and get summaries instead of dozens of spam messages! 🤗


	### ❓ How to use with code
	```python
	from transformers import AutoTokenizer, MBartForConditionalGeneration

	# Download the model and tokenizer
	model_name = "Kirili4ik/mbart_ruDialogSum"
	tokenizer = AutoTokenizer.from_pretrained(model_name)
	model = MBartForConditionalGeneration.from_pretrained(model_name)
	model.eval()  # inference mode: disables dropout

	# Dialogue to summarize
	article_text = "..."

	# Tokenize, truncating or padding the input to 600 tokens
	input_ids = tokenizer(
	    [article_text],
	    max_length=600,
	    padding="max_length",
	    truncation=True,
	    return_tensors="pt",
	)["input_ids"]

	# Beam-search decoding; [0] selects the single sequence in the batch
	output_ids = model.generate(
	    input_ids=input_ids,
	    top_k=0,
	    num_beams=3,
	    no_repeat_ngram_size=3,
	)[0]

	summary = tokenizer.decode(output_ids, skip_special_tokens=True)
	print(summary)
	```
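
The widget example in the model card metadata writes a chat as one `Speaker: message` turn per line. A minimal sketch of preparing a chat log in that shape and summarizing it with the decoding settings listed in the metadata (`num_beams: 5`, `no_repeat_ngram_size: 4`) might look like the following. The helper names `format_dialogue` and `summarize`, the newline turn separator, and the whitespace cleanup are assumptions made for illustration; they are not part of the original card.

```python
from typing import Iterable, Tuple


def format_dialogue(turns: Iterable[Tuple[str, str]]) -> str:
    """Join (speaker, message) pairs into one 'Speaker: message' line per turn.

    Hypothetical helper: the separator and cleanup rules are assumptions
    modeled on the widget example, not a documented input format.
    """
    lines = []
    for speaker, message in turns:
        # Collapse newlines and repeated spaces inside a single message
        message = " ".join(message.split())
        if message:  # skip empty messages
            lines.append(f"{speaker.strip()}: {message}")
    return "\n".join(lines)


def summarize(dialogue: str, model_name: str = "Kirili4ik/mbart_ruDialogSum") -> str:
    """Summarize a formatted dialogue (hypothetical wrapper around the card's snippet)."""
    # Imported lazily so format_dialogue can be used without these packages
    import torch
    from transformers import AutoTokenizer, MBartForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = MBartForConditionalGeneration.from_pretrained(model_name)
    model.eval()

    inputs = tokenizer(
        [dialogue],
        max_length=600,
        truncation=True,
        return_tensors="pt",
    )
    with torch.no_grad():  # no gradients are needed for inference
        output_ids = model.generate(
            input_ids=inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            num_beams=5,             # value from the card's inference parameters
            no_repeat_ngram_size=4,  # value from the card's inference parameters
        )[0]
    return tokenizer.decode(output_ids, skip_special_tokens=True)


# Usage (downloads the model on first call):
# chat = [("Джефф", "и как я могу начать?"), ("Филипп", "ок, ок, здесь можно найти все")]
# print(summarize(format_dialogue(chat)))
```

Loading the model inside `summarize` keeps the sketch short; for repeated calls it would be cheaper to load the tokenizer and model once and reuse them.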