bhadresh-savani committed on
Commit
92b41cc
2 Parent(s): 5e5f37d ffb1729

Merge branch 'main' of https://huggingface.co/flax-community/t5-v1_1-base-wikisplit into main

Files changed (3)
  1. .gitattributes +1 -0
  2. README.md +53 -0
  3. config.json +1 -0
.gitattributes CHANGED
@@ -14,3 +14,4 @@
  *.pb filter=lfs diff=lfs merge=lfs -text
  *.pt filter=lfs diff=lfs merge=lfs -text
  *.pth filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,53 @@
+ ---
+ datasets:
+ - wiki_split
+
+ widget:
+ - text: "Mary likes to play football in her free time whenever she meets with her friends that are very nice people."
+
+ license: mit
+ ---
+ # T5 model for sentence splitting in English
+
+ Sentence splitting is the task of dividing a long sentence into multiple shorter sentences.
+ For example:
+ ```
+ Mary likes to play football in her free time whenever she meets with her friends that are very nice people.
+ ```
+ could be split into
+ ```
+ Mary likes to play football in her free time whenever she meets with her friends.
+ ```
+ ```
+ Her friends are very nice people.
+ ```
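The split above mirrors how a pair looks in the WikiSplit training data. A minimal sketch of recovering the individual sentences from a target string, assuming the corpus joins the simple sentences with a `<::::>` delimiter (the delimiter and the `split_target` helper are assumptions for illustration):

```python
# Hypothetical helper: assumes WikiSplit joins the simple sentences of one
# example into a single target string using the "<::::>" delimiter.
def split_target(target: str) -> list[str]:
    """Split a WikiSplit-style target string into individual sentences."""
    return [part.strip() for part in target.split("<::::>") if part.strip()]

sentences = split_target(
    "Mary likes to play football in her free time whenever she meets with her friends. "
    "<::::> Her friends are very nice people."
)
print(sentences)
```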
+
+ ## How to use it in your code
+ ```python
+ from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+
+ tokenizer = AutoTokenizer.from_pretrained("flax-community/t5-v1_1-base-wikisplit")
+ model = AutoModelForSeq2SeqLM.from_pretrained("flax-community/t5-v1_1-base-wikisplit")
+
+ complex_sentence = "This comedy drama is produced by Tidy , the company she co-founded in 2008 with her husband David Peet , who is managing director ."
+ sample_tokenized = tokenizer(complex_sentence, return_tensors="pt")
+
+ answer = model.generate(sample_tokenized["input_ids"], attention_mask=sample_tokenized["attention_mask"], max_length=256, num_beams=5)
+ gene_sentence = tokenizer.decode(answer[0], skip_special_tokens=True)
+ print(gene_sentence)
+
+ """
+ Output:
+ This comedy drama is produced by Tidy. She co-founded Tidy in 2008 with her husband David Peet, who is managing director.
+ """
+ ```
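The model returns the split as one decoded string. A minimal, model-free sketch of separating that string back into individual sentences, using a naive period heuristic (the `to_sentences` helper is an assumption for illustration, not part of the model; it would break on abbreviations such as "Dr."):

```python
# Naive post-processing sketch: split the generated paragraph on ". "
# boundaries and restore the trailing period on each sentence.
def to_sentences(text: str) -> list[str]:
    return [s.strip().rstrip(".") + "." for s in text.split(". ") if s.strip()]

generated = ("This comedy drama is produced by Tidy. She co-founded Tidy in 2008 "
             "with her husband David Peet, who is managing director.")
for sentence in to_sentences(generated):
    print(sentence)
```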
+ ## Datasets
+ [Wiki_Split](https://research.google/tools/datasets/wiki-split/)
+
+ ## Current baseline from the [paper](https://arxiv.org/abs/1907.12461)
+ ![baseline](./baseline.png)
+
+ ## Our Results
+ | Model | Exact | SARI | BLEU |
+ | --- | --- | --- | --- |
+ | t5-base-wikisplit | 17.93 | 67.5438 | 76.9 |
+ | t5-v1_1-base-wikisplit | 16.84 | 66.38 | 76.32 |
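A sketch of how the "Exact" column above could be computed, assuming it is the percentage of predictions that match the reference split exactly after whitespace normalization (the normalization used for the reported numbers is an assumption):

```python
# Exact-match sketch: percentage of predictions identical to their reference
# after collapsing runs of whitespace.
def exact_match(predictions: list[str], references: list[str]) -> float:
    normalize = lambda s: " ".join(s.split())
    hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
    return 100.0 * hits / len(predictions)

preds = ["Mary likes football. Her friends are nice.",
         "He went home. He slept."]
refs  = ["Mary likes football.  Her friends are nice.",
         "He went home early. He slept."]
print(exact_match(preds, refs))  # only the first pair matches
```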
config.json CHANGED
@@ -6,6 +6,7 @@
  "d_ff": 2048,
  "d_kv": 64,
  "d_model": 768,
+ "max_length": 256,
  "decoder_start_token_id": 0,
  "dropout_rate": 0.1,
  "eos_token_id": 1,