Shobhank-iiitdwd
/

long-t5-tglobal-base-16384-book-summary

text2text-generation

Inference Endpoints

Model card Files Files and versions Community

Shobhank-iiitdwd commited on Dec 27, 2022

Commit

913134b

•

1 Parent(s): 059bc25

Update README.md

Files changed (1) hide show

README.md +2 -8

README.md CHANGED Viewed

@@ -455,11 +455,9 @@ model-index:
-Summarize long text and get a SparkNotes-esque summary of arbitrary topics!
 - generalizes reasonably well to academic & narrative text.
-- A simple example/use case on ASR is [here](https://longt5-booksum-example.netlify.app/).
-- Example notebook in Colab (_click on the icon above_).
 ## Cheeky Proof-of-Concept
@@ -492,7 +490,7 @@ A summary of the [infamous navy seals copypasta](https://knowyourmeme.com/memes/
 ## Model description
-A fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the `kmfoda/booksum` dataset:
 - 30+ epochs of fine-tuning from the base model on V100/A100 GPUs
 - Training used 16384 token input / 1024 max output
@@ -553,10 +551,6 @@ This model was originally tuned on Google Colab with a heavily modified variant
 ## Training procedure
-### Updates:
-- July 22, 2022: updated to a fairly converged checkpoint
-- July 3, 2022: Added a new version with several epochs of additional general training that is more performant.
 ### Training hyperparameters

 - generalizes reasonably well to academic & narrative text.
 ## Cheeky Proof-of-Concept
 ## Model description
+A fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/google/long-t5-tglobal-base) on the `booksum` dataset:
 - 30+ epochs of fine-tuning from the base model on V100/A100 GPUs
 - Training used 16384 token input / 1024 max output
 ## Training procedure
 ### Training hyperparameters