Update README.md
README.md CHANGED
````diff
@@ -173,13 +173,15 @@ long_text = "Here is a lot of text I don't want to read. Replace me"
 result = summarizer(long_text)
 print(result[0]["summary_text"])
 ```
-###
+### Beyond the basics
 
-
+There are two points to consider beyond simple inference: adjusting decoding parameters for higher-quality output, and quantization so the model devours less memory.
+
+#### Adjusting parameters
 
 Pass [other parameters related to beam search textgen](https://huggingface.co/blog/how-to-generate) when calling `summarizer` to get even higher quality results.
 
-
+#### LLM.int8 Quantization
 
 > alternate section title: how to get this monster to run inference on free Colab runtimes
 
@@ -211,6 +213,8 @@ model = AutoModelForSeq2SeqLM.from_pretrained(
 )
 ```
 
+The above is already included in the Colab demo linked at the top of the model card.
+
 Do you love to ask questions? Awesome. But first, check out the [how LLM.int8 works blog post](https://huggingface.co/blog/hf-bitsandbytes-integration) by Hugging Face.
 
 \* A more rigorous, metric-based comparison of beam-search summarization with and without LLM.int8 will be added over time.
````
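The "Adjusting parameters" subsection added above can be made concrete with a minimal sketch of passing beam-search generation parameters through the `summarizer` pipeline call. The checkpoint name is a placeholder and the parameter values are illustrative assumptions, not this card's tuned recommendations:

```python
# Generation kwargs forwarded by the pipeline to model.generate().
# Values are example assumptions, not tuned recommendations.
generation_kwargs = {
    "num_beams": 4,             # wider beam search: higher quality, slower
    "no_repeat_ngram_size": 3,  # forbid repeating any trigram in the output
    "early_stopping": True,     # stop once all beams have finished
}

if __name__ == "__main__":
    from transformers import pipeline

    # Placeholder checkpoint -- substitute the model this card describes.
    summarizer = pipeline("summarization", model="your-org/your-summarization-model")
    long_text = "Here is a lot of text I don't want to read. Replace me"
    result = summarizer(long_text, **generation_kwargs)
    print(result[0]["summary_text"])
```

Any of these keyword arguments can be varied independently; the linked how-to-generate blog post explains the quality/speed trade-offs of each.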
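Likewise, the LLM.int8 section's `from_pretrained(...)` call can be sketched as follows. This is a hedged example, not the card's exact code: the checkpoint name is a placeholder, and `load_in_8bit`/`device_map` follow the usage described in the linked bitsandbytes integration blog post:

```python
# Kwargs for 8-bit loading via bitsandbytes (LLM.int8),
# per the linked integration blog post.
int8_kwargs = {
    "load_in_8bit": True,  # store weights in int8; matmuls run in mixed precision
    "device_map": "auto",  # let accelerate place layers on available devices
}

if __name__ == "__main__":
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    # Placeholder checkpoint -- substitute the model this card describes.
    checkpoint = "your-org/your-summarization-model"
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint, **int8_kwargs)
```

With 8-bit weights the memory footprint drops roughly by half versus fp16, which is what makes inference feasible on free Colab runtimes.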