jianguozhang committed
Commit fb42602 • 1 Parent(s): 19d2bf0
Update README.md
README.md CHANGED
@@ -110,7 +110,7 @@ alt="drawing" width="700"/>
 - For sampled FLAN data:
 - We follow their original data format, i.e., we did not set special tokens to separate in-context learning examples.
 - In summary:
-- We recommend you use our format and add our special tokens (such as `<USER>` and `<SYSTEM>`) to get better performance. However, you may not necessarily need to exactly follow our format if you do observe random behaviors.
+- We recommend you use our format and add our special tokens (such as `<USER>` and `<SYSTEM>`) to get better performance. However, you may not necessarily need to exactly follow our format if you do not observe random behaviors.
 - We found that T5 model series such as Flan-T5 and DialogStudio-T5 may generate repetitive tokens during inference. If you find such repetition issues, you can set the `repetition_penalty` in model.generate(), such as 1.5, to mitigate them. Note that `repetition_penalty=1.0` by default.
 # Usage
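The updated lines recommend formatting inputs with the `<USER>` and `<SYSTEM>` special tokens and raising `repetition_penalty` in `model.generate()` if outputs repeat. Below is a minimal sketch of both tips using the Hugging Face `transformers` API; the checkpoint name, prompt text, and exact token layout are illustrative assumptions, not details taken from the README.

```python
# Minimal sketch: apply the <USER>/<SYSTEM> prompt format and the repetition_penalty tip.
# Assumptions: checkpoint name, prompt text, and token placement are illustrative only.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "google/flan-t5-base"  # substitute the Flan-T5 / DialogStudio-T5 checkpoint you use
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Prompt with the recommended special tokens separating the user turn from the system turn (assumed layout).
prompt = "<USER> What time does the museum open? <SYSTEM>"
inputs = tokenizer(prompt, return_tensors="pt")

# repetition_penalty defaults to 1.0; raising it (e.g., to 1.5) mitigates repeated tokens during inference.
outputs = model.generate(**inputs, max_new_tokens=64, repetition_penalty=1.5)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```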