MT5 release
The MT5 release follows the T5 family but is pretrained on multilingual data. The updated UMT5 models are pretrained on an updated corpus.
Collection by google May 14 12
google/mt5-base Text2Text Generation • Updated Jan 24, 2023 • 382k • 166
google/mt5-large Text2Text Generation • Updated Jan 24, 2023 • 37.7k • 77
google/umt5-small Text2Text Generation • Updated Jul 6, 2023 • 9.44k • 18
google/umt5-xl Text2Text Generation • Updated Jul 3, 2023 • 3.46k • 13
My work Collection by quchenyuan Sep 12, 2023 - Multi-view Self-supervised Disentanglement for General Image Denoising Paper • 2309.05049 • Published Sep 10, 2023 • 1
AI_MultiModal Collection by ET01 Oct 12, 2023 - Sleeping 122 ⚡ Qwen VL liuhaotian/llava-v1.5-13b Image-Text-to-Text • Updated May 9 • 158k • 435
T5 release
The original T5 transformer release was done in two steps: the original T5 checkpoints and the improved T5v1.
Collection by google May 14 10
google-t5/t5-base Translation • Updated Feb 14 • 3.73M • 494
google-t5/t5-small Translation • Updated Jun 30, 2023 • 5.61M • 276
google-t5/t5-large Translation • Updated Apr 6, 2023 • 526k • 158
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Paper • 1910.10683 • Published Oct 23, 2019 • 6
nlp Collection by netapy Sep 12, 2023 - When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale Paper • 2309.04564 • Published Sep 8, 2023 • 14
Flan-T5 release
The Flan-T5 release covers 4 checkpoints of different sizes each time. It also includes upgraded versions trained using Universal sampling.
Collection by google May 14 14
google/flan-t5-small Text2Text Generation • Updated Oct 10, 2023 • 290k • 217
google/flan-t5-base Text2Text Generation • Updated Jul 17, 2023 • 1.52M • 731
google/flan-t5-large Text2Text Generation • Updated Jul 17, 2023 • 7.17M • 475
google/flan-t5-xxl Text2Text Generation • Updated Jul 27, 2023 • 402k • 1.13k
aa Collection by liankafohali Sep 12, 2023 - csukuangfj/sherpa-ncnn-streaming-zipformer-small-bilingual-zh-en-2023-02-16 Updated May 23, 2023 • 3
german models Collection by darksmile92 Sep 12, 2023 - jphme/Llama-2-13b-chat-german Text Generation • Updated Oct 6, 2023 • 6.73k • 60