UBC-NLP
/

AraT5v2-base-1024

Arabic Machine Translation

Arabic Text Summarization

Arabic News Title and Question Generation

Arabic Paraphrasing and Transliteration

Arabic Code-Switched Translation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

elmadany commited on May 1

Commit

7b240e6

•

1 Parent(s): 8f1fa54

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -66,6 +66,7 @@ If you use our models (AraT5v2-base-1024, AraT5-base, AraT5-msa-base, AraT5-twee
     pages = "628--647",
     abstract = "Transfer learning with a unified Transformer framework (T5) that converts all language problems into a text-to-text format was recently proposed as a simple and effective transfer learning approach. Although a multilingual version of the T5 model (mT5) was also introduced, it is not clear how well it can fare on non-English tasks involving diverse data. To investigate this question, we apply mT5 on a language with a wide variety of dialects{--}Arabic. For evaluation, we introduce a novel benchmark for ARabic language GENeration (ARGEN), covering seven important tasks. For model comparison, we pre-train three powerful Arabic T5-style models and evaluate them on ARGEN. Although pre-trained with {\textasciitilde}49 less data, our new models perform significantly better than mT5 on all ARGEN tasks (in 52 out of 59 test sets) and set several new SOTAs. Our models also establish new SOTA on the recently-proposed, large Arabic language understanding evaluation benchmark ARLUE (Abdul-Mageed et al., 2021). Our new models are publicly available. We also link to ARGEN datasets through our repository: https://github.com/UBC-NLP/araT5.",
 }
 ``` bibtex
 @inproceedings{elmadany-etal-2023-octopus,
@@ -81,4 +82,5 @@ If you use our models (AraT5v2-base-1024, AraT5-base, AraT5-msa-base, AraT5-twee
     url = "https://aclanthology.org/2023.arabicnlp-1.20",
     doi = "10.18653/v1/2023.arabicnlp-1.20",
     pages = "232--243",
-}

     pages = "628--647",
     abstract = "Transfer learning with a unified Transformer framework (T5) that converts all language problems into a text-to-text format was recently proposed as a simple and effective transfer learning approach. Although a multilingual version of the T5 model (mT5) was also introduced, it is not clear how well it can fare on non-English tasks involving diverse data. To investigate this question, we apply mT5 on a language with a wide variety of dialects{--}Arabic. For evaluation, we introduce a novel benchmark for ARabic language GENeration (ARGEN), covering seven important tasks. For model comparison, we pre-train three powerful Arabic T5-style models and evaluate them on ARGEN. Although pre-trained with {\textasciitilde}49 less data, our new models perform significantly better than mT5 on all ARGEN tasks (in 52 out of 59 test sets) and set several new SOTAs. Our models also establish new SOTA on the recently-proposed, large Arabic language understanding evaluation benchmark ARLUE (Abdul-Mageed et al., 2021). Our new models are publicly available. We also link to ARGEN datasets through our repository: https://github.com/UBC-NLP/araT5.",
 }
+```
 ``` bibtex
 @inproceedings{elmadany-etal-2023-octopus,
     url = "https://aclanthology.org/2023.arabicnlp-1.20",
     doi = "10.18653/v1/2023.arabicnlp-1.20",
     pages = "232--243",
+}
+```