bourdoiscatie
commited on
Commit
•
b84615a
1
Parent(s):
761e144
Update README.md
Browse files
README.md
CHANGED
@@ -10,10 +10,12 @@ pipeline_tag: text2text-generation
|
|
10 |
|
11 |
## Model Description
|
12 |
|
13 |
-
We present **frenchT0**, a model for zero-shot task generalization on the French language. This is an adaptation of [bigscience/T0](https://huggingface.co/bigscience/T0) on the French language.
|
14 |
-
This model is still under development but exhibit better or equal performance for less parameters.
|
15 |
|
|
|
|
|
|
|
|
|
16 |
|
17 |
-
|
18 |
-
|
19 |
-
Th dataset used to train this model, will **soon** be released !
|
|
|
10 |
|
11 |
## Model Description
|
12 |
|
13 |
+
We present **frenchT0**, a model for zero-shot task generalization on the French language. This is an adaptation of [bigscience/T0](https://huggingface.co/bigscience/T0) on the French language.
|
|
|
14 |
|
15 |
+
The model was trained on a preliminary version of [DFP](https://huggingface.co/datasets/CATIE-AQ/DFP).
|
16 |
+
Development of this model has been stopped in favor of another (still under development) including more data (i.e. the full version of DFP) and on longer sequences (at least 8K tokens).
|
17 |
+
So no full benchmark will be conducted.
|
18 |
+
From our first observations, frenchT0 gave better or equivalent results to [mt0-base](https://huggingface.co/bigscience/mt0-base) for fewer parameters (580M vs. 300M parameters).
|
19 |
|
20 |
+
To test this model, we invite you to look at the sample prompts provided in the [DFP](https://huggingface.co/datasets/CATIE-AQ/DFP)'s card.
|
21 |
+
Text generation tasks should not give good results, but classification tasks (classification, QA, NER, POS, etc.) give interesting results.
|
|