Pclanglais commited on
Commit
478d995
1 Parent(s): 4662ef1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -19,6 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  Pleias-Topic-Detection is a finetuned version of t5-small on a set of 70,000 documents and associated topics from Common Corpus. While t5-small has been reportedly only trained in English, the model actually shows unexpected capacities for multilingual annotation. The final corpus include a significant amount of texts in French, Spanish, Italian, Dutch and German and has been proven to work somewhat in all of theses languages.
21
 
 
22
 
23
  ### Training hyperparameters
24
 
 
19
 
20
  Pleias-Topic-Detection is a finetuned version of t5-small on a set of 70,000 documents and associated topics from Common Corpus. While t5-small has been reportedly only trained in English, the model actually shows unexpected capacities for multilingual annotation. The final corpus include a significant amount of texts in French, Spanish, Italian, Dutch and German and has been proven to work somewhat in all of theses languages.
21
 
22
+ Given that Pleias-Topic-Detection is a relatively lightweight model (70 million parameters) it can be used for classification at scale on a large corpus.
23
 
24
  ### Training hyperparameters
25