Jiva
/

xlm-roberta-large-it-mnli

@@ -10,8 +10,8 @@ license: mit
 pipeline_tag: zero-shot-classification
 widget:
 - text: "La seconda guerra mondiale vide contrapporsi, tra il 1939 e il 1945, le cosiddette potenze dell'Asse e gli Alleati che, come già accaduto ai belligeranti della prima guerra mondiale, si combatterono su gran parte del pianeta; il conflitto ebbe inizio il 1º settembre 1939 con l'attacco della Germania nazista alla Polonia e terminò, nel teatro europeo, l'8 maggio 1945 con la resa tedesca e, in quello asiatico, il successivo 2 settembre con la resa dell'Impero giapponese dopo i bombardamenti atomici di Hiroshima e Nagasaki."
-  candidate_labels: "storia, geografia, moda, politica, macchine, cibo"
-  multi_class: true
 ---
 # XLM-roBERTa-large-it-mnli
@@ -37,7 +37,7 @@ The model can be loaded with the `zero-shot-classification` pipeline like so:
 ```python
 from transformers import pipeline
 classifier = pipeline("zero-shot-classification",
-                      model="Jiva/xlm-roberta-large-it-mnli", device=0, use_fast=True)
 ```
 You can then classify in any of the above languages. You can even pass the labels in one language and the sequence to
 classify in another:
@@ -48,17 +48,15 @@ sequence_to_classify = "La Sardegna è una regione italiana a statuto speciale d
 candidate_labels = ["geografia", "politica", "macchine", "cibo", "moda"]
 classifier(sequence_to_classify, candidate_labels)
 # {'labels': ['geografia', 'moda', 'politica', 'macchine', 'cibo'],
-#  'scores': [0.5027586221694946, 0.19790762662887573, 0.1900099515914917, 0.10961027443408966, 0.07802766561508179]}
 ```
-The default hypothesis template is the English, `This text is {}`. If you are working strictly within one language, it
-may be worthwhile to translate this to the language you are working with:
 ```python
 sequence_to_classify = "La Sardegna è una regione italiana a statuto speciale di 1 592 730 abitanti con capoluogo Cagliari, la cui denominazione bilingue utilizzata nella comunicazione ufficiale è Regione Autonoma della Sardegna / Regione Autònoma de Sardigna."
-candidate_labels = ["geografia", "politica", "macchine", "cibo", "moda"]
-hypothesis_template = "si parla di {}""
 classifier(sequence_to_classify, candidate_labels, hypothesis_template=hypothesis_template)
-# {'labels': ['geografia', 'moda', 'politica', 'macchine', 'cibo'],
-#  'scores': [0.5027586221694946, 0.19790762662887573, 0.1900099515914917, 0.10961027443408966, 0.07802766561508179]}
 ```
 #### With manual PyTorch
 ```python
@@ -67,7 +65,7 @@ from transformers import AutoModelForSequenceClassification, AutoTokenizer
 nli_model = AutoModelForSequenceClassification.from_pretrained('Jiva/xlm-roberta-large-it-mnli')
 tokenizer = AutoTokenizer.from_pretrained('Jiva/xlm-roberta-large-it-mnli')
 premise = sequence
-hypothesis = f'si parla di{ label}.'
 # run through model pre-trained on MNLI
 x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                      truncation_strategy='only_first')
@@ -81,7 +79,17 @@ prob_label_is_true = probs[:,1]
 ## Training
 ## Version 0.1
-The model has been now retrained on the full training set. Around 1000 sentences pairs have been removed from the set bacause their translation was botched by the translation model.
 ## Version 0.0
 This model was pre-trained on set of 100 languages, as described in

 pipeline_tag: zero-shot-classification
 widget:
 - text: "La seconda guerra mondiale vide contrapporsi, tra il 1939 e il 1945, le cosiddette potenze dell'Asse e gli Alleati che, come già accaduto ai belligeranti della prima guerra mondiale, si combatterono su gran parte del pianeta; il conflitto ebbe inizio il 1º settembre 1939 con l'attacco della Germania nazista alla Polonia e terminò, nel teatro europeo, l'8 maggio 1945 con la resa tedesca e, in quello asiatico, il successivo 2 settembre con la resa dell'Impero giapponese dopo i bombardamenti atomici di Hiroshima e Nagasaki."
+candidate_labels: "storia, geografia, moda, politica, macchine, cibo"
+multi_class: true
 ---
 # XLM-roBERTa-large-it-mnli
 ```python
 from transformers import pipeline
 classifier = pipeline("zero-shot-classification",
+                      model="Jiva/xlm-roberta-large-it-mnli", device=0, use_fast=True, multi_label=True)
 ```
 You can then classify in any of the above languages. You can even pass the labels in one language and the sequence to
 classify in another:
 candidate_labels = ["geografia", "politica", "macchine", "cibo", "moda"]
 classifier(sequence_to_classify, candidate_labels)
 # {'labels': ['geografia', 'moda', 'politica', 'macchine', 'cibo'],
+# 'scores': [0.38871392607688904, 0.22633370757102966, 0.19398456811904907, 0.13735772669315338, 0.13708525896072388]}
 ```
+The default hypothesis template is the English, `This text is {}`. With this model better results are achieving when providing a translated template:
 ```python
 sequence_to_classify = "La Sardegna è una regione italiana a statuto speciale di 1 592 730 abitanti con capoluogo Cagliari, la cui denominazione bilingue utilizzata nella comunicazione ufficiale è Regione Autonoma della Sardegna / Regione Autònoma de Sardigna."
+candidate_labels = ["geografia", "politica", "macchine", "cibo", "moda"]"
+hypothesis_template = "si parla di {}"
 classifier(sequence_to_classify, candidate_labels, hypothesis_template=hypothesis_template)
+'scores': [0.6068345904350281, 0.34715887904167175, 0.32433947920799255, 0.3068877160549164, 0.18744681775569916]}
 ```
 #### With manual PyTorch
 ```python
 nli_model = AutoModelForSequenceClassification.from_pretrained('Jiva/xlm-roberta-large-it-mnli')
 tokenizer = AutoTokenizer.from_pretrained('Jiva/xlm-roberta-large-it-mnli')
 premise = sequence
+hypothesis = f'si parla di {}.'
 # run through model pre-trained on MNLI
 x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                      truncation_strategy='only_first')
 ## Training
 ## Version 0.1
+The model has been now retrained on the full training set. Around 1000 sentences pairs have been removed from the set because their translation was botched by the translation model.
+| metric          	| value 	|
+|-----------------	|-------	|
+| learnin_rate    	| 4e-6  	|
+| optimizer       	| AdamW 	|
+| batch_size      	| 80    	|
+| mcc             	| 0.77  	|
+| train_loss      	| 0.34  	|
+| eval_loss       	| 0.40  	|
+| stopped_at_step 	| 9754  	|
 ## Version 0.0
 This model was pre-trained on set of 100 languages, as described in