AdrienB134 commited on
Commit
884ff07
1 Parent(s): 318bd55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ tags:
15
 
16
  #### Details
17
 
18
- The model is initialized from the [ricardoz/BERTugues-base-portuguese-cased](https://huggingface.co/ricardoz/BERTugues-base-portuguese-cased) checkpoint and fine-tuned on 10M triples via pairwise softmax cross-entropy loss over the computed scores of the positive and negative passages associated to a query. It was trained on a single Tesla A100 GPU with 40GBs of memory during 200k steps with 10% of warmup steps using a batch size of 96 and the AdamW optimizer with a constant learning rate of 3e-06. Total training time was around 12 hours.
19
 
20
  #### Data
21
 
 
15
 
16
  #### Details
17
 
18
+ The model is initialized from the [ricardoz/BERTugues-base-portuguese-cased](https://huggingface.co/ricardoz/BERTugues-base-portuguese-cased) model and fine-tuned on 10M triples via pairwise softmax cross-entropy loss over the computed scores of the positive and negative passages associated to a query. It was trained on a single Tesla A100 GPU with 40GBs of memory during 200k steps with 10% of warmup steps using a batch size of 96 and the AdamW optimizer with a constant learning rate of 3e-06. Total training time was around 12 hours.
19
 
20
  #### Data
21