pszemraj commited on
Commit
2f46918
1 Parent(s): 5c85d30

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -17,9 +17,9 @@ datasets:
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
  should probably proofread and complete it, then remove this comment. -->
19
 
20
- # mGPT-gpt-pierre_DS-msgs-df_Ep-2_Bs-4
21
 
22
- This model is a fine-tuned version of [sberbank-ai/mGPT](https://huggingface.co/sberbank-ai/mGPT) on the None dataset.
23
 
24
  ## Model description
25
 
@@ -48,7 +48,7 @@ The following hyperparameters were used during training:
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: cosine_with_restarts
50
  - lr_scheduler_warmup_ratio: 0.05
51
- - num_epochs: 2
52
 
53
  ### Framework versions
54
 
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
  should probably proofread and complete it, then remove this comment. -->
19
 
20
+ # mGPT: fine-tune on message data MWE
21
 
22
+ This model is a fine-tuned version of [sberbank-ai/mGPT](https://huggingface.co/sberbank-ai/mGPT) on 80k messages. Trained for one epoch, will be updated in a (separate) model repo later.
23
 
24
  ## Model description
25
 
 
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: cosine_with_restarts
50
  - lr_scheduler_warmup_ratio: 0.05
51
+ - num_epochs: 1
52
 
53
  ### Framework versions
54