ITG
/

DialoGPT-medium-spanish-chitchat

@@ -66,6 +66,20 @@ for i in range(CHAT_TURNS):
 ---
 ## Fine-tuning in a different dataset or style
 If you want to fine-tune this model, we recommend you to start from the [DialoGPT model](https://huggingface.co/microsoft/DialoGPT-medium).

 ---
+## Fine-tuning hyper-parameters
+|            **Hyper-parameter**           |          **Value**          |
+|:----------------------------------------:|:---------------------------:|
+|         Validation partition (%)         |             20%             |
+|            Training batch size           |              8              |
+|               Learning rate              |             5e-4            |
+|            Max training epochs           |              20             |
+|         Warmup training steps (%)        |             6%              |
+|               Weight decay               |             0.01            |
+| Optimiser (beta1, beta2, epsilon) | AdamW (0.9, 0.999, 1e-08) |
+|  Monitoring metric (delta, patience)     |   validation loss (0.1, 3)  |
 ## Fine-tuning in a different dataset or style
 If you want to fine-tune this model, we recommend you to start from the [DialoGPT model](https://huggingface.co/microsoft/DialoGPT-medium).