This is a DDPT model (https://aclanthology.org/2022.coling-1.21/) trained on MultiWOZ 2.1
Refer to ConvLab-3 for model description and usage.
The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 64
- seed: 1
- optimizer: Adam
- num_epochs: 40
- use checkpoint which performed best on validation set
- Transformers 4.18.0
- Pytorch 1.10.2+cu111
- Downloads last month
Unable to determine this model’s pipeline type. Check the docs .