---
language:
- en
license: apache-2.0
tags:
- dialogue policy
- task-oriented dialog
datasets:
- ConvLab/multiwoz21

---

# ddpt-policy-sgd

This is a DDPT model (https://aclanthology.org/2022.coling-1.21/) trained on [MultiWOZ 2.1](https://huggingface.co/datasets/ConvLab/multiwoz21).

Refer to [ConvLab-3](https://github.com/ConvLab/ConvLab-3) for model description and usage.
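
As a minimal sketch, the checkpoint files can be downloaded with `huggingface_hub` before being loaded through ConvLab-3; the `repo_id` below is assumed from this card's title, and the actual policy construction should follow the ConvLab-3 documentation linked above.

```python
# Minimal sketch: fetch the checkpoint files with huggingface_hub.
# The repo_id is assumed from this model card's title; building and running
# the DDPT policy itself is handled by ConvLab-3.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="ConvLab/ddpt-policy-sgd")
print(f"Checkpoint files downloaded to: {local_dir}")
```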

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 64
- seed: 1
- optimizer: Adam
- num_epochs: 40
- the checkpoint that performed best on the validation set was used
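
For illustration only, the optimizer settings above correspond to the following PyTorch configuration; `policy_network` is a placeholder module, since the real DDPT policy and training loop are provided by ConvLab-3.

```python
# Illustrative sketch of the hyperparameters listed above, not the actual
# ConvLab-3 training code.
import torch

torch.manual_seed(1)                                  # seed: 1
policy_network = torch.nn.Linear(16, 16)              # placeholder for the DDPT policy
optimizer = torch.optim.Adam(policy_network.parameters(), lr=1e-5)  # Adam, learning_rate 1e-05
# Training ran for 40 epochs with train_batch_size 64; the checkpoint with the
# best validation performance was kept.
```
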
### Framework versions
- Transformers 4.18.0
- PyTorch 1.10.2+cu111