File size: 841 Bytes
a32cbb4
 
 
914cb13
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
This checkpoint has been trained with the Turkish part of the [MLSUM dataset](https://huggingface.co/datasets/mlsum) where google/mt5 is the main Pre-trained checkpoint. [SimpleT5](https://github.com/Shivanandroy/simpleT5) library is used for training. 

Here is the code snippet for training

```
model = SimpleT5()
model.from_pretrained("mt5","google/mt5-small")

model.train(train_df=train2, # pandas dataframe with 2 columns: source_text & target_text
            eval_df=validation2, # pandas dataframe with 2 columns: source_text & target_text
            source_max_token_len = 512, 
            target_max_token_len = 128,
            batch_size = 8,
            max_epochs = 5,
            use_gpu = True,
            outputdir = "mt5_mlsum_turkish",
            early_stopping_patience_epochs = 0,
            precision = 32
)
```