SUMMARY MODEL:

Model Params Size: 60492288
Model Params Size Formatted: 60.49 M
Model Disk Size: 242030465
Model Disk Size Formatted: 242.03 MB
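
The formatted values above can be reproduced from the raw counts. The following is a minimal sketch, assuming decimal (SI) units; the helper names `format_params` and `format_bytes` are hypothetical and not taken from the original run. It also shows that the disk size is close to 4 bytes per parameter, which would be consistent with 32-bit weights plus a small amount of file overhead (an assumption, since the report does not state the dtype).

```python
def format_params(n: int) -> str:
    """Format a parameter count in millions (hypothetical helper)."""
    return f"{n / 1e6:.2f} M"


def format_bytes(n: int) -> str:
    """Format a byte count in decimal megabytes (hypothetical helper)."""
    return f"{n / 1e6:.2f} MB"


params = 60492288      # "Model Params Size" from the report
disk_bytes = 242030465  # "Model Disk Size" from the report

print(format_params(params))
print(format_bytes(disk_bytes))

# Assumption: 4 bytes per parameter (float32); the remainder is file overhead.
fp32_bytes = params * 4
print(disk_bytes - fp32_bytes)
```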


TRAINING AND VALIDATION RESULTS:

Training batch size: 4
Validation batch size: 8
Total expected epochs: 4
Total expected training steps: 15052
Total expected training steps 2: 15052
Total trained epochs: 4.0
Total trained steps: 15052
Elapsed time: 7849.268085718155 seconds
Elapsed time (formatted): 02:10:49
Total flos: 8148659183026176.0
Total flos (formatted): 8.148659e+15
Best epoch val_loss: 0.5480290651321411
Best model checkpoint: E:/000_Tesis/test_executions/pretrain_utg4java\checkpoint-15052
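
The step count and the formatted elapsed time above are consistent with the other reported values. The sketch below assumes a single device and no gradient accumulation (neither is stated in the report); `expected_steps` and `format_elapsed` are hypothetical helper names used only for illustration.

```python
import math


def expected_steps(num_rows: int, batch_size: int, epochs: int,
                   grad_accum: int = 1) -> int:
    """Optimizer steps for a run (hypothetical helper).

    Assumes one device; grad_accum=1 means no gradient accumulation.
    """
    steps_per_epoch = math.ceil(num_rows / (batch_size * grad_accum))
    return steps_per_epoch * epochs


def format_elapsed(seconds: float) -> str:
    """Format a duration in seconds as HH:MM:SS, dropping fractions."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


# 15052 training rows, batch size 4, 4 epochs (values from the report).
print(expected_steps(15052, 4, 4))
print(format_elapsed(7849.268085718155))
```

With a batch size of 4, one epoch over 15052 rows takes 3763 steps, so 4 epochs give the same number of steps as there are training rows; the match is a coincidence of the batch size equalling the epoch count.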



SUMMARY DATASETS:

Loaded Dataset:
DatasetDict({
    train: Dataset({
        features: ['text'],
        num_rows: 15052
    })
    valid: Dataset({
        features: ['text'],
        num_rows: 1881
    })
    test: Dataset({
        features: ['text'],
        num_rows: 1882
    })
})

Tokenized Dataset:
DatasetDict({
    train: Dataset({
        features: ['input_ids'],
        num_rows: 15052
    })
    valid: Dataset({
        features: ['input_ids'],
        num_rows: 1881
    })
    test: Dataset({
        features: ['input_ids'],
        num_rows: 1882
    })
})
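
The row counts above correspond to roughly an 80/10/10 split. A minimal sketch of that check, using only the counts reported here (the variable names are illustrative):

```python
# Row counts copied from the dataset summary above.
splits = {"train": 15052, "valid": 1881, "test": 1882}

total = sum(splits.values())

# Fraction of the full corpus held by each split.
fractions = {name: rows / total for name, rows in splits.items()}

for name, frac in fractions.items():
    print(f"{name}: {frac:.4f}")
```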