metadata
license: apache-2.0
base_model: bedus-creation/mBart-small-dataset-i-eng-lim
tags:
- generated_from_keras_callback
model-index:
- name: bedus-creation/mBart-small-dataset-ii-eng-lim-004
results: []
bedus-creation/mBart-small-dataset-ii-eng-lim-004
This model is a fine-tuned version of bedus-creation/mBart-small-dataset-i-eng-lim on an unknown dataset. It achieves the following results on the evaluation set:
- Train Loss: 0.1961
- Validation Loss: 0.3744
- Epoch: 99
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 1e-04, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32
Training results
Train Loss | Validation Loss | Epoch |
---|---|---|
0.9940 | 0.4653 | 0 |
0.4659 | 0.3647 | 1 |
0.4011 | 0.3331 | 2 |
0.3798 | 0.3284 | 3 |
0.3640 | 0.3210 | 4 |
0.3539 | 0.3087 | 5 |
0.3456 | 0.3106 | 6 |
0.3377 | 0.3049 | 7 |
0.3340 | 0.2998 | 8 |
0.3285 | 0.2974 | 9 |
0.3246 | 0.2980 | 10 |
0.3202 | 0.2950 | 11 |
0.3174 | 0.2910 | 12 |
0.3154 | 0.2932 | 13 |
0.3124 | 0.2882 | 14 |
0.3094 | 0.2895 | 15 |
0.3092 | 0.2880 | 16 |
0.3073 | 0.2861 | 17 |
0.3043 | 0.2842 | 18 |
0.3037 | 0.2856 | 19 |
0.3009 | 0.2834 | 20 |
0.2999 | 0.2859 | 21 |
0.2983 | 0.2836 | 22 |
0.2973 | 0.2809 | 23 |
0.2952 | 0.2825 | 24 |
0.2942 | 0.2809 | 25 |
0.2933 | 0.2792 | 26 |
0.2914 | 0.2813 | 27 |
0.2898 | 0.2817 | 28 |
0.2884 | 0.2794 | 29 |
0.2866 | 0.2797 | 30 |
0.2853 | 0.2797 | 31 |
0.2849 | 0.2844 | 32 |
0.2835 | 0.2798 | 33 |
0.2821 | 0.2803 | 34 |
0.2823 | 0.2828 | 35 |
0.2798 | 0.2796 | 36 |
0.2797 | 0.2788 | 37 |
0.2766 | 0.2811 | 38 |
0.2765 | 0.2800 | 39 |
0.2747 | 0.2852 | 40 |
0.2731 | 0.2825 | 41 |
0.2720 | 0.2841 | 42 |
0.2709 | 0.2855 | 43 |
0.2693 | 0.2843 | 44 |
0.2678 | 0.2863 | 45 |
0.2667 | 0.2912 | 46 |
0.2645 | 0.2863 | 47 |
0.2633 | 0.2862 | 48 |
0.2618 | 0.2881 | 49 |
0.2607 | 0.2890 | 50 |
0.2585 | 0.2928 | 51 |
0.2585 | 0.2903 | 52 |
0.2562 | 0.2904 | 53 |
0.2545 | 0.2902 | 54 |
0.2541 | 0.2937 | 55 |
0.2528 | 0.2930 | 56 |
0.2512 | 0.3014 | 57 |
0.2484 | 0.2979 | 58 |
0.2478 | 0.3002 | 59 |
0.2460 | 0.3034 | 60 |
0.2449 | 0.3000 | 61 |
0.2442 | 0.3010 | 62 |
0.2418 | 0.3054 | 63 |
0.2399 | 0.3046 | 64 |
0.2395 | 0.3072 | 65 |
0.2374 | 0.3117 | 66 |
0.2368 | 0.3081 | 67 |
0.2351 | 0.3149 | 68 |
0.2334 | 0.3155 | 69 |
0.2335 | 0.3123 | 70 |
0.2310 | 0.3193 | 71 |
0.2296 | 0.3169 | 72 |
0.2277 | 0.3220 | 73 |
0.2275 | 0.3200 | 74 |
0.2248 | 0.3223 | 75 |
0.2253 | 0.3235 | 76 |
0.2224 | 0.3266 | 77 |
0.2225 | 0.3289 | 78 |
0.2201 | 0.3288 | 79 |
0.2188 | 0.3330 | 80 |
0.2158 | 0.3389 | 81 |
0.2157 | 0.3379 | 82 |
0.2145 | 0.3447 | 83 |
0.2135 | 0.3436 | 84 |
0.2128 | 0.3525 | 85 |
0.2116 | 0.3464 | 86 |
0.2104 | 0.3494 | 87 |
0.2081 | 0.3540 | 88 |
0.2071 | 0.3561 | 89 |
0.2059 | 0.3598 | 90 |
0.2043 | 0.3608 | 91 |
0.2032 | 0.3721 | 92 |
0.2027 | 0.3668 | 93 |
0.2022 | 0.3608 | 94 |
0.2012 | 0.3675 | 95 |
0.1997 | 0.3695 | 96 |
0.1974 | 0.3703 | 97 |
0.1953 | 0.3704 | 98 |
0.1961 | 0.3744 | 99 |
Framework versions
- Transformers 4.33.3
- TensorFlow 2.13.0
- Datasets 2.14.5
- Tokenizers 0.13.3