---
license: apache-2.0
base_model: google-bert/bert-base-multilingual-cased
tags:
- generated_from_trainer
metrics:
- accuracy
- f1
- precision
- recall
model-index:
- name: bert-base-multi-class-classification-cs
  results: []
---
# bert-base-multi-class-classification-cs
This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset. It achieves the following results on the evaluation set (a minimal inference sketch follows the results list):
- Loss: 0.0385
- Accuracy: 0.9942
- F1: 0.9942
- Precision: 0.9942
- Recall: 0.9942
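
Since the card does not include a usage snippet, here is a minimal inference sketch using the Transformers `pipeline` API. The repo id `your-username/bert-base-multi-class-classification-cs` and the Czech example sentence are placeholders, not from this card, and the label names depend on the (unstated) training dataset:

```python
from transformers import pipeline

# Hypothetical repo id -- replace with the actual model location.
classifier = pipeline(
    "text-classification",
    model="your-username/bert-base-multi-class-classification-cs",
)

# The "-cs" suffix suggests Czech-language inputs; this input is a
# placeholder example, and the returned label set is dataset-specific.
print(classifier("Toto je ukázkový text pro klasifikaci."))
# -> [{'label': '...', 'score': ...}]
```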
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure

### Training hyperparameters
The following hyperparameters were used during training (see the sketch after this list):
- learning_rate: 5e-05
- train_batch_size: 16
- eval_batch_size: 32
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 100
- num_epochs: 3
- mixed_precision_training: Native AMP
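
As a rough guide, these settings map onto `transformers.TrainingArguments` as sketched below. This is an assumption-based reconstruction, not the exact training script: the output directory name is a placeholder, and the Adam betas/epsilon and linear scheduler are the Trainer defaults, shown only for completeness.

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="bert-base-multi-class-classification-cs",  # placeholder
    learning_rate=5e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=32,
    seed=42,
    lr_scheduler_type="linear",  # Trainer default, listed for clarity
    warmup_steps=100,
    num_train_epochs=3,
    fp16=True,  # "Native AMP" mixed-precision training
)
```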
### Training results
| Training Loss | Epoch  | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
|:-------------:|:------:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
| 3.3053        | 0.0595 | 50   | 3.1767          | 0.0871   | 0.0306 | 0.0333    | 0.0869 |
| 2.8293        | 0.1190 | 100  | 2.1807          | 0.4647   | 0.3675 | 0.5055    | 0.4651 |
| 1.6887        | 0.1786 | 150  | 1.1300          | 0.7705   | 0.7362 | 0.7621    | 0.7722 |
| 0.9012        | 0.2381 | 200  | 0.6245          | 0.8321   | 0.8086 | 0.8549    | 0.8356 |
| 0.5237        | 0.2976 | 250  | 0.3510          | 0.9129   | 0.9029 | 0.9007    | 0.9147 |
| 0.3115        | 0.3571 | 300  | 0.2218          | 0.9512   | 0.9517 | 0.9545    | 0.9518 |
| 0.2217        | 0.4167 | 350  | 0.1746          | 0.9382   | 0.9284 | 0.9596    | 0.9388 |
| 0.1464        | 0.4762 | 400  | 0.1210          | 0.9729   | 0.9731 | 0.9749    | 0.9730 |
| 0.1201        | 0.5357 | 450  | 0.0977          | 0.9804   | 0.9805 | 0.9810    | 0.9805 |
| 0.0921        | 0.5952 | 500  | 0.1212          | 0.9722   | 0.9721 | 0.9741    | 0.9718 |
| 0.1061        | 0.6548 | 550  | 0.1224          | 0.9726   | 0.9728 | 0.9750    | 0.9728 |
| 0.0996        | 0.7143 | 600  | 0.0812          | 0.9817   | 0.9815 | 0.9819    | 0.9816 |
| 0.1196        | 0.7738 | 650  | 0.0726          | 0.9859   | 0.9858 | 0.9862    | 0.9857 |
| 0.101         | 0.8333 | 700  | 0.0711          | 0.9853   | 0.9854 | 0.9856    | 0.9853 |
| 0.1159        | 0.8929 | 750  | 0.1012          | 0.9792   | 0.9795 | 0.9803    | 0.9796 |
| 0.086         | 0.9524 | 800  | 0.0693          | 0.9870   | 0.9871 | 0.9874    | 0.9870 |
| 0.0742        | 1.0119 | 850  | 0.0619          | 0.9885   | 0.9886 | 0.9888    | 0.9885 |
| 0.0713        | 1.0714 | 900  | 0.0517          | 0.9896   | 0.9896 | 0.9897    | 0.9896 |
| 0.02          | 1.1310 | 950  | 0.0707          | 0.9869   | 0.9870 | 0.9874    | 0.9870 |
| 0.038         | 1.1905 | 1000 | 0.0455          | 0.9920   | 0.9920 | 0.9920    | 0.9919 |
| 0.0378        | 1.2500 | 1050 | 0.0485          | 0.9906   | 0.9906 | 0.9906    | 0.9906 |
| 0.0257        | 1.3095 | 1100 | 0.0452          | 0.9921   | 0.9921 | 0.9922    | 0.9921 |
| 0.0454        | 1.3690 | 1150 | 0.0494          | 0.9905   | 0.9905 | 0.9906    | 0.9905 |
| 0.0174        | 1.4286 | 1200 | 0.0404          | 0.9923   | 0.9922 | 0.9922    | 0.9922 |
| 0.0425        | 1.4881 | 1250 | 0.0627          | 0.9879   | 0.9877 | 0.9879    | 0.9877 |
| 0.0489        | 1.5476 | 1300 | 0.0525          | 0.9908   | 0.9907 | 0.9907    | 0.9907 |
| 0.0816        | 1.6071 | 1350 | 0.0439          | 0.9918   | 0.9918 | 0.9919    | 0.9917 |
| 0.0375        | 1.6667 | 1400 | 0.0434          | 0.9921   | 0.9920 | 0.9920    | 0.9921 |
| 0.0435        | 1.7262 | 1450 | 0.0368          | 0.9929   | 0.9928 | 0.9929    | 0.9929 |
| 0.0285        | 1.7857 | 1500 | 0.0364          | 0.9935   | 0.9934 | 0.9935    | 0.9934 |
| 0.0222        | 1.8452 | 1550 | 0.0332          | 0.9942   | 0.9942 | 0.9943    | 0.9941 |
| 0.0311        | 1.9048 | 1600 | 0.0394          | 0.9929   | 0.9929 | 0.9930    | 0.9929 |
| 0.0269        | 1.9643 | 1650 | 0.0359          | 0.9935   | 0.9934 | 0.9935    | 0.9934 |
| 0.0258        | 2.0238 | 1700 | 0.0326          | 0.9937   | 0.9937 | 0.9938    | 0.9937 |
| 0.0046        | 2.0833 | 1750 | 0.0324          | 0.9945   | 0.9944 | 0.9945    | 0.9944 |
| 0.0152        | 2.1429 | 1800 | 0.0329          | 0.9946   | 0.9946 | 0.9948    | 0.9946 |
| 0.024         | 2.2024 | 1850 | 0.0305          | 0.9948   | 0.9947 | 0.9948    | 0.9947 |
| 0.0212        | 2.2619 | 1900 | 0.0333          | 0.9943   | 0.9943 | 0.9943    | 0.9943 |
| 0.0029        | 2.3214 | 1950 | 0.0322          | 0.9937   | 0.9937 | 0.9938    | 0.9937 |
| 0.0114        | 2.3810 | 2000 | 0.0342          | 0.9940   | 0.9940 | 0.9941    | 0.9940 |
| 0.0115        | 2.4405 | 2050 | 0.0328          | 0.9942   | 0.9941 | 0.9942    | 0.9941 |
| 0.0182        | 2.5000 | 2100 | 0.0333          | 0.9937   | 0.9937 | 0.9938    | 0.9936 |
| 0.01          | 2.5595 | 2150 | 0.0314          | 0.9940   | 0.9940 | 0.9941    | 0.9940 |
| 0.0205        | 2.6190 | 2200 | 0.0325          | 0.9937   | 0.9937 | 0.9938    | 0.9937 |
| 0.0173        | 2.6786 | 2250 | 0.0335          | 0.9940   | 0.9940 | 0.9941    | 0.9940 |
| 0.0093        | 2.7381 | 2300 | 0.0340          | 0.9942   | 0.9942 | 0.9943    | 0.9941 |
| 0.0151        | 2.7976 | 2350 | 0.0327          | 0.9946   | 0.9947 | 0.9947    | 0.9946 |
| 0.0077        | 2.8571 | 2400 | 0.0321          | 0.9948   | 0.9948 | 0.9949    | 0.9948 |
| 0.0074        | 2.9167 | 2450 | 0.0311          | 0.9946   | 0.9946 | 0.9947    | 0.9946 |
| 0.0021        | 2.9762 | 2500 | 0.0310          | 0.9946   | 0.9946 | 0.9947    | 0.9946 |
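
The reported Accuracy, F1, Precision, and Recall track each other closely, which is consistent with averaged multi-class metrics. Below is a minimal `compute_metrics` sketch in the style of `generated_from_trainer` scripts, assuming scikit-learn is available; the card does not state the averaging mode, so `average="weighted"` is an assumption:

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def compute_metrics(eval_pred):
    # eval_pred is the (logits, labels) pair the Trainer passes in.
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    # "weighted" averaging is an assumption; the card does not state it.
    precision, recall, f1, _ = precision_recall_fscore_support(
        labels, preds, average="weighted"
    )
    return {
        "accuracy": accuracy_score(labels, preds),
        "f1": f1,
        "precision": precision,
        "recall": recall,
    }
```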
### Framework versions
- Transformers 4.40.1
- Pytorch 2.2.1+cu121
- Datasets 2.19.1
- Tokenizers 0.19.1