dtvingg committed
Commit 97c2ad2
1 Parent(s): ad8612d

Training complete

Files changed (1)
  1. README.md +15 -14
README.md CHANGED
@@ -19,8 +19,9 @@ should probably proofread and complete it, then remove this comment. -->

 This model is a fine-tuned version of [vinai/bartpho-syllable-base](https://huggingface.co/vinai/bartpho-syllable-base) on the None dataset.
 It achieves the following results on the evaluation set:
- - Loss: 0.0237
- - Sacrebleu: 98.1436
+ - Loss: 0.3488
+ - Model Preparation Time: 0.0071
+ - Sacrebleu: 92.9401

 ## Model description

@@ -40,11 +41,11 @@ More information needed

 The following hyperparameters were used during training:
 - learning_rate: 1e-05
- - train_batch_size: 24
- - eval_batch_size: 96
+ - train_batch_size: 12
+ - eval_batch_size: 48
 - seed: 42
 - gradient_accumulation_steps: 4
- - total_train_batch_size: 96
+ - total_train_batch_size: 48
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 5
@@ -52,18 +53,18 @@ The following hyperparameters were used during training:

 ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
- |:-------------:|:-----:|:----:|:---------------:|:---------:|
- | No log | 1.0 | 179 | 0.0514 | 96.4718 |
- | No log | 2.0 | 358 | 0.0363 | 97.6428 |
- | 0.0916 | 3.0 | 537 | 0.0285 | 97.6959 |
- | 0.0916 | 4.0 | 716 | 0.0252 | 97.8422 |
- | 0.0916 | 5.0 | 895 | 0.0237 | 98.1436 |
+ | Training Loss | Epoch | Step | Validation Loss | Model Preparation Time | Sacrebleu |
+ |:-------------:|:------:|:----:|:---------------:|:----------------------:|:---------:|
+ | No log | 0.9231 | 3 | 0.7367 | 0.0071 | 89.5419 |
+ | No log | 1.8462 | 6 | 0.6013 | 0.0071 | 89.5419 |
+ | No log | 2.7692 | 9 | 0.4542 | 0.0071 | 89.5419 |
+ | No log | 4.0 | 13 | 0.3624 | 0.0071 | 92.9401 |
+ | No log | 4.6154 | 15 | 0.3488 | 0.0071 | 92.9401 |


 ### Framework versions

 - Transformers 4.46.3
- - Pytorch 2.4.0
- - Datasets 3.1.0
+ - Pytorch 2.5.1+cu121
+ - Datasets 3.2.0
 - Tokenizers 0.20.3
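
For reference, below is a minimal sketch of how the updated hyperparameters in this commit could be expressed with the `Seq2SeqTrainingArguments` API from `transformers`. This is not the author's actual training script; `output_dir`, `eval_strategy`, and `predict_with_generate` are illustrative assumptions rather than values recorded in the commit.

```python
# Minimal sketch (not the training script used for this commit): mapping the
# README's hyperparameters onto transformers' Seq2SeqTrainingArguments.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="bartpho-finetuned",   # hypothetical output path, not from the commit
    learning_rate=1e-5,
    per_device_train_batch_size=12,   # train_batch_size in the card
    per_device_eval_batch_size=48,    # eval_batch_size in the card
    gradient_accumulation_steps=4,    # 12 * 4 = 48 total train batch size
    num_train_epochs=5,
    lr_scheduler_type="linear",
    optim="adamw_torch",
    seed=42,
    eval_strategy="epoch",            # assumption; the card only reports per-epoch evaluations
    predict_with_generate=True,       # generate predictions so SacreBLEU can be computed
)
```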