akaksakan committed on
Commit 0563350
1 Parent(s): 4bf648b

Model save

Files changed (5)
  1. README.md +4 -13
  2. config.json +1 -1
  3. generation_config.json +2 -1
  4. model.safetensors +1 -1
  5. training_args.bin +1 -1
README.md CHANGED
@@ -2,8 +2,6 @@
  base_model: vinai/bartpho-syllable
  tags:
  - generated_from_trainer
- metrics:
- - sacrebleu
  model-index:
  - name: vietnamese-correction-v2
    results: []
@@ -15,9 +13,6 @@ should probably proofread and complete it, then remove this comment. -->
  # vietnamese-correction-v2
 
  This model is a fine-tuned version of [vinai/bartpho-syllable](https://huggingface.co/vinai/bartpho-syllable) on an unknown dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.1848
- - Sacrebleu: 26.6603
 
  ## Model description
 
@@ -37,23 +32,19 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 1e-05
- - train_batch_size: 4
- - eval_batch_size: 4
+ - train_batch_size: 8
+ - eval_batch_size: 8
  - seed: 42
- - gradient_accumulation_steps: 8
+ - gradient_accumulation_steps: 4
  - total_train_batch_size: 32
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 0.5
  - mixed_precision_training: Native AMP
 
- ### Training results
-
-
-
  ### Framework versions
 
- - Transformers 4.41.0
+ - Transformers 4.41.1
  - Pytorch 2.3.0+cu121
  - Datasets 2.19.1
  - Tokenizers 0.19.1
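Note that the batch-size changes above cancel out: the new settings (8 per device × 4 gradient-accumulation steps) give the same effective batch size of 32 as the old ones (4 × 8). A minimal sketch of how these hyperparameters map onto Transformers training arguments follows; it is reconstructed from the README diff above, not taken from the author's training script, and the `output_dir` and `fp16` flag are assumptions.

```python
# Sketch only: updated hyperparameters from the README diff expressed as
# Transformers training arguments. Not the author's actual training code.
from transformers import Seq2SeqTrainingArguments

args = Seq2SeqTrainingArguments(
    output_dir="vietnamese-correction-v2",  # assumed, not stated in the commit
    learning_rate=1e-5,
    per_device_train_batch_size=8,          # was 4 before this commit
    per_device_eval_batch_size=8,           # was 4 before this commit
    gradient_accumulation_steps=4,          # was 8 before this commit
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=0.5,
    fp16=True,                              # "Native AMP" mixed precision (assumed flag)
)

# Effective (total) train batch size is unchanged:
# 8 per device * 4 accumulation steps * 1 device = 32
```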
config.json CHANGED
@@ -31,7 +31,7 @@
   "scale_embedding": false,
   "tokenizer_class": "BartphoTokenizer",
   "torch_dtype": "float32",
-  "transformers_version": "4.41.0",
+  "transformers_version": "4.41.1",
   "use_cache": true,
   "vocab_size": 40030
  }
generation_config.json CHANGED
@@ -1,8 +1,9 @@
  {
+ "_from_model_config": true,
  "bos_token_id": 0,
  "decoder_start_token_id": 2,
  "eos_token_id": 2,
  "forced_eos_token_id": 2,
  "pad_token_id": 1,
- "transformers_version": "4.41.0"
+ "transformers_version": "4.41.1"
  }
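The added "_from_model_config": true entry simply records that this generation config was derived from the model's config.json; the token ids above are what `generate()` uses to start, pad, and stop decoding. Below is a hedged inference sketch, assuming the repo id is `akaksakan/vietnamese-correction-v2` (committer plus model name, not stated in this commit) and using an invented example sentence.

```python
# Hedged example of loading the checkpoint saved in this commit for inference.
# The repo id and the input sentence are assumptions for illustration.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

repo_id = "akaksakan/vietnamese-correction-v2"  # assumed from committer + model name
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSeq2SeqLM.from_pretrained(repo_id)

# generation_config.json is picked up automatically; the bos/eos/pad/decoder_start
# token ids shown in the diff control how generate() begins and ends decoding.
text = "toi dang hoc tieng viet"  # hypothetical input, not from the repo
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```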
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6f5ca1ca4da42923fefb7cc5700f4db9f63676945ecdd8da81a87651247aeb9e
+ oid sha256:3c6fe161d6020bfe16cf5adf3fd5de00933dfe01d0361f60c6591b4939a786b9
  size 1583480280
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a33bc49b7e095684e80a29bf06ce52f0c0fd2cfbfe65065399dfd0ac2fc4ea71
+ oid sha256:d1382f0e19f83927c2620e91d29acd74fae3796db660c5e127af665b56cd62fd
  size 5240