Nhut commited on
Commit
2849a1c
1 Parent(s): afef19d

Training in progress, step 800

Browse files
Files changed (2) hide show
  1. README.md +8 -8
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -20,12 +20,12 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - eval_loss: 1.2890
24
- - eval_runtime: 164.2332
25
- - eval_samples_per_second: 0.67
26
- - eval_steps_per_second: 0.335
27
- - epoch: 1.2270
28
- - step: 600
29
 
30
  ## Model description
31
 
@@ -45,8 +45,8 @@ More information needed
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 0.0002
48
- - train_batch_size: 2
49
- - eval_batch_size: 2
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: constant
 
20
 
21
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - eval_loss: 1.4180
24
+ - eval_runtime: 36.1138
25
+ - eval_samples_per_second: 3.24
26
+ - eval_steps_per_second: 0.831
27
+ - epoch: 3.0534
28
+ - step: 800
29
 
30
  ## Model description
31
 
 
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 0.0002
48
+ - train_batch_size: 4
49
+ - eval_batch_size: 4
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
  - lr_scheduler_type: constant
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b2bbedea91e98591925278203fb69f78e479281e6612ab5d674b691d899d7c9
3
  size 2806378968
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:953bc92ccfbed0a8b53281d0cb39d4a808a9a5d6efbc4b22c9cb7e92ee838b76
3
  size 2806378968