anhnv125 committed on
Commit 2c58f32
1 Parent(s): 13138e2

Update README.md

Files changed (1)
  1. README.md +0 -59
README.md CHANGED
@@ -1,59 +0,0 @@
- ---
- base_model: ChaiML/reward_models_100_170000000_cp_498032
- tags:
- - generated_from_trainer
- model-index:
- - name: reward-model
- results: []
- ---
-
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # reward-model
-
- This model is a fine-tuned version of [ChaiML/reward_models_100_170000000_cp_498032](https://huggingface.co/ChaiML/reward_models_100_170000000_cp_498032) on the None dataset.
- It achieves the following results on the evaluation set:
- - Loss: 0.6433
-
- ## Model description
-
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 1e-06
- - train_batch_size: 16
- - eval_batch_size: 16
- - seed: 7
- - gradient_accumulation_steps: 16
- - total_train_batch_size: 256
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: cosine
- - lr_scheduler_warmup_steps: 200
- - num_epochs: 1
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss |
- |:-------------:|:-----:|:----:|:---------------:|
- | 0.6589 | 0.68 | 200 | 0.6433 |
-
-
- ### Framework versions
-
- - Transformers 4.34.1
- - Pytorch 2.0.1+cu117
- - Datasets 2.14.6
- - Tokenizers 0.14.1
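
For readers checking the numbers in the removed card, the sketch below shows how `total_train_batch_size: 256` can follow from the listed `train_batch_size` and `gradient_accumulation_steps`, and what a cosine schedule with 200 warmup steps might look like. It is not the training code behind this commit. The single-device setting, the helper names, and the total step count (estimated from the results row, where step 200 falls at epoch 0.68) are all assumptions made for illustration.

```python
import math

# Values copied from the hyperparameter list in the removed README.
LEARNING_RATE = 1e-06
TRAIN_BATCH_SIZE = 16
GRADIENT_ACCUMULATION_STEPS = 16
WARMUP_STEPS = 200

# ASSUMPTION: one device. The card does not state the device count.
NUM_DEVICES = 1

# ASSUMPTION: total optimizer steps estimated from the results row
# (step 200 at epoch 0.68, with num_epochs = 1). The real value is not given.
ESTIMATED_TOTAL_STEPS = round(200 / 0.68)


def effective_batch_size(per_device: int, accumulation: int, devices: int = 1) -> int:
    """Examples consumed per optimizer step (hypothetical helper)."""
    return per_device * accumulation * devices


def cosine_lr_with_warmup(step: int, base_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Linear warmup to base_lr, then half-cosine decay to zero (hypothetical helper)."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    print("effective batch size:",
          effective_batch_size(TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS, NUM_DEVICES))
    for step in (0, 100, 200, 250, ESTIMATED_TOTAL_STEPS):
        lr = cosine_lr_with_warmup(step, LEARNING_RATE, WARMUP_STEPS, ESTIMATED_TOTAL_STEPS)
        print(f"step {step}: lr = {lr:.3e}")
```

If these assumptions hold, the single evaluation reported at step 200 coincides with the end of warmup, so the learning rate would have been at or below its peak for every step before that checkpoint.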