Menouar commited on
Commit
9f6cdf5
1 Parent(s): ba81253

Model save

Browse files
Files changed (1) hide show
  1. README.md +4 -6
README.md CHANGED
@@ -5,8 +5,6 @@ tags:
5
  - trl
6
  - sft
7
  - generated_from_trainer
8
- datasets:
9
- - generator
10
  base_model: tiiuae/falcon-7b
11
  model-index:
12
  - name: falcon7b-linear-equations
@@ -18,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # falcon7b-linear-equations
20
 
21
- This model is a fine-tuned version of [tiiuae/falcon-7b](https://huggingface.co/tiiuae/falcon-7b) on the generator dataset.
22
 
23
  ## Model description
24
 
@@ -38,15 +36,15 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0002
41
- - train_batch_size: 5
42
  - eval_batch_size: 8
43
  - seed: 42
44
  - gradient_accumulation_steps: 2
45
- - total_train_batch_size: 10
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: constant
48
  - lr_scheduler_warmup_ratio: 0.03
49
- - training_steps: 30
50
 
51
  ### Training results
52
 
 
5
  - trl
6
  - sft
7
  - generated_from_trainer
 
 
8
  base_model: tiiuae/falcon-7b
9
  model-index:
10
  - name: falcon7b-linear-equations
 
16
 
17
  # falcon7b-linear-equations
18
 
19
+ This model is a fine-tuned version of [tiiuae/falcon-7b](https://huggingface.co/tiiuae/falcon-7b) on an unknown dataset.
20
 
21
  ## Model description
22
 
 
36
 
37
  The following hyperparameters were used during training:
38
  - learning_rate: 0.0002
39
+ - train_batch_size: 10
40
  - eval_batch_size: 8
41
  - seed: 42
42
  - gradient_accumulation_steps: 2
43
+ - total_train_batch_size: 20
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: constant
46
  - lr_scheduler_warmup_ratio: 0.03
47
+ - training_steps: 400
48
 
49
  ### Training results
50