AlekseyKorshuk committed on
Commit
c35a24f
1 Parent(s): c810fd6

update model card README.md

Files changed (1)
  1. README.md +8 -18
README.md CHANGED
@@ -2,23 +2,11 @@
  license: other
  tags:
  - generated_from_trainer
- datasets:
- - AlekseyKorshuk/amazon-reviews-input-output
  metrics:
  - accuracy
  model-index:
  - name: amazon-reviews-input-output
-   results:
-   - task:
-       name: Causal Language Modeling
-       type: text-generation
-     dataset:
-       name: AlekseyKorshuk/amazon-reviews-input-output
-       type: AlekseyKorshuk/amazon-reviews-input-output
-     metrics:
-     - name: Accuracy
-       type: accuracy
-       value: 0.08526422764227642
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -26,10 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
 
  # amazon-reviews-input-output
 
- This model is a fine-tuned version of [facebook/opt-350m](https://huggingface.co/facebook/opt-350m) on the AlekseyKorshuk/amazon-reviews-input-output dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2056
- - Accuracy: 0.0853
 
  ## Model description
 
@@ -49,9 +37,11 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
- - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 1.0
@@ -61,7 +51,7 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 1.9481 | 0.78 | 25 | 0.2152 | 0.0850 |
 
 
  ### Framework versions
 
  license: other
  tags:
  - generated_from_trainer
  metrics:
  - accuracy
  model-index:
  - name: amazon-reviews-input-output
+   results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
 
  # amazon-reviews-input-output
 
+ This model is a fine-tuned version of [facebook/opt-350m](https://huggingface.co/facebook/opt-350m) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.2104
+ - Accuracy: 0.0852
 
  ## Model description
 
 
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
+ - train_batch_size: 4
  - eval_batch_size: 32
  - seed: 42
+ - gradient_accumulation_steps: 8
+ - total_train_batch_size: 32
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 1.0
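
Note on the batch-size lines in this hunk: the per-step batch drops from 32 to 4, but with 8 gradient accumulation steps the reported total_train_batch_size stays at 32. The sketch below only reproduces that arithmetic. The helper name `effective_batch_size` and the single-device setting are assumptions for illustration; the commit does not state how many devices were used.

```python
# Values copied from the added lines of the diff.
train_batch_size = 4
gradient_accumulation_steps = 8

# Assumption: one device. The commit does not say how many were used.
num_devices = 1


def effective_batch_size(per_step: int, accumulation_steps: int, devices: int = 1) -> int:
    """Return how many examples contribute to a single optimizer update."""
    return per_step * accumulation_steps * devices


total_train_batch_size = effective_batch_size(
    train_batch_size, gradient_accumulation_steps, num_devices
)
print(total_train_batch_size)  # 32, matching total_train_batch_size in the card
```

Under that assumption the optimizer sees the same number of examples per update as before the change, so the two runs remain roughly comparable even though the per-step memory footprint is smaller.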
 
 
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | 0.761 | 0.8 | 25 | 0.2104 | 0.0852 |
 
 
  ### Framework versions