GBaker committed on
Commit 7990278
1 Parent(s): 22bec86

update model card README.md

Files changed (1)
  1. README.md +11 -9
README.md CHANGED
@@ -15,8 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [uw-madison/nystromformer-4096](https://huggingface.co/uw-madison/nystromformer-4096) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3796
-- Accuracy: 0.2773
+- Loss: 1.4537
+- Accuracy: 0.2883
 
 ## Model description
 
@@ -39,24 +39,26 @@ The following hyperparameters were used during training:
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
-- gradient_accumulation_steps: 8
-- total_train_batch_size: 32
+- gradient_accumulation_steps: 32
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log | 1.0 | 318 | 1.3863 | 0.2372 |
-| 1.3888 | 2.0 | 636 | 1.3858 | 0.2388 |
-| 1.3888 | 3.0 | 954 | 1.3796 | 0.2773 |
+| No log | 0.99 | 79 | 1.3860 | 0.2467 |
+| No log | 1.99 | 158 | 1.3853 | 0.2616 |
+| No log | 2.99 | 237 | 1.3785 | 0.2820 |
+| No log | 3.99 | 316 | 1.3801 | 0.2820 |
+| No log | 4.99 | 395 | 1.4537 | 0.2883 |
 
 
 ### Framework versions
 
 - Transformers 4.26.0
 - Pytorch 1.13.1+cu116
-- Datasets 2.8.0
+- Datasets 2.9.0
 - Tokenizers 0.13.2
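
The batch-size figures changed in this commit can be sanity-checked with simple arithmetic. The sketch below assumes training ran on a single device, which the model card does not state; the helper name `effective_batch_size` and the `num_devices` parameter are illustrative and not part of the card. It multiplies the per-device batch size by the number of accumulation steps and compares the growth in effective batch size with the drop in optimizer steps per epoch reported in the two results tables.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of examples that contribute to one optimizer step."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the diff; num_devices=1 is an assumption.
old_total = effective_batch_size(4, 8)    # card previously listed 32
new_total = effective_batch_size(4, 32)   # card now lists 128

# Optimizer steps logged at the end of the first epoch in each table.
old_steps_per_epoch = 318
new_steps_per_epoch = 79

# A 4x larger effective batch should give roughly 4x fewer steps per epoch.
batch_ratio = new_total / old_total
step_ratio = old_steps_per_epoch / new_steps_per_epoch

print(old_total, new_total)               # 32 128
print(batch_ratio, round(step_ratio, 2))  # 4.0 4.03
```

The two ratios agree closely but not exactly. One plausible explanation, not confirmed by the card, is that the number of batches per epoch is not an exact multiple of 32, so the last optimizer step of each epoch lands slightly before the epoch boundary; this would also account for the fractional epoch values (0.99, 1.99, ...) in the new table.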