DeepDream2045 commited on
Commit
3abfbc1
·
verified ·
1 Parent(s): 77b9e0a

End of training

Browse files
Files changed (2) hide show
  1. README.md +4 -4
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -98,7 +98,7 @@ xformers_attention: true
98
 
99
  This model is a fine-tuned version of [NousResearch/Yarn-Mistral-7b-64k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-64k) on the None dataset.
100
  It achieves the following results on the evaluation set:
101
- - Loss: 2.1861
102
 
103
  ## Model description
104
 
@@ -135,9 +135,9 @@ The following hyperparameters were used during training:
135
 
136
  | Training Loss | Epoch | Step | Validation Loss |
137
  |:-------------:|:------:|:----:|:---------------:|
138
- | 32.5319 | 0.0004 | 1 | 2.2997 |
139
- | 45.1928 | 0.0088 | 25 | 2.1923 |
140
- | 45.6105 | 0.0176 | 50 | 2.1861 |
141
 
142
 
143
  ### Framework versions
 
98
 
99
  This model is a fine-tuned version of [NousResearch/Yarn-Mistral-7b-64k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-64k) on the None dataset.
100
  It achieves the following results on the evaluation set:
101
+ - Loss: 2.1862
102
 
103
  ## Model description
104
 
 
135
 
136
  | Training Loss | Epoch | Step | Validation Loss |
137
  |:-------------:|:------:|:----:|:---------------:|
138
+ | 32.5311 | 0.0004 | 1 | 2.2997 |
139
+ | 45.2215 | 0.0088 | 25 | 2.1925 |
140
+ | 45.6407 | 0.0176 | 50 | 2.1862 |
141
 
142
 
143
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:abffa9b619c8db3bcba2ab08587557f87865398b50856ef9bf665d77c2e76682
3
  size 335706186
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:78bd26063133246ba38d2644d40a3136e2e23858f6e18281d1fbbeeeb094bff2
3
  size 335706186