vdaita commited on
Commit
4c7df4f
1 Parent(s): 4549a9a

End of training

Browse files
Files changed (2) hide show
  1. README.md +5 -6
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -107,7 +107,7 @@ special_tokens:
107
 
108
  This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
109
  It achieves the following results on the evaluation set:
110
- - Loss: 0.0202
111
 
112
  ## Model description
113
 
@@ -144,11 +144,10 @@ The following hyperparameters were used during training:
144
 
145
  | Training Loss | Epoch | Step | Validation Loss |
146
  |:-------------:|:-----:|:----:|:---------------:|
147
- | 0.2054 | 0.02 | 1 | 0.2354 |
148
- | 0.062 | 0.25 | 15 | 0.0651 |
149
- | 0.0333 | 0.5 | 30 | 0.0370 |
150
- | 0.0215 | 0.75 | 45 | 0.0218 |
151
- | 0.0174 | 1.0 | 60 | 0.0202 |
152
 
153
 
154
  ### Framework versions
 
107
 
108
  This model is a fine-tuned version of [deepseek-ai/deepseek-coder-6.7b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct) on the None dataset.
109
  It achieves the following results on the evaluation set:
110
+ - Loss: 0.1634
111
 
112
  ## Model description
113
 
 
144
 
145
  | Training Loss | Epoch | Step | Validation Loss |
146
  |:-------------:|:-----:|:----:|:---------------:|
147
+ | 0.3241 | 0.02 | 1 | 0.3550 |
148
+ | 0.2785 | 0.25 | 11 | 0.2303 |
149
+ | 0.2129 | 0.51 | 22 | 0.1771 |
150
+ | 0.1803 | 0.76 | 33 | 0.1634 |
 
151
 
152
 
153
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:93937c3dd5f3a3538a2062a256e4416162f68ea8decad2f081fd608f1ae1eb64
3
  size 848460690
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3819cb5d5e46941f03a7f51ca30705ee8c8cb14dc6cf5ef1b056b94e2798cde2
3
  size 848460690