apfurman committed on
Commit
6c31eff
1 Parent(s): c019233

Model save

Files changed (2)
  1. README.md +2 -14
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -6,8 +6,6 @@ tags:
  - sft
  - generated_from_trainer
  base_model: google/gemma-2b
- datasets:
- - generator
  model-index:
  - name: gemma-2b-dolly-qa
  results: []
@@ -18,9 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # gemma-2b-dolly-qa
 
- This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on the generator dataset.
- It achieves the following results on the evaluation set:
- - Loss: 2.4690
+ This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 
  ## Model description
 
@@ -48,15 +44,7 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.05
- - training_steps: 200
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss |
- |:-------------:|:------:|:----:|:---------------:|
- | 2.8295 | 1.6393 | 100 | 2.5585 |
- | 2.5394 | 3.2787 | 200 | 2.4690 |
-
+ - training_steps: 1480
 
  ### Framework versions
 
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fe2dfb61fa68f87785bca7c4b31bbf2797edf7cd7036852277b807f936b5c17c
+ oid sha256:657a8a9c2bb35d84a476d3095c97851b469678cebbce8e3f10bc206bf23bbe76
  size 156926880
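
The `adapter_model.safetensors` change above touches only the Git LFS pointer: the `oid` differs while `size` stays at 156926880 bytes. As a minimal sketch, a downloaded adapter file could be checked against the new pointer roughly as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library, and the local file path is whatever the reader has on disk.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict.

    Hypothetical helper for illustration only.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256":
        return False
    if path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# Pointer contents after this commit, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:657a8a9c2bb35d84a476d3095c97851b469678cebbce8e3f10bc206bf23bbe76
size 156926880
"""
pointer = parse_lfs_pointer(POINTER)
```

Calling `matches_pointer(Path("adapter_model.safetensors"), pointer)` on a local copy should return `True` only when both the byte size and the digest agree with the pointer.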