Thimira commited on
Commit
079419e
1 Parent(s): ca89f01

Model save

Browse files
README.md CHANGED
@@ -5,6 +5,8 @@ tags:
5
  - sft
6
  - generated_from_trainer
7
  base_model: NousResearch/Llama-2-7b-chat-hf
 
 
8
  model-index:
9
  - name: sinhala-llama-2-7b-chat-hf
10
  results: []
@@ -15,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # sinhala-llama-2-7b-chat-hf
17
 
18
- This model is a fine-tuned version of [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf) on an unknown dataset.
19
 
20
  ## Model description
21
 
@@ -41,7 +43,7 @@ The following hyperparameters were used during training:
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: constant
43
  - lr_scheduler_warmup_ratio: 0.03
44
- - num_epochs: 1
45
 
46
  ### Training results
47
 
@@ -49,8 +51,8 @@ The following hyperparameters were used during training:
49
 
50
  ### Framework versions
51
 
52
- - PEFT 0.7.2.dev0
53
- - Transformers 4.36.2
54
- - Pytorch 2.1.2+cu121
55
- - Datasets 2.16.1
56
- - Tokenizers 0.15.1
 
5
  - sft
6
  - generated_from_trainer
7
  base_model: NousResearch/Llama-2-7b-chat-hf
8
+ datasets:
9
+ - generator
10
  model-index:
11
  - name: sinhala-llama-2-7b-chat-hf
12
  results: []
 
17
 
18
  # sinhala-llama-2-7b-chat-hf
19
 
20
+ This model is a fine-tuned version of [NousResearch/Llama-2-7b-chat-hf](https://huggingface.co/NousResearch/Llama-2-7b-chat-hf) on the generator dataset.
21
 
22
  ## Model description
23
 
 
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: constant
45
  - lr_scheduler_warmup_ratio: 0.03
46
+ - num_epochs: 2
47
 
48
  ### Training results
49
 
 
51
 
52
  ### Framework versions
53
 
54
+ - PEFT 0.10.0
55
+ - Transformers 4.39.3
56
+ - Pytorch 2.1.0
57
+ - Datasets 2.18.0
58
+ - Tokenizers 0.15.2
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f4f45c508486d2bedcd6b2ae6c0cb1ac9c1c332a2a8541033923c6ceb37fc34
3
  size 67126232
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ceb78c3e63da95de0658a25a983c432aab484a8cc34b6bb2d3708fdbcfdeb0af
3
  size 67126232
runs/Apr03_15-17-23_ip-172-16-181-33.ec2.internal/events.out.tfevents.1712157462.ip-172-16-181-33.ec2.internal.13907.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5321c20d62e5d6b80d634b66d722b19b20b78693321041c689604dbd9719b64e
3
- size 80108
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:769dce4f52bc6b153e745fd63ca9f6e9167aa1293c1afaacd19665fcc646ca02
3
+ size 80462