jlpan
/

starcoder-c2py-snippet1

Generated from Trainer

Model card Files Files and versions Community

jlpan commited on Aug 28, 2023

Commit

e11d55a

•

1 Parent(s): f338e35

update model card README.md

Browse files

Files changed (1) hide show

README.md +8 -16

README.md CHANGED Viewed

@@ -6,7 +6,6 @@ tags:
 model-index:
 - name: starcoder-c2py-snippet1
   results: []
-library_name: peft
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2014
 ## Model description
@@ -43,29 +42,22 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 10
-- training_steps: 100
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 8.2651        | 0.1   | 10   | 4.2201          |
-| 0.9761        | 0.2   | 20   | 0.5205          |
-| 0.3183        | 0.3   | 30   | 0.2766          |
-| 0.1887        | 1.04  | 40   | 0.2384          |
-| 0.1867        | 1.14  | 50   | 0.2171          |
-| 0.1732        | 1.24  | 60   | 0.2072          |
-| 0.156         | 1.34  | 70   | 0.2034          |
-| 0.1415        | 2.08  | 80   | 0.2022          |
-| 0.1614        | 2.17  | 90   | 0.2016          |
-| 0.1568        | 2.27  | 100  | 0.2014          |
 ### Framework versions
-- PEFT 0.5.0.dev0
-- PEFT 0.5.0.dev0
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0

 model-index:
 - name: starcoder-c2py-snippet1
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [bigcode/starcoder](https://huggingface.co/bigcode/starcoder) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2601
 ## Model description
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 5
+- training_steps: 50
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 7.249         | 0.2   | 10   | 2.0348          |
+| 0.6338        | 0.4   | 20   | 0.5047          |
+| 0.3306        | 0.6   | 30   | 0.3064          |
+| 0.2144        | 1.07  | 40   | 0.2655          |
+| 0.2195        | 1.27  | 50   | 0.2601          |
 ### Framework versions
 - Transformers 4.32.0.dev0
 - Pytorch 2.0.1+cu117
 - Datasets 2.12.0