vichyt
/

codet5p-770m-py-codebleu-1-True-1e-07-0.1

+---
+license: bsd-3-clause
+tags:
+- generated_from_trainer
+datasets:
+- mbpp
+model-index:
+- name: codet5p-770m-py-codebleu-1-True-1e-07-0.1
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# codet5p-770m-py-codebleu-1-True-1e-07-0.1
+This model is a fine-tuned version of [Salesforce/codet5p-770m-py](https://huggingface.co/Salesforce/codet5p-770m-py) on the mbpp dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.6263
+- Codebleu: 0.0880
+- Ngram Match Score: 0.0119
+- Weighted Ngram Match Score: 0.0435
+- Syntax Match Score: 0.1209
+- Dataflow Match Score: 0.0852
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-07
+- train_batch_size: 6
+- eval_batch_size: 6
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 50
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Codebleu | Ngram Match Score | Weighted Ngram Match Score | Syntax Match Score | Dataflow Match Score |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:-----------------:|:--------------------------:|:------------------:|:--------------------:|
+| 0.9753        | 1.0   | 63   | 0.9060          | 0.0244   | 0.0000            | 0.0108                     | 0.0289             | 0.0293               |
+| 0.9732        | 2.0   | 126  | 0.8664          | 0.0781   | 0.0104            | 0.0358                     | 0.1089             | 0.0747               |
+| 0.9044        | 3.0   | 189  | 0.8430          | 0.0802   | 0.0110            | 0.0363                     | 0.1132             | 0.0754               |
+| 0.8564        | 4.0   | 252  | 0.8162          | 0.0821   | 0.0116            | 0.0384                     | 0.1146             | 0.0782               |
+| 0.8289        | 5.0   | 315  | 0.7880          | 0.0861   | 0.0135            | 0.0430                     | 0.1214             | 0.0796               |
+| 0.8171        | 6.0   | 378  | 0.7615          | 0.0862   | 0.0134            | 0.0422                     | 0.1219             | 0.0796               |
+| 0.7935        | 7.0   | 441  | 0.7390          | 0.0856   | 0.0136            | 0.0423                     | 0.1204             | 0.0796               |
+| 0.781         | 8.0   | 504  | 0.7206          | 0.0883   | 0.0143            | 0.0435                     | 0.1219             | 0.0845               |
+| 0.7608        | 9.0   | 567  | 0.7065          | 0.0855   | 0.0122            | 0.0396                     | 0.1171             | 0.0838               |
+| 0.7404        | 10.0  | 630  | 0.6934          | 0.0804   | 0.0094            | 0.0351                     | 0.1118             | 0.0782               |
+| 0.7388        | 11.0  | 693  | 0.6847          | 0.0787   | 0.0089            | 0.0337                     | 0.1108             | 0.0754               |
+| 0.7178        | 12.0  | 756  | 0.6786          | 0.0792   | 0.0090            | 0.0339                     | 0.1113             | 0.0761               |
+| 0.7087        | 13.0  | 819  | 0.6736          | 0.0811   | 0.0106            | 0.0388                     | 0.1122             | 0.0782               |
+| 0.7035        | 14.0  | 882  | 0.6690          | 0.0820   | 0.0109            | 0.0388                     | 0.1122             | 0.0803               |
+| 0.7005        | 15.0  | 945  | 0.6652          | 0.0842   | 0.0106            | 0.0384                     | 0.1151             | 0.0831               |
+| 0.688         | 16.0  | 1008 | 0.6620          | 0.0835   | 0.0104            | 0.0380                     | 0.1156             | 0.0810               |
+| 0.6911        | 17.0  | 1071 | 0.6587          | 0.0833   | 0.0106            | 0.0382                     | 0.1166             | 0.0796               |
+| 0.6782        | 18.0  | 1134 | 0.6559          | 0.0851   | 0.0114            | 0.0416                     | 0.1156             | 0.0838               |
+| 0.678         | 19.0  | 1197 | 0.6536          | 0.0844   | 0.0115            | 0.0416                     | 0.1132             | 0.0845               |
+| 0.6657        | 20.0  | 1260 | 0.6512          | 0.0856   | 0.0118            | 0.0422                     | 0.1132             | 0.0873               |
+| 0.6702        | 21.0  | 1323 | 0.6491          | 0.0842   | 0.0115            | 0.0416                     | 0.1113             | 0.0859               |
+| 0.662         | 22.0  | 1386 | 0.6471          | 0.0842   | 0.0115            | 0.0416                     | 0.1113             | 0.0859               |
+| 0.6569        | 23.0  | 1449 | 0.6453          | 0.0842   | 0.0116            | 0.0416                     | 0.1113             | 0.0859               |
+| 0.6605        | 24.0  | 1512 | 0.6436          | 0.0860   | 0.0114            | 0.0424                     | 0.1171             | 0.0845               |
+| 0.6589        | 25.0  | 1575 | 0.6420          | 0.0860   | 0.0114            | 0.0424                     | 0.1171             | 0.0845               |
+| 0.6519        | 26.0  | 1638 | 0.6404          | 0.0874   | 0.0118            | 0.0429                     | 0.1190             | 0.0859               |
+| 0.6568        | 27.0  | 1701 | 0.6390          | 0.0874   | 0.0118            | 0.0429                     | 0.1190             | 0.0859               |
+| 0.6569        | 28.0  | 1764 | 0.6378          | 0.0874   | 0.0116            | 0.0428                     | 0.1190             | 0.0859               |
+| 0.6455        | 29.0  | 1827 | 0.6365          | 0.0874   | 0.0116            | 0.0428                     | 0.1190             | 0.0859               |
+| 0.6456        | 30.0  | 1890 | 0.6355          | 0.0880   | 0.0116            | 0.0428                     | 0.1190             | 0.0873               |
+| 0.6503        | 31.0  | 1953 | 0.6345          | 0.0880   | 0.0116            | 0.0428                     | 0.1190             | 0.0873               |
+| 0.6424        | 32.0  | 2016 | 0.6337          | 0.0880   | 0.0116            | 0.0428                     | 0.1190             | 0.0873               |
+| 0.644         | 33.0  | 2079 | 0.6328          | 0.0880   | 0.0116            | 0.0428                     | 0.1190             | 0.0873               |
+| 0.6429        | 34.0  | 2142 | 0.6320          | 0.0872   | 0.0120            | 0.0435                     | 0.1190             | 0.0852               |
+| 0.6436        | 35.0  | 2205 | 0.6313          | 0.0872   | 0.0120            | 0.0435                     | 0.1190             | 0.0852               |
+| 0.638         | 36.0  | 2268 | 0.6307          | 0.0872   | 0.0120            | 0.0435                     | 0.1190             | 0.0852               |
+| 0.6381        | 37.0  | 2331 | 0.6300          | 0.0872   | 0.0120            | 0.0435                     | 0.1190             | 0.0852               |
+| 0.6307        | 38.0  | 2394 | 0.6295          | 0.0872   | 0.0120            | 0.0435                     | 0.1190             | 0.0852               |
+| 0.6344        | 39.0  | 2457 | 0.6289          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6296        | 40.0  | 2520 | 0.6285          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6268        | 41.0  | 2583 | 0.6280          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6315        | 42.0  | 2646 | 0.6276          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6265        | 43.0  | 2709 | 0.6273          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.626         | 44.0  | 2772 | 0.6270          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.631         | 45.0  | 2835 | 0.6268          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6315        | 46.0  | 2898 | 0.6266          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6309        | 47.0  | 2961 | 0.6264          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.627         | 48.0  | 3024 | 0.6263          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.6252        | 49.0  | 3087 | 0.6262          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+| 0.632         | 50.0  | 3150 | 0.6263          | 0.0880   | 0.0119            | 0.0435                     | 0.1209             | 0.0852               |
+### Framework versions
+- Transformers 4.30.0.dev0
+- Pytorch 2.0.1
+- Datasets 2.13.1
+- Tokenizers 0.13.3