vichyt
/

codet5p-770m-py-codebleu-32-True-1e-06-0.1

+---
+license: bsd-3-clause
+tags:
+- generated_from_trainer
+datasets:
+- mbpp
+model-index:
+- name: codet5p-770m-py-codebleu-32-True-1e-06-0.1
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# codet5p-770m-py-codebleu-32-True-1e-06-0.1
+This model is a fine-tuned version of [Salesforce/codet5p-770m-py](https://huggingface.co/Salesforce/codet5p-770m-py) on the mbpp dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.8087
+- Codebleu: 0.0867
+- Ngram Match Score: 0.0137
+- Weighted Ngram Match Score: 0.0422
+- Syntax Match Score: 0.1204
+- Dataflow Match Score: 0.0824
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-06
+- train_batch_size: 6
+- eval_batch_size: 6
+- seed: 42
+- gradient_accumulation_steps: 32
+- total_train_batch_size: 192
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 50
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Codebleu | Ngram Match Score | Weighted Ngram Match Score | Syntax Match Score | Dataflow Match Score |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|:-----------------:|:--------------------------:|:------------------:|:--------------------:|
+| 1.9228        | 0.51  | 1    | 0.9113          | 0.0047   | 0.0000            | 0.0000                     | 0.0048             | 0.0070               |
+| 0.9857        | 1.52  | 3    | 0.9112          | 0.0047   | 0.0000            | 0.0000                     | 0.0048             | 0.0070               |
+| 0.9734        | 2.54  | 5    | 0.9112          | 0.0069   | 0.0000            | 0.0001                     | 0.0067             | 0.0105               |
+| 0.9624        | 3.56  | 7    | 0.9111          | 0.0074   | 0.0000            | 0.0002                     | 0.0072             | 0.0112               |
+| 0.9586        | 4.57  | 9    | 0.9107          | 0.0087   | 0.0000            | 0.0003                     | 0.0092             | 0.0126               |
+| 0.9708        | 5.59  | 11   | 0.9097          | 0.0140   | 0.0000            | 0.0019                     | 0.0178             | 0.0168               |
+| 0.9667        | 6.6   | 13   | 0.9092          | 0.0171   | 0.0000            | 0.0034                     | 0.0202             | 0.0216               |
+| 0.9791        | 7.62  | 15   | 0.9058          | 0.0211   | 0.0000            | 0.0057                     | 0.0255             | 0.0258               |
+| 0.9702        | 8.63  | 17   | 0.9048          | 0.0317   | 0.0001            | 0.0144                     | 0.0366             | 0.0391               |
+| 0.9563        | 9.65  | 19   | 0.9034          | 0.0398   | 0.0007            | 0.0192                     | 0.0477             | 0.0468               |
+| 0.9654        | 10.67 | 21   | 0.8927          | 0.0482   | 0.0014            | 0.0215                     | 0.0583             | 0.0566               |
+| 0.9458        | 11.68 | 23   | 0.8898          | 0.0602   | 0.0043            | 0.0275                     | 0.0742             | 0.0684               |
+| 0.9523        | 12.7  | 25   | 0.8866          | 0.0647   | 0.0053            | 0.0286                     | 0.0829             | 0.0705               |
+| 0.942         | 13.71 | 27   | 0.8847          | 0.0786   | 0.0091            | 0.0338                     | 0.1069             | 0.0789               |
+| 0.94          | 14.73 | 29   | 0.8648          | 0.0798   | 0.0099            | 0.0357                     | 0.1079             | 0.0803               |
+| 0.9025        | 15.75 | 31   | 0.8604          | 0.0809   | 0.0105            | 0.0363                     | 0.1122             | 0.0782               |
+| 0.9058        | 16.76 | 33   | 0.8577          | 0.0815   | 0.0107            | 0.0362                     | 0.1132             | 0.0789               |
+| 0.893         | 17.78 | 35   | 0.8543          | 0.0816   | 0.0110            | 0.0363                     | 0.1132             | 0.0789               |
+| 0.8959        | 18.79 | 37   | 0.8524          | 0.0805   | 0.0109            | 0.0362                     | 0.1113             | 0.0782               |
+| 0.877         | 19.81 | 39   | 0.8422          | 0.0808   | 0.0118            | 0.0385                     | 0.1113             | 0.0782               |
+| 0.861         | 20.83 | 41   | 0.8374          | 0.0811   | 0.0118            | 0.0385                     | 0.1113             | 0.0789               |
+| 0.8365        | 21.84 | 43   | 0.8376          | 0.0827   | 0.0119            | 0.0386                     | 0.1132             | 0.0810               |
+| 0.8293        | 22.86 | 45   | 0.8331          | 0.0853   | 0.0126            | 0.0390                     | 0.1180             | 0.0824               |
+| 0.8288        | 23.87 | 47   | 0.8246          | 0.0852   | 0.0134            | 0.0421                     | 0.1180             | 0.0810               |
+| 0.8175        | 24.89 | 49   | 0.8141          | 0.0852   | 0.0134            | 0.0421                     | 0.1180             | 0.0810               |
+| 0.6345        | 25.4  | 50   | 0.8087          | 0.0867   | 0.0137            | 0.0422                     | 0.1204             | 0.0824               |
+### Framework versions
+- Transformers 4.30.0.dev0
+- Pytorch 2.0.1
+- Datasets 2.13.1
+- Tokenizers 0.13.3