codet5p-770m-py-codebleu-1-True-1e-07-0.1

This model is a fine-tuned version of Salesforce/codet5p-770m-py on the mbpp dataset. It achieves the following results on the evaluation set:

Loss: 0.6263
Codebleu: 0.0880
Ngram Match Score: 0.0119
Weighted Ngram Match Score: 0.0435
Syntax Match Score: 0.1209
Dataflow Match Score: 0.0852

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-07
train_batch_size: 6
eval_batch_size: 6
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 100
num_epochs: 50
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Codebleu	Ngram Match Score	Weighted Ngram Match Score	Syntax Match Score	Dataflow Match Score
0.9753	1.0	63	0.9060	0.0244	0.0000	0.0108	0.0289	0.0293
0.9732	2.0	126	0.8664	0.0781	0.0104	0.0358	0.1089	0.0747
0.9044	3.0	189	0.8430	0.0802	0.0110	0.0363	0.1132	0.0754
0.8564	4.0	252	0.8162	0.0821	0.0116	0.0384	0.1146	0.0782
0.8289	5.0	315	0.7880	0.0861	0.0135	0.0430	0.1214	0.0796
0.8171	6.0	378	0.7615	0.0862	0.0134	0.0422	0.1219	0.0796
0.7935	7.0	441	0.7390	0.0856	0.0136	0.0423	0.1204	0.0796
0.781	8.0	504	0.7206	0.0883	0.0143	0.0435	0.1219	0.0845
0.7608	9.0	567	0.7065	0.0855	0.0122	0.0396	0.1171	0.0838
0.7404	10.0	630	0.6934	0.0804	0.0094	0.0351	0.1118	0.0782
0.7388	11.0	693	0.6847	0.0787	0.0089	0.0337	0.1108	0.0754
0.7178	12.0	756	0.6786	0.0792	0.0090	0.0339	0.1113	0.0761
0.7087	13.0	819	0.6736	0.0811	0.0106	0.0388	0.1122	0.0782
0.7035	14.0	882	0.6690	0.0820	0.0109	0.0388	0.1122	0.0803
0.7005	15.0	945	0.6652	0.0842	0.0106	0.0384	0.1151	0.0831
0.688	16.0	1008	0.6620	0.0835	0.0104	0.0380	0.1156	0.0810
0.6911	17.0	1071	0.6587	0.0833	0.0106	0.0382	0.1166	0.0796
0.6782	18.0	1134	0.6559	0.0851	0.0114	0.0416	0.1156	0.0838
0.678	19.0	1197	0.6536	0.0844	0.0115	0.0416	0.1132	0.0845
0.6657	20.0	1260	0.6512	0.0856	0.0118	0.0422	0.1132	0.0873
0.6702	21.0	1323	0.6491	0.0842	0.0115	0.0416	0.1113	0.0859
0.662	22.0	1386	0.6471	0.0842	0.0115	0.0416	0.1113	0.0859
0.6569	23.0	1449	0.6453	0.0842	0.0116	0.0416	0.1113	0.0859
0.6605	24.0	1512	0.6436	0.0860	0.0114	0.0424	0.1171	0.0845
0.6589	25.0	1575	0.6420	0.0860	0.0114	0.0424	0.1171	0.0845
0.6519	26.0	1638	0.6404	0.0874	0.0118	0.0429	0.1190	0.0859
0.6568	27.0	1701	0.6390	0.0874	0.0118	0.0429	0.1190	0.0859
0.6569	28.0	1764	0.6378	0.0874	0.0116	0.0428	0.1190	0.0859
0.6455	29.0	1827	0.6365	0.0874	0.0116	0.0428	0.1190	0.0859
0.6456	30.0	1890	0.6355	0.0880	0.0116	0.0428	0.1190	0.0873
0.6503	31.0	1953	0.6345	0.0880	0.0116	0.0428	0.1190	0.0873
0.6424	32.0	2016	0.6337	0.0880	0.0116	0.0428	0.1190	0.0873
0.644	33.0	2079	0.6328	0.0880	0.0116	0.0428	0.1190	0.0873
0.6429	34.0	2142	0.6320	0.0872	0.0120	0.0435	0.1190	0.0852
0.6436	35.0	2205	0.6313	0.0872	0.0120	0.0435	0.1190	0.0852
0.638	36.0	2268	0.6307	0.0872	0.0120	0.0435	0.1190	0.0852
0.6381	37.0	2331	0.6300	0.0872	0.0120	0.0435	0.1190	0.0852
0.6307	38.0	2394	0.6295	0.0872	0.0120	0.0435	0.1190	0.0852
0.6344	39.0	2457	0.6289	0.0880	0.0119	0.0435	0.1209	0.0852
0.6296	40.0	2520	0.6285	0.0880	0.0119	0.0435	0.1209	0.0852
0.6268	41.0	2583	0.6280	0.0880	0.0119	0.0435	0.1209	0.0852
0.6315	42.0	2646	0.6276	0.0880	0.0119	0.0435	0.1209	0.0852
0.6265	43.0	2709	0.6273	0.0880	0.0119	0.0435	0.1209	0.0852
0.626	44.0	2772	0.6270	0.0880	0.0119	0.0435	0.1209	0.0852
0.631	45.0	2835	0.6268	0.0880	0.0119	0.0435	0.1209	0.0852
0.6315	46.0	2898	0.6266	0.0880	0.0119	0.0435	0.1209	0.0852
0.6309	47.0	2961	0.6264	0.0880	0.0119	0.0435	0.1209	0.0852
0.627	48.0	3024	0.6263	0.0880	0.0119	0.0435	0.1209	0.0852
0.6252	49.0	3087	0.6262	0.0880	0.0119	0.0435	0.1209	0.0852
0.632	50.0	3150	0.6263	0.0880	0.0119	0.0435	0.1209	0.0852

Framework versions

Transformers 4.30.0.dev0
Pytorch 2.0.1
Datasets 2.13.1
Tokenizers 0.13.3

vichyt
/

codet5p-770m-py-codebleu-1-True-1e-07-0.1

codet5p-770m-py-codebleu-1-True-1e-07-0.1

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Dataset used to train vichyt/codet5p-770m-py-codebleu-1-True-1e-07-0.1

Evaluation results