Ashraf-kasem
/

custom_gpt2_frames_text_continue

Text Generation

generated_from_keras_callback

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Edit model card

Ashraf-kasem/custom_gpt2_frames_text_continue

This model is a fine-tuned version of Ashraf-kasem/custom_gpt2_frames_text_continue on an unknown dataset. It achieves the following results on the evaluation set:

Train Loss: 0.6337
Validation Loss: 2.3028
Epoch: 99

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'LinearWarmup', 'config': {'after_warmup_lr_sched': {'initial_learning_rate': 5e-05, 'decay_steps': 628900, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, 'warmup_steps': 125780, 'warmup_learning_rate': 0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
training_precision: mixed_float16

Training results

Train Loss	Validation Loss	Epoch
1.0060	2.0768	0
1.0147	2.0771	1
1.0238	2.0821	2
1.0331	2.0851	3
1.0422	2.0870	4
1.0525	2.0945	5
1.0618	2.1005	6
1.0718	2.1014	7
1.0823	2.1056	8
1.0921	2.1099	9
1.1028	2.1106	10
1.1127	2.1127	11
1.1230	2.1183	12
1.1329	2.1207	13
1.1423	2.1270	14
1.1521	2.1234	15
1.1614	2.1283	16
1.1700	2.1236	17
1.1784	2.1320	18
1.1864	2.1359	19
1.1873	2.1272	20
1.1766	2.1250	21
1.1652	2.1260	22
1.1537	2.1224	23
1.1415	2.1278	24
1.1296	2.1254	25
1.1178	2.1213	26
1.1059	2.1301	27
1.0950	2.1253	28
1.0838	2.1264	29
1.0729	2.1273	30
1.0625	2.1355	31
1.0519	2.1345	32
1.0414	2.1364	33
1.0317	2.1324	34
1.0217	2.1410	35
1.0126	2.1428	36
1.0027	2.1427	37
0.9936	2.1494	38
0.9846	2.1502	39
0.9752	2.1490	40
0.9665	2.1501	41
0.9582	2.1552	42
0.9497	2.1533	43
0.9411	2.1621	44
0.9331	2.1618	45
0.9248	2.1655	46
0.9172	2.1755	47
0.9093	2.1759	48
0.9014	2.1751	49
0.8942	2.1813	50
0.8867	2.1831	51
0.8795	2.1856	52
0.8723	2.1909	53
0.8651	2.1950	54
0.8581	2.1955	55
0.8511	2.2007	56
0.8444	2.2002	57
0.8380	2.2078	58
0.8312	2.2077	59
0.8246	2.2161	60
0.8186	2.2103	61
0.8120	2.2180	62
0.8053	2.2202	63
0.7994	2.2232	64
0.7934	2.2290	65
0.7872	2.2301	66
0.7816	2.2327	67
0.7757	2.2369	68
0.7698	2.2408	69
0.7640	2.2439	70
0.7582	2.2451	71
0.7528	2.2505	72
0.7475	2.2524	73
0.7420	2.2520	74
0.7366	2.2561	75
0.7313	2.2616	76
0.7260	2.2628	77
0.7211	2.2654	78
0.7158	2.2701	79
0.7107	2.2704	80
0.7061	2.2743	81
0.7008	2.2749	82
0.6962	2.2769	83
0.6916	2.2813	84
0.6869	2.2838	85
0.6823	2.2853	86
0.6780	2.2867	87
0.6737	2.2883	88
0.6691	2.2921	89
0.6651	2.2931	90
0.6608	2.2946	91
0.6568	2.2957	92
0.6533	2.2984	93
0.6494	2.2981	94
0.6459	2.2994	95
0.6425	2.3006	96
0.6395	2.3019	97
0.6363	2.3026	98
0.6337	2.3028	99

Framework versions

Transformers 4.25.1
TensorFlow 2.9.0
Datasets 2.8.0
Tokenizers 0.13.2

Downloads last month: 4

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Evaluation results

Metadata error: specify a dataset to view leaderboard