qna_model_0000_1 / hyper_params.json
logoyazilim's picture
Upload 10 files
0bf0462
raw
history blame contribute delete
No virus
166 Bytes
{"per_device_train_batch_size": 8, "per_device_eval_batch_size": 8, "gradient_accumulation_steps": 4, "learning_rate": 0.0005, "num_train_epochs": 8, "max_steps": -1}