Model Card for Model ID

This model card aims to be a base template for new models. It was generated from the raw model card template.

Training Procedure

Training Hyperparameters

learning_rate: 5e-05
train_batch_size: 2
eval_batch_size: 8
seed: 42
gradient_accumulation_steps: 8
total_train_batch_size: 16
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
num_epochs: 10.0
mixed_precision_training: Native AMP
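The relationship between the batch-size values and the shape of the cosine schedule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the card does not list a device count or warmup steps, so the sketch infers a single device from the reported total and assumes no warmup. The `total_steps` argument and both helper names are hypothetical, and the actual scheduler implementation used for this model is not shown in the card.

```python
import math

# Values copied from the hyperparameter list in this card.
learning_rate = 5e-05
train_batch_size = 2             # per-device batch size
gradient_accumulation_steps = 8
total_train_batch_size = 16

# Assumption: the device count is not stated; it is inferred from the totals.
num_devices = total_train_batch_size // (
    train_batch_size * gradient_accumulation_steps
)


def effective_batch_size(per_device, accumulation_steps, devices):
    """Number of samples that contribute to one optimizer step."""
    return per_device * accumulation_steps * devices


def cosine_lr(step, total_steps, base_lr=learning_rate, warmup_steps=0):
    """Half-cosine decay from base_lr to 0 (hypothetical helper).

    warmup_steps defaults to 0 because the card lists no warmup.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    print(effective_batch_size(train_batch_size,
                               gradient_accumulation_steps,
                               num_devices))
    # total_steps=1000 is an arbitrary illustrative value.
    for step in (0, 250, 500, 750, 1000):
        print(step, cosine_lr(step, total_steps=1000))
```

With gradient accumulation, each optimizer step sees 2 x 8 = 16 samples per device, which matches the reported total_train_batch_size only if one device was used.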

Model size: 7.72B params (Safetensors)
Tensor type: BF16

Dataset used to train jueyuan111/q7-oher