sanchit-gandhi
/

distil-mistral-3B-Instruct-v0.2-logprob-1.5

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

distil-mistral-3B-Instruct-v0.2-logprob-1.5 / distil-mistral /1715175073.1907828 /hparams.yml

sanchit-gandhi's picture

sanchit-gandhi HF staff

Saving train state of step 5000

7f41237 verified 6 months ago

history blame contribute delete

510 Bytes

	adam_beta1: 0.9
	adam_beta2: 0.999
	global_batch_size: 32
	gradient_accumulation_steps: 2
	learning_rate: 0.0001
	logprob_threshold: -1.5
	lr_scheduler_type: !!python/object/apply:transformers.trainer_utils.SchedulerType
	- linear
	max_label_length: 4096
	max_steps: 200000
	mixed_precision: bf16
	model_name_or_path: sanchit-gandhi/Mistral-3B-Instruct-v0.2
	num_train_epochs: 3.0
	per_device_train_batch_size: 4
	teacher_name_or_path: mistralai/Mistral-7B-Instruct-v0.2
	temperature: 2.0
	warmup_steps: 500
	weight_decay: 0.0