Mistral-7B-Instruct-v0.2-4b-r8-task295

This is a PEFT LoRA adapter trained for the heterogeneous-rank Lots-of-LoRAs experiment.

Source

  • Base model: mistralai/Mistral-7B-Instruct-v0.2
  • Dataset: Lots-of-LoRAs/task295_semeval_2020_task4_commonsense_reasoning
  • Train split: train
  • Eval split: valid
  • Task ID: 295
  • Description: semeval 2020 task4 commonsense reasoning

LoRA

  • Rank: 8
  • Target modules: q_proj, k_proj, v_proj
  • LoRA alpha: 32
  • LoRA dropout: 0.05
  • Bias: none

Training protocol

  • Base model dtype: 4bit-nf4
  • Quantization: QLoRA 4bit NF4, double quantization enabled, bf16 compute
  • Adapter trainable dtype: float32
  • Prompt format: plain
  • Loss: completion-only causal LM cross entropy
  • Epochs: 5.0
  • Best checkpoint metric: eval_loss
  • Learning rate: 0.0002
  • Scheduler: cosine
  • Warmup ratio: 0.03
  • Effective batch size: 16
  • Optimizer: paged_adamw_32bit

Files

  • adapter_model.safetensors: LoRA adapter weights
  • adapter_config.json: PEFT adapter configuration
  • task_manifest.json: source manifest row and resolved splits
  • training_protocol.json: fixed protocol used for this run
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295

Adapter
(1268)
this model