geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295

Mistral-7B-Instruct-v0.2-4b-r8-task295

This is a PEFT LoRA adapter trained for the heterogeneous-rank Lots-of-LoRAs experiment.

Source

Base model: mistralai/Mistral-7B-Instruct-v0.2
Dataset: Lots-of-LoRAs/task295_semeval_2020_task4_commonsense_reasoning
Train split: train
Eval split: valid
Task ID: 295
Description: SemEval-2020 Task 4 commonsense reasoning

LoRA

Rank: 8
Target modules: q_proj, k_proj, v_proj
LoRA alpha: 32
LoRA dropout: 0.05
Bias: none
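
The settings above can be sketched as a PEFT configuration, together with the arithmetic they imply. This is a minimal sketch, not the author's training script: `task_type` and the Mistral-7B layer shapes (hidden size 4096, 32 layers, 1024-wide key/value projections) are assumptions taken from the base model's public configuration rather than from this card, and the `alpha / r` scaling assumes standard (non rank-stabilized) LoRA. `adapter_config.json` in this repository is the authoritative record.

```python
# Values copied from the "LoRA" section of this card.
LORA_KWARGS = {
    "r": 8,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    "bias": "none",
    "target_modules": ["q_proj", "k_proj", "v_proj"],
    "task_type": "CAUSAL_LM",  # assumption: not stated on the card
}

# Assumed Mistral-7B shapes (from the base model's public config, not this card).
HIDDEN_SIZE = 4096
NUM_LAYERS = 32
KV_DIM = 1024  # 8 key/value heads * 128 head dim (grouped-query attention)

# (in_features, out_features) of each targeted projection.
PROJ_SHAPES = {
    "q_proj": (HIDDEN_SIZE, HIDDEN_SIZE),
    "k_proj": (HIDDEN_SIZE, KV_DIM),
    "v_proj": (HIDDEN_SIZE, KV_DIM),
}


def lora_scaling(kwargs=LORA_KWARGS):
    """Standard LoRA scaling factor alpha / r applied to the low-rank update."""
    return kwargs["lora_alpha"] / kwargs["r"]


def lora_param_count(kwargs=LORA_KWARGS):
    """Trainable parameters: each adapted projection adds r * (in + out) weights."""
    per_layer = sum(
        kwargs["r"] * (fan_in + fan_out)
        for name, (fan_in, fan_out) in PROJ_SHAPES.items()
        if name in kwargs["target_modules"]
    )
    return per_layer * NUM_LAYERS


def build_lora_config():
    """Build the PEFT config. peft is imported lazily so the arithmetic runs without it."""
    from peft import LoraConfig

    return LoraConfig(**LORA_KWARGS)


if __name__ == "__main__":
    print(lora_scaling())      # 4.0
    print(lora_param_count())  # 4718592
```

Under these assumptions the adapter holds roughly 4.7M trainable parameters, a small fraction of the 7B-parameter base model.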

Training protocol

Base model dtype: 4bit-nf4
Quantization: QLoRA, 4-bit NF4, double quantization enabled, bf16 compute
Adapter trainable dtype: float32
Prompt format: plain
Loss: completion-only causal LM cross entropy
Epochs: 5.0
Best checkpoint metric: eval_loss
Learning rate: 0.0002
Scheduler: cosine
Warmup ratio: 0.03
Effective batch size: 16
Optimizer: paged_adamw_32bit
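
Two items in the protocol above lend themselves to a short illustration: the completion-only loss and the cosine schedule with warmup. The sketch below is illustrative only. The helper names `completion_only_labels` and `lr_at_step` are hypothetical, the total step count is a made-up input (the card does not state the dataset size or how the effective batch size of 16 is split across devices and accumulation steps), and the schedule is assumed to be linear warmup followed by a half-cosine decay to zero.

```python
import math

IGNORE_INDEX = -100  # label value skipped by PyTorch cross entropy by default


def completion_only_labels(input_ids, prompt_len):
    """Copy input_ids as labels, masking the first prompt_len (prompt) tokens.

    Only completion tokens then contribute to the causal LM loss.
    """
    return [IGNORE_INDEX] * prompt_len + list(input_ids[prompt_len:])


def lr_at_step(step, total_steps, peak_lr=2e-4, warmup_ratio=0.03):
    """Learning rate under linear warmup followed by cosine decay to zero.

    peak_lr and warmup_ratio default to the values listed on this card;
    total_steps is an illustrative input, not a value from the card.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Hypothetical 4-token example: 2 prompt tokens, 2 completion tokens.
    print(completion_only_labels([5, 6, 7, 8], prompt_len=2))
    # Hypothetical 100-step run: warmup ends at step 3.
    for s in (0, 3, 50, 100):
        print(s, lr_at_step(s, total_steps=100))
```

With a hypothetical 100-step run, the rate climbs linearly for 3 steps to the peak of 0.0002, then decays along a cosine curve to zero at the final step.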

Files

adapter_model.safetensors: LoRA adapter weights
adapter_config.json: PEFT adapter configuration
task_manifest.json: source manifest row and resolved splits
training_protocol.json: fixed protocol used for this run
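
The adapter files listed above could be loaded onto a 4-bit base model roughly as follows. This is a sketch under assumptions, not a tested recipe from the author: it assumes `transformers`, `peft`, `bitsandbytes`, and `torch` are installed, a CUDA GPU is available, and the base model is accessible to your account. `load_adapter` is a hypothetical helper name, and the quantization settings simply mirror the training protocol on this card.

```python
ADAPTER_ID = "geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295"
BASE_ID = "mistralai/Mistral-7B-Instruct-v0.2"

# 4-bit settings mirroring the training protocol listed on this card.
BNB_KWARGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
}


def load_adapter(adapter_id=ADAPTER_ID, base_id=BASE_ID):
    """Load the quantized base model and attach the LoRA adapter for inference."""
    # Heavy imports stay inside the function so the constants above can be
    # inspected without a GPU environment.
    import torch
    from peft import PeftModel
    from transformers import (
        AutoModelForCausalLM,
        AutoTokenizer,
        BitsAndBytesConfig,
    )

    quant = BitsAndBytesConfig(bnb_4bit_compute_dtype=torch.bfloat16, **BNB_KWARGS)
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id, quantization_config=quant, device_map="auto"
    )
    # Reads adapter_config.json and adapter_model.safetensors from the repo.
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return model, tokenizer
```

Because the card lists the prompt format as plain, inputs at inference time would presumably be passed without the Mistral instruction template; `task_manifest.json` and `training_protocol.json` are the place to confirm the exact formatting.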
