Instructions to use geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295 with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-Instruct-v0.2") model = PeftModel.from_pretrained(base_model, "geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295") - Notebooks
- Google Colab
- Kaggle
Mistral-7B-Instruct-v0.2-4b-r8-task295
This is a PEFT LoRA adapter trained for the heterogeneous-rank Lots-of-LoRAs experiment.
Source
- Base model:
mistralai/Mistral-7B-Instruct-v0.2 - Dataset:
Lots-of-LoRAs/task295_semeval_2020_task4_commonsense_reasoning - Train split:
train - Eval split:
valid - Task ID:
295 - Description:
semeval 2020 task4 commonsense reasoning
LoRA
- Rank:
8 - Target modules:
q_proj, k_proj, v_proj - LoRA alpha:
32 - LoRA dropout:
0.05 - Bias:
none
Training protocol
- Base model dtype:
4bit-nf4 - Quantization:
QLoRA 4bit NF4, double quantization enabled, bf16 compute - Adapter trainable dtype:
float32 - Prompt format:
plain - Loss: completion-only causal LM cross entropy
- Epochs:
5.0 - Best checkpoint metric:
eval_loss - Learning rate:
0.0002 - Scheduler:
cosine - Warmup ratio:
0.03 - Effective batch size:
16 - Optimizer:
paged_adamw_32bit
Files
adapter_model.safetensors: LoRA adapter weightsadapter_config.json: PEFT adapter configurationtask_manifest.json: source manifest row and resolved splitstraining_protocol.json: fixed protocol used for this run
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for geonho1/Mistral-7B-Instruct-v0.2-4b-r8-task295
Base model
mistralai/Mistral-7B-Instruct-v0.2