Instructions to use Occupying-Mars/issue49-k160-r32-len1024-adapter with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Occupying-Mars/issue49-k160-r32-len1024-adapter with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("/workspace/hf_cache/hub/models--Qwen--Qwen3-8B/snapshots/b968826d9c46dd6066d109eabc6255188de91218") model = PeftModel.from_pretrained(base_model, "Occupying-Mars/issue49-k160-r32-len1024-adapter") - Notebooks
- Google Colab
- Kaggle
BFCL v2 strict-mix k160 r32 LoRA adapter
LoRA adapter for Qwen/Qwen3-8B, trained for the circuit-shotting BFCL single-call experiments.
Run details:
- dataset:
data/bfcl_strict_10k_mix/train.jsonl - mask attribution:
runs/issue49_bfcl_single_call/relp_full.npz - mask size:
160000MLP channels - LoRA rank:
32 - LoRA alpha:
64 - max sequence length:
1024 - training output:
runs/issue49_bfcl_single_call/lora_bfcl10k_topk160000_r32_balanced_len1024_ckpt
Eval on the 1007-row BFCL single-turn/single-call slice, with canonicalization prompt and normalized exact matching:
| condition | score |
|---|---|
| unmasked merged model | 683/1007 = 67.83% |
| k160 masked merged model | 418/1007 = 41.51% |
Included files:
adapter_model.safetensors: PEFT LoRA adapter weightsadapter_config.json: PEFT adapter configtokenizer.json,tokenizer_config.json,chat_template.jinja: tokenizer artifactstrain_summary.json: training summaryeval_masked_summary.json: k160 masked eval summaryeval_unmasked_summary.json: unmasked eval summary
- Downloads last month
- 81