Text Generation
PEFT
Safetensors
English
switch
qwen3
lora
coconut
latent-cot
reasoning
math
on-policy-rl
grpo
conversational
Instructions to use LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("/root/.cache/modelscope/hub/models/Qwen/Qwen3-8B") model = PeftModel.from_pretrained(base_model, "LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B") - Notebooks
- Google Colab
- Kaggle
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!