Uploaded finetuned model

  • Developed by: didula-wso2
  • License: apache-2.0
  • Finetuned from model : didula-wso2/Qwen3-8B-ep4_julia_codeforces_extended_with_thinksft_16bit_vllm

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
145
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
Input a message to start chatting with didula-wso2/Qwen3-8B-rl_with_think_knowledge_merged.

Model tree for didula-wso2/Qwen3-8B-rl_with_think_knowledge_merged