techhermit/qwen35-slice14b-base

This repository contains the sliced 14B base checkpoint used for the distillation branch.

Provenance

  • Base model: Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
  • Slice target: 32 transformer layers
  • Purpose: serve as the structural base for the adapter release repo

Usage

Load this repo as a normal Transformers checkpoint. Then apply the adapter from the release repo if you want the distilled behavior-tuned variant.

Downloads last month
-
Safetensors
Model size
15B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support