This model was converted from the original Qwen/Qwen2.5-Coder-1.5B-Instruct using:
litert-torch export_hf \
  --model=Qwen/Qwen2.5-Coder-1.5B-Instruct \
  --output_dir="./dynamic_wi8_afp32" \
  --quantization_recipe="dynamic_wi8_afp32" \
  --bundle_litert_lm=true
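
The recipe name `dynamic_wi8_afp32` in the command above suggests int8 weights with fp32 activations. The sketch below illustrates that general idea in plain Python, assuming symmetric per-row int8 quantization; it is not the litert-torch implementation, and the helper names `quantize_row` and `linear_wi8_afp32` are hypothetical, chosen only for this illustration.

```python
# Illustrative only: a toy model of "int8 weights, fp32 activations".
# This is an assumption about what the recipe name means, not litert-torch code.

def quantize_row(weights):
    """Symmetric int8 quantization of one weight row; returns (int values, scale)."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the symmetric int8 range.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def linear_wi8_afp32(activations, q_rows, scales):
    """Matrix-vector product with float activations and int8 weights,
    rescaling each output row by its stored float scale."""
    return [
        scale * sum(a * q for a, q in zip(activations, q_row))
        for q_row, scale in zip(q_rows, scales)
    ]


weights = [[0.50, -1.27, 0.10], [0.02, 0.04, -0.01]]
quantized = [quantize_row(row) for row in weights]
q_rows = [q for q, _ in quantized]
scales = [s for _, s in quantized]

x = [1.0, 2.0, 3.0]  # activations stay in floating point
approx = linear_wi8_afp32(x, q_rows, scales)
exact = [sum(a * w for a, w in zip(x, row)) for row in weights]
print("quantized result:", approx)
print("float result:    ", exact)
```

In this toy setup the weights shrink to one byte each plus one float scale per row, while the result stays close to the full-precision product. The actual converter may use different granularity or rounding.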
Model tree for 4ntoine/Qwen2.5-Coder-1.5B-Instruct-LiteRTLM
Base model: Qwen/Qwen2.5-1.5B
Finetuned: Qwen/Qwen2.5-Coder-1.5B
Finetuned: Qwen/Qwen2.5-Coder-1.5B-Instruct