Qwen2-1.5B-Instruct LiteRT-LM Model

This repository (litert-community/Qwen2-1.5B-Instruct) contains LiteRT-LM variants of Qwen/Qwen2-1.5B-Instruct optimized for on-device text generation.

Available Artifact

| File | Quantization Recipe | Context | Size |
| --- | --- | --- | --- |
| Qwen2_1.5B_Instruct.litertlm | dynamic_wi8_afp32 | - | 1.8 GB |
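
The recipe name `dynamic_wi8_afp32` reads as int8 weights with fp32 activations. That reading is an assumption based on the name alone. The sketch below is a toy illustration of the general idea, not the converter's actual implementation: it uses symmetric per-tensor quantization, Python floats stand in for fp32, and the helper names `quantize_weights_int8` and `dot_float_activations` are hypothetical.

```python
def quantize_weights_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dot_float_activations(quantized, scale, activations):
    """Dot product where weights are stored as int8 and activations stay in float."""
    # Accumulate int-weight * float-activation, then apply the weight scale once.
    return scale * sum(q * a for q, a in zip(quantized, activations))


weights = [0.5, -1.27, 0.02, 1.0]
activations = [1.0, 2.0, 3.0, 4.0]

quantized, scale = quantize_weights_int8(weights)
approx = dot_float_activations(quantized, scale, activations)
exact = sum(w * a for w, a in zip(weights, activations))
print(quantized, approx, exact)
```

Storing one byte per weight instead of four is what shrinks the file; the activations keep their floating-point range, so only the weights carry rounding error.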

Integration

Ready to integrate this into your product? Get started in the LiteRT-LM documentation.
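
Before integrating, the artifact has to be on disk. The following is a minimal sketch for fetching it with the Python standard library. It assumes the file sits on the `main` branch of `litert-community/Qwen2-1.5B-Instruct` and uses the Hugging Face Hub `resolve` URL pattern; the helper names `artifact_url` and `download` are hypothetical, and nothing here is part of the LiteRT-LM API.

```python
import urllib.request
from urllib.parse import quote

REPO_ID = "litert-community/Qwen2-1.5B-Instruct"  # repository named on the model page
FILENAME = "Qwen2_1.5B_Instruct.litertlm"         # artifact listed in the table above


def artifact_url(repo_id, filename, revision="main"):
    """Build a Hub download URL; revision="main" is an assumption."""
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


def download(dest=FILENAME):
    """Download the artifact (about 1.8 GB) to dest and return the path."""
    urllib.request.urlretrieve(artifact_url(REPO_ID, FILENAME), dest)
    return dest


# Only print the URL here; call download() explicitly when you want the file.
print(artifact_url(REPO_ID, FILENAME))
```

Pass the resulting local path to whichever LiteRT-LM runtime you use, following the LiteRT-LM documentation.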

