Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Minami-su
/
Qwen1.5-32B-Chat-quip-3bit
like
1
Text Generation
Transformers
PyTorch
qwen2
conversational
Inference Endpoints
text-generation-inference
QUiP
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
Edit model card
YAML Metadata Warning:
empty or missing yaml metadata in repo card (
https://huggingface.co/docs/hub/model-cards#model-card-metadata
)
usage:
https://github.com/Minami-su/vllm-gptq
Downloads last month
2
Inference API
Text Generation
Examples
Input a message to start chatting with
Minami-su/Qwen1.5-32B-Chat-quip-3bit
.
Send
Model is too large to load in Inference API (serverless). To try the model, launch it on
Inference Endpoints (dedicated)
instead.
JSON Output
Maximize