Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
numen-tech
/
gemma-2-2b-it-w4a16g128asym
like
1
Text Generation
MLC-LLM
conversational
4-bit precision
arxiv:
2308.13137
License:
gemma
Model card
Files
Files and versions
Community
Use this model
4-bit
OmniQuant
quantized version of
gemma-2-2b-it
.
Downloads last month
0
Inference Examples
Text Generation
Inference API (serverless) does not yet support mlc-llm models for this pipeline type.
Model tree for
numen-tech/gemma-2-2b-it-w4a16g128asym
Base model
google/gemma-2-2b
Finetuned
google/gemma-2-2b-it
Quantized
(
115
)
this model