Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
v000000
/
L3-8B-MegaSerpentine-imat-GGUFs
like
2
Transformers
GGUF
mergekit
Merge
llama
Not-For-All-Audiences
llama-cpp
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
v000000
commited on
Jun 17
Commit
7c656ba
•
1 Parent(s):
5f13a84
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+3
-1
README.md
CHANGED
Viewed
@@ -64,4 +64,6 @@ dtype: bfloat16
64
65
{output}<|eot_id|>
66
67
-
```
64
65
{output}<|eot_id|>
66
67
+
```
68
+
69
+
./llama-quantize --imatrix ./imatrix.dat ./L3-8B-MegaSerpentine-Tria.fp16.gguf name quantsize