Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CompressedGemma
/
HPC-Quantize
like
0
License:
mit
Model card
Files
Files and versions
xet
Community
1
Copy to bucket
new
main
HPC-Quantize
Commit History
Update hexstate_quantize.c
a034f4d
verified
CompressedGemma
commited on
about 6 hours ago
Update hexstate_quantize.c
4384a45
verified
CompressedGemma
commited on
about 6 hours ago
Q8_0 tied embeddings
f32b3c6
verified
CompressedGemma
commited on
1 day ago
Q8_0 tied embeddings
63a70a0
verified
CompressedGemma
commited on
1 day ago
Update hexstate_quantize.c
57f4b1d
verified
CompressedGemma
commited on
1 day ago
Revert to Alpha 0.1
e0ba36a
verified
CompressedGemma
commited on
2 days ago
ALPHA
73e9225
verified
CompressedGemma
commited on
3 days ago
ALPHA
2432d03
verified
CompressedGemma
commited on
3 days ago
ALPHA
28f242e
verified
CompressedGemma
commited on
3 days ago
Delete generate_imatrix.py
00ba2db
verified
CompressedGemma
commited on
21 days ago
Upload 5 files
7803d72
verified
CompressedGemma
commited on
21 days ago
Upload 3 files
766f12c
verified
CompressedGemma
commited on
28 days ago
Upload hpc_forward_merged.c
e9294cc
verified
CompressedGemma
commited on
28 days ago
Upload 2 files
414e1de
verified
CompressedGemma
commited on
29 days ago
Upload 2 files
7d55b19
verified
CompressedGemma
commited on
about 1 month ago
Qwen changes
6bf97ec
verified
CompressedGemma
commited on
about 1 month ago
Upload hpc_forward_merged.c
1581489
verified
CompressedGemma
commited on
about 1 month ago
Qwen attention tensors
44e6b86
verified
CompressedGemma
commited on
May 10
Update README.md
099fd3c
verified
CompressedGemma
commited on
May 8
Update README.md
5a67f67
verified
CompressedGemma
commited on
May 7
Fix OOM
c9097e7
verified
CompressedGemma
commited on
May 7
Fix os import
e81a80a
verified
CompressedGemma
commited on
May 7
Auto-load tokenizer for merge rules
0a9e7db
verified
CompressedGemma
commited on
May 7
Heavily experimental
20bea07
verified
CompressedGemma
commited on
May 7
This should do it
a5c5f6c
verified
CompressedGemma
commited on
May 7
Some tensors are transposed lmao
f67ea3a
verified
CompressedGemma
commited on
May 7
Wow Qwen......
965a465
verified
CompressedGemma
commited on
May 7
Qwen......
fca1031
verified
CompressedGemma
commited on
May 7
Qwen patches
8a88d87
verified
CompressedGemma
commited on
May 7
Experimental imatrix
9b8ff1c
verified
CompressedGemma
commited on
May 7
Upload generate_imatrix.py
60295b3
verified
CompressedGemma
commited on
May 7
Update README.md
2a6ab91
verified
CompressedGemma
commited on
May 7
Calibration data
b33a755
verified
CompressedGemma
commited on
May 7
Qwen fix
3883e8d
verified
CompressedGemma
commited on
May 7
Experimental
9303d37
verified
CompressedGemma
commited on
May 7
Update code comments
262cc7b
verified
CompressedGemma
commited on
May 7
Tensor tweak
dc3b370
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
5c1c396
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
ae8c38d
verified
CompressedGemma
commited on
May 7
Update README.md
dd6d6ba
verified
CompressedGemma
commited on
May 6
Update README.md
96fce02
verified
CompressedGemma
commited on
May 6
It's only calibrated for Gemma, atm.
07b428c
verified
CompressedGemma
commited on
May 6
initial commit
819eddd
verified
CompressedGemma
commited on
May 6