Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
pankajroark
/
llama-fp16-engine
like
0
Model card
Files
Files and versions
Community
bd472e5
llama-fp16-engine
1 contributor
History:
6 commits
pankajroark
update no-quant engine
bd472e5
12 months ago
7b-no-quant-tp1
update no-quant engine
12 months ago
7b-sq-int8kv-tp1
sq version
12 months ago
7b-sq-int8kv-tp8
tp8 checkpoint
12 months ago
.gitattributes
1.56 kB
checkpoint
12 months ago
.gitignore
5 Bytes
checkpoint
12 months ago