Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
4-bit precision

Commit History

Update base_model formatting
576a9b3

TheBloke committed on

Upload README.md
c753e6d

TheBloke committed on

Upload README.md
c01399f

TheBloke committed on

Upload README.md
0921f35

TheBloke committed on

Upload README.md
2be41c4

TheBloke committed on

Update for Transformers GPTQ support
4e8d76e

TheBloke committed on

Update README.md
123edfd

TheBloke committed on

Update README.md
6b2ed28

TheBloke committed on

Update README.md
7a1cec0

TheBloke committed on

Delete stablebeluga2.ggmlv3.q3_K_S.bin
9fed75e

TheBloke committed on

Delete stablebeluga2.ggmlv3.q2_K.bin
ae5a0ff

TheBloke committed on

Initial GGML model commit
984f167

TheBloke committed on

Initial GGML model commit
ecca7f1

TheBloke committed on

Update README.md
1d81439

TheBloke committed on

Fix documentation for loading the model, since the fused attention module doesn't work here either. (#4)
7b2ce68

TheBloke mber committed on

Delete gptq_model-4bit-128g.safetensors
8aba4ed

TheBloke committed on

Initial GPTQ model commit
529499f

TheBloke committed on

Initial GPTQ model commit
2c09584

TheBloke committed on

Initial GPTQ model commit
7a5cbca

TheBloke committed on

Initial GPTQ model commit
45bffd9

TheBloke committed on

Initial GPTQ model commit
02f9b5a

TheBloke committed on

Initial GPTQ model commit
b3648da

TheBloke committed on

Initial GPTQ model commit
237b88d

TheBloke committed on

Initial GPTQ model commit
9985d30

TheBloke committed on

Initial GPTQ model commit
c5698f3

TheBloke committed on

Initial GPTQ model commit
dc16fac

TheBloke committed on

Initial GPTQ model commit
2236007

TheBloke committed on

Initial GPTQ model commit
458ab3e

TheBloke committed on

Initial GPTQ model commit
ecbe5a3

TheBloke committed on

Update README.md
0a44e6a

TheBloke committed on

Initial GPTQ model commit
e244efc

TheBloke committed on

Initial GPTQ model commit
0a04528

TheBloke committed on

Initial GPTQ model commit
cdbafe6

TheBloke committed on

Initial GPTQ model commit
27f940d

TheBloke committed on

Initial GPTQ model commit
0b7d74e

TheBloke committed on

Initial GPTQ model commit
babc17d

TheBloke committed on

Initial GPTQ model commit
293cf2b

TheBloke committed on

Initial GPTQ model commit
09e32c1

TheBloke committed on

Initial GPTQ model commit
16ffd8e

TheBloke committed on

Initial GPTQ model commit
01e0a31

TheBloke committed on

Initial GPTQ model commit
ba41eb6

TheBloke committed on

Initial GPTQ model commit
772b74d

TheBloke committed on

Initial GPTQ model commit
de780f6

TheBloke committed on

Initial GPTQ model commit
90ca4cf

TheBloke committed on

Initial GPTQ model commit
679d867

TheBloke committed on

Initial GPTQ model commit
05db291

TheBloke committed on

Initial GPTQ model commit
d015905

TheBloke committed on

Initial GPTQ model commit
2e2dc08

TheBloke committed on

Initial GPTQ model commit
2afbe80

TheBloke committed on

Initial GPTQ model commit
c302438

TheBloke committed on