Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
4-bit precision
gptq

Commit History

Update base_model formatting
576a9b3

TheBloke commited on

Upload README.md
c01399f

TheBloke commited on

Upload README.md
2be41c4

TheBloke commited on

Update for Transformers GPTQ support
4e8d76e

TheBloke commited on

Update README.md
123edfd

TheBloke commited on

Update README.md
6b2ed28

TheBloke commited on

Update README.md
7a1cec0

TheBloke commited on

Update README.md
1d81439

TheBloke commited on

fix documentation for loading the model, since the fused attention module doesnt work here either. (#4)
7b2ce68

TheBloke mber commited on

Initial GPTQ model commit
2c09584

TheBloke commited on

Update README.md
0a44e6a

TheBloke commited on

Initial GPTQ model commit
0a04528

TheBloke commited on

Initial GPTQ model commit
679d867

TheBloke commited on

Initial GPTQ model commit
d015905

TheBloke commited on

Initial GPTQ model commit
2afbe80

TheBloke commited on