philschmid
/

falcon-40b-instruct-GPTQ-inference-endpoints

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

falcon-40b-instruct-GPTQ-inference-endpoints

2 contributors

History: 6 commits

philschmid's picture

philschmid HF staff

Update handler.py

abdc7a2 about 1 year ago

.gitattributes

1.48 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
README.md

14.2 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
config.json

721 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
configuration_RW.py

2.51 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
generation_config.json

111 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
gptq_model-4bit--1g.safetensors

22.5 GB
LFS

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
handler.py

1.5 kB

Update handler.py about 1 year ago
modelling_RW.py

47.1 kB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
quantize_config.json

183 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
requirements.txt

92 Bytes

Update requirements.txt about 1 year ago
special_tokens_map.json

281 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
tokenizer.json

2.73 MB

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago
tokenizer_config.json

220 Bytes

Duplicate from TheBloke/falcon-40b-instruct-GPTQ about 1 year ago