TheBloke/falcon-40b-instruct-GPTQ
Text Generation
Transformers
Safetensors
tiiuae/falcon-refinedweb
English
RefinedWeb
custom_code
text-generation-inference
4-bit precision
gptq
arxiv: 4 papers
License: apache-2.0
2 contributors
History: 43 commits
Latest commit: TheBloke, "Update README.md" (e1ef23d), over 1 year ago
File                             Size           Last commit message                  Updated
.gitattributes                   1.48 kB        initial commit                       over 1 year ago
README.md                        15.4 kB        Update README.md                     over 1 year ago
config.json                      721 Bytes      Update config.json (#10)             over 1 year ago
configuration_RW.py              2.51 kB        Upload folder using huggingface_hub  over 1 year ago
generation_config.json           111 Bytes      Initial AutoGPTQ model commit        over 1 year ago
gptq_model-4bit--1g.safetensors  22.5 GB (LFS)  Initial AutoGPTQ model commit        over 1 year ago
modelling_RW.py                  47.1 kB        Initial AutoGPTQ model commit        over 1 year ago
quantize_config.json             183 Bytes      Update quantize_config.json          over 1 year ago
special_tokens_map.json          281 Bytes      Initial AutoGPTQ model commit        over 1 year ago
tokenizer.json                   2.73 MB        Initial AutoGPTQ model commit        over 1 year ago
tokenizer_config.json            220 Bytes      Initial AutoGPTQ model commit        over 1 year ago