Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
philschmid
/
falcon-40b-instruct-GPTQ-inference-endpoints
like
2
Text Generation
Transformers
tiiuae/falcon-refinedweb
English
RefinedWeb
custom_code
Inference Endpoints
text-generation-inference
4 papers
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
falcon-40b-instruct-GPTQ-inference-endpoints
2 contributors
History:
6 commits
philschmid
HF staff
Update handler.py
abdc7a2
12 months ago
.gitattributes
1.48 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
README.md
14.2 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
config.json
721 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
configuration_RW.py
2.51 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
generation_config.json
111 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
gptq_model-4bit--1g.safetensors
22.5 GB
LFS
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
handler.py
1.5 kB
Update handler.py
12 months ago
modelling_RW.py
47.1 kB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
quantize_config.json
183 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
requirements.txt
92 Bytes
Update requirements.txt
12 months ago
special_tokens_map.json
281 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
tokenizer.json
2.73 MB
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago
tokenizer_config.json
220 Bytes
Duplicate from TheBloke/falcon-40b-instruct-GPTQ
12 months ago