twhoool02
/

Falcon-7B-instruct-NF4

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Model Card for Falcon-7B-instruct-NF4

Model Details

This model is a NF4 quantized version of the tiiuae/falcon-7b-instruct model.

Developed by: Ted Whooley
Library: Transformers, NF4
Model type: falcon
Model name: Falcon-7B-instruct-NF4
Pipeline tag: text-generation
Qunatized by: twhoool02
Language(s) (NLP): en
License: other

Downloads last month: 1

Safetensors

Model size

3.71B params

Tensor type

F32

·

FP16

·

U8

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for twhoool02/Falcon-7B-instruct-NF4

Base model

tiiuae/falcon-7b-instruct

Quantized

(17)

this model