opt-125m-bnb-4bit / README.md

facebook/OPT-125m quantized with bitsandbytes 4-bit NF4 quantization. Licensing follows the underlying facebook/OPT-125m model.
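
For readers unfamiliar with NF4, here is a minimal pure-Python sketch of the general idea: each block of weights is scaled by its absolute maximum, and every value is stored as the index of the nearest entry in a 16-level code table. This is not the bitsandbytes implementation. The table values are rounded approximations of the NF4 levels and should be treated as illustrative, and the names `NF4_QUANT_MAP`, `quantize_block`, and `dequantize_block` are hypothetical. The lookup table presumably plays the role of the "quant_map" item stored with the quantized weights.

```python
# Approximate NF4 code table: 16 levels in [-1, 1] (rounded, illustrative only).
NF4_QUANT_MAP = [
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
    0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
]


def quantize_block(block):
    """Return (4-bit codes, absmax) for one block of float weights."""
    # Scale by the block's absolute maximum; fall back to 1.0 for an all-zero block.
    absmax = max(abs(x) for x in block) or 1.0
    codes = []
    for x in block:
        scaled = x / absmax
        # Pick the index of the closest table entry.
        codes.append(min(range(16), key=lambda i: abs(NF4_QUANT_MAP[i] - scaled)))
    return codes, absmax


def dequantize_block(codes, absmax):
    """Look each code up in the table and undo the absmax scaling."""
    return [NF4_QUANT_MAP[c] * absmax for c in codes]


weights = [0.12, -0.50, 0.03, 0.25]
codes, absmax = quantize_block(weights)
restored = dequantize_block(codes, absmax)
```

Dequantization needs both the per-block `absmax` and the code table, so a checkpoint that loses the table cannot be decoded from the 4-bit codes alone.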

This model is a work in progress. Use it only for testing the pull request https://github.com/huggingface/transformers/pull/26037.

Known issue: safetensors deleted the linked "quant_map" / "nested_quant_map" items.