Add variant of flan-t5-xl, especially model.safetensors

all_weights = {}
for filename in ["pytorch_model-00001-of-00002.bin", "pytorch_model-00002-of-00002.bin"]:
    _model = torch.load(filename, map_location=torch.device('cpu'))
    all_weights = { **all_weights, **_model }

from safetensors import safe_open
from safetensors.torch import save_file

save_file(all_weights, "/tmp/model.safetensors")

If so, how would I create the model.safetensors.index.json file?

lysandre

Google org Nov 28, 2023

@bayang it's a bit different, it's using transformers' from_pretrained and save_pretrained methods to take care of the sharding

lysandre

Google org Nov 28, 2023

The code is accessible, it's here: https://huggingface.co/spaces/safetensors/convert/blob/main/convert.py

bayang

Nov 28, 2023

@lysandre Oh I see, the space has also the code.
thanks.

bayang

Dec 9, 2023

@lysandre How can I merge both safetensors files, because I'm using this quantization technique from Huggingface Candle? and it loads from one file.

$ cargo run --example tensor-tools --release -- quantize --quantization q6k PATH/TO/T5/model.safetensors /tmp/model.gguf

bayang changed discussion status to open Dec 9, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment