Adding `safetensors` variant of this model

#5
by SFconvertbot

This is an automated PR created with https://huggingface.co/spaces/safetensors/convert

This new file is equivalent to `pytorch_model.bin`, but safe in the sense that
no arbitrary code can be embedded in it and executed when it is loaded.

These files also happen to load much faster than their PyTorch counterparts:
https://colab.research.google.com/github/huggingface/notebooks/blob/main/safetensors_doc/en/speed.ipynb
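For reference, a minimal sketch of the two load paths (file names follow the standard Hub conventions and are assumed to be downloaded locally; this is an illustration, not something run by this PR):

```python
import torch
from safetensors.torch import load_file

# Pickle-based checkpoint: torch.load unpickles the file, and a pickle
# stream can execute arbitrary code, so the source must be trusted.
pt_state = torch.load("pytorch_model.bin", map_location="cpu")

# safetensors checkpoint: a flat tensor container parsed without
# unpickling, so no code can run while loading it.
sf_state = load_file("model.safetensors", device="cpu")
```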

The widgets on your model page will run using this file even before the PR is merged,
which verifies that the file actually works.

If you find any issues, please report them here: https://huggingface.co/spaces/safetensors/convert/discussions

Feel free to ignore this PR.

Friendly ping to @lysandre and @joaogante. Is it safe to merge this PR?

@osanseviero This is our own bot, so it should be! I see no limitations with gpt2 + safetensors on the transformers side.


Verified that the two checkpoints had equal layers of equal values, merging!
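(A rough sketch of such a check, assuming both files are downloaded locally; an illustration, not the exact script that was run:)

```python
import torch
from safetensors.torch import load_file

pt_state = torch.load("pytorch_model.bin", map_location="cpu")
sf_state = load_file("model.safetensors", device="cpu")

# Equal layers: the two checkpoints should expose the same tensor names
# (safetensors may omit duplicated tied tensors, so inspect any diff).
print("only in bin:", set(pt_state) - set(sf_state))
print("only in safetensors:", set(sf_state) - set(pt_state))

# Equal values: every shared tensor must match exactly.
for name in set(pt_state) & set(sf_state):
    assert torch.equal(pt_state[name], sf_state[name]), name
```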

lysandre changed pull request status to merged

YAYYYY

gpt2-xl is now safetensors activated


@lysandre we can automate that check, our PT -> TF script does it! I'm going to open a PR today (mostly copy/paste), so it can free up some more of the team's time ⏳


The safetensors Space also does it, but for super widely used checkpoints like this I find it important to double check 😀

In particular, an inference test is super important, as tied weights are not managed the same way in safetensors as in PyTorch's .bin weights.

See #26292 and #26422, which were necessary after merging safetensors weights.
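(To illustrate the tied-weight point, a minimal sketch, assuming a transformers version that accepts `use_safetensors`: GPT-2 ties the token embedding to the LM head, and safetensors refuses to store tensors that share memory, so the duplicate is dropped on save and must be re-tied on load.)

```python
import torch
from transformers import GPT2LMHeadModel

# Load from the safetensors weights; transformers re-ties the LM head
# to the token embedding after loading.
model = GPT2LMHeadModel.from_pretrained("gpt2-xl", use_safetensors=True)

wte, lm_head = model.transformer.wte.weight, model.lm_head.weight

# If tying worked, both names point at the same storage with equal values.
assert wte.data_ptr() == lm_head.data_ptr()
assert torch.equal(wte, lm_head)
```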

If your script would have prevented these from happening, I'd love to automate it!

Thanks

As discussed on Slack: the bot does double-check the inference code with the right architecture (here), but it doesn't check the hidden states. The hidden-states check will be added :)
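(A minimal sketch of what that hidden-states comparison could look like, assuming both weight formats live on the repo and `use_safetensors` selects between them:)

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2-xl")
inputs = tok("Hello world", return_tensors="pt")

runs = []
for use_sf in (False, True):  # .bin weights, then safetensors weights
    model = GPT2LMHeadModel.from_pretrained("gpt2-xl", use_safetensors=use_sf)
    model.eval()
    with torch.no_grad():
        runs.append(model(**inputs, output_hidden_states=True))

# Logits and every layer's hidden states should match across formats.
assert torch.allclose(runs[0].logits, runs[1].logits)
for h_bin, h_sf in zip(runs[0].hidden_states, runs[1].hidden_states):
    assert torch.allclose(h_bin, h_sf)
```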
