Metadata Parsing

Because the format is so simple, metadata about Safetensors weights (the list of tensors, their dtypes, and their shapes or parameter counts) can be fetched and parsed efficiently using small HTTP Range requests.

This parsing has been implemented in JS in huggingface.js (sample code follows below), but it would be similar in any language.

Example use case

There are many potential use cases. For instance, we use it on the HuggingFace Hub to display information about models that have safetensors weights.

Usage

http

javascript

python
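
The following is a minimal sketch of the approach described above, not the huggingface.js implementation. It relies on the safetensors layout: the file starts with an 8-byte little-endian unsigned integer giving the size of the JSON header, followed by the header itself. The function names (`parse_header`, `fetch_range`, `fetch_header`) are hypothetical, and the URL of the weights file is left as a parameter. The network helpers assume the server honors Range requests and answers with status 206.

```python
import json
import struct
import urllib.request


def parse_header(prefix: bytes) -> dict:
    """Parse the JSON header from the leading bytes of a safetensors file."""
    if len(prefix) < 8:
        raise ValueError("need at least 8 bytes to read the header length")
    # The first 8 bytes hold the header size as a little-endian unsigned 64-bit integer.
    (header_len,) = struct.unpack("<Q", prefix[:8])
    if len(prefix) < 8 + header_len:
        raise ValueError("prefix does not contain the full header")
    return json.loads(prefix[8 : 8 + header_len].decode("utf-8"))


def fetch_range(url: str, start: int, end: int) -> bytes:
    """Fetch bytes start..end (inclusive) of a remote file with a Range request."""
    req = urllib.request.Request(url, headers={"Range": f"bytes={start}-{end}"})
    with urllib.request.urlopen(req) as resp:
        # Assumption: a server that supports Range replies with 206 Partial Content.
        if resp.status != 206:
            raise RuntimeError(f"expected 206 Partial Content, got {resp.status}")
        return resp.read(end - start + 1)


def fetch_header(url: str) -> dict:
    """Read only the header of a remote safetensors file, using two small requests."""
    (header_len,) = struct.unpack("<Q", fetch_range(url, 0, 7))
    header_bytes = fetch_range(url, 8, 8 + header_len - 1)
    return json.loads(header_bytes.decode("utf-8"))


# Offline demonstration on a tiny in-memory file with one hypothetical tensor.
header = {"weight": {"dtype": "F32", "shape": [2, 3], "data_offsets": [0, 24]}}
blob = json.dumps(header).encode("utf-8")
fake_file = struct.pack("<Q", len(blob)) + blob + bytes(24)  # 24 bytes of tensor data
parsed = parse_header(fake_file)
print(parsed["weight"]["dtype"], parsed["weight"]["shape"])
```

Two requests keep the transfer to the header alone. A single request for a fixed-size prefix, passed to `parse_header`, would also work whenever the header fits inside that prefix.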

Example output

For instance, here is the number of parameters per dtype for a few models on the HuggingFace Hub. Also see this issue for more examples of usage.

model	safetensors	params
gpt2	single-file	{ 'F32' => 137022720 }
roberta-base	single-file	{ 'F32' => 124697433, 'I64' => 514 }
Jean-Baptiste/camembert-ner	single-file	{ 'F32' => 110035205, 'I64' => 514 }
roberta-large	single-file	{ 'F32' => 355412057, 'I64' => 514 }
distilbert-base-german-cased	single-file	{ 'F32' => 67431550 }
EleutherAI/gpt-neox-20b	sharded	{ 'F16' => 20554568208, 'U8' => 184549376 }
bigscience/bloom-560m	single-file	{ 'F16' => 559214592 }
bigscience/bloom	sharded	{ 'BF16' => 176247271424 }
bigscience/bloom-3b	single-file	{ 'F32' => 3002557440 }
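
Per-dtype counts like those in the table can be derived from a parsed header by multiplying out each tensor's shape and grouping by dtype. The sketch below assumes the header is already available as a Python dict; `params_per_dtype` and the tensor names are hypothetical, chosen only for illustration.

```python
from collections import Counter
from math import prod


def params_per_dtype(header: dict) -> dict:
    """Sum the number of elements of every tensor, grouped by dtype."""
    counts = Counter()
    for name, info in header.items():
        # "__metadata__" is an optional free-form entry, not a tensor.
        if name == "__metadata__":
            continue
        # prod([]) is 1, so a scalar tensor counts as one parameter.
        counts[info["dtype"]] += prod(info["shape"])
    return dict(counts)


# Hypothetical header with two F32 tensors and one I64 tensor.
example_header = {
    "__metadata__": {"format": "pt"},
    "encoder.weight": {"dtype": "F32", "shape": [4, 8], "data_offsets": [0, 128]},
    "encoder.bias": {"dtype": "F32", "shape": [8], "data_offsets": [128, 160]},
    "position_ids": {"dtype": "I64", "shape": [1, 514], "data_offsets": [160, 4272]},
}
counts = params_per_dtype(example_header)
print(counts)
```

For a sharded checkpoint, the same count would presumably be computed on each shard's header and the results summed.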
