Add files using upload-large-folder tool
This view is limited to 50 files because it contains too many changes.
- .gitattributes +1 -0
- README.md +76 -0
- config.json +174 -0
- generation_config.json +12 -0
- hf_quant_config.json +14 -0
- model-00001-of-00085.safetensors +3 -0
- model-00003-of-00085.safetensors +3 -0
- model-00006-of-00085.safetensors +3 -0
- model-00007-of-00085.safetensors +3 -0
- model-00008-of-00085.safetensors +3 -0
- model-00009-of-00085.safetensors +3 -0
- model-00012-of-00085.safetensors +3 -0
- model-00014-of-00085.safetensors +3 -0
- model-00016-of-00085.safetensors +3 -0
- model-00019-of-00085.safetensors +3 -0
- model-00022-of-00085.safetensors +3 -0
- model-00024-of-00085.safetensors +3 -0
- model-00026-of-00085.safetensors +3 -0
- model-00028-of-00085.safetensors +3 -0
- model-00029-of-00085.safetensors +3 -0
- model-00030-of-00085.safetensors +3 -0
- model-00035-of-00085.safetensors +3 -0
- model-00036-of-00085.safetensors +3 -0
- model-00037-of-00085.safetensors +3 -0
- model-00040-of-00085.safetensors +3 -0
- model-00042-of-00085.safetensors +3 -0
- model-00044-of-00085.safetensors +3 -0
- model-00045-of-00085.safetensors +3 -0
- model-00051-of-00085.safetensors +3 -0
- model-00052-of-00085.safetensors +3 -0
- model-00055-of-00085.safetensors +3 -0
- model-00057-of-00085.safetensors +3 -0
- model-00060-of-00085.safetensors +3 -0
- model-00061-of-00085.safetensors +3 -0
- model-00063-of-00085.safetensors +3 -0
- model-00064-of-00085.safetensors +3 -0
- model-00065-of-00085.safetensors +3 -0
- model-00067-of-00085.safetensors +3 -0
- model-00068-of-00085.safetensors +3 -0
- model-00070-of-00085.safetensors +3 -0
- model-00074-of-00085.safetensors +3 -0
- model-00076-of-00085.safetensors +3 -0
- model-00078-of-00085.safetensors +3 -0
- model-00079-of-00085.safetensors +3 -0
- model-00080-of-00085.safetensors +3 -0
- model-00081-of-00085.safetensors +3 -0
- model-00082-of-00085.safetensors +3 -0
- model-00083-of-00085.safetensors +3 -0
- model-00084-of-00085.safetensors +3 -0
- tokenizer.json +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md
ADDED
@@ -0,0 +1,76 @@
+---
+license: mit
+base_model: zai-org/GLM-5.1
+tags:
+- nvidia
+- nvfp4
+- quantized
+- moe
+- modelopt
+- glm
+library_name: transformers
+pipeline_tag: text-generation
+---
+
+# CortexLM/GLM-5.1-NVFP4-MTP
+
+NVFP4-quantized version of [zai-org/GLM-5.1](https://huggingface.co/zai-org/GLM-5.1), a 754B-parameter Mixture-of-Experts language model with 256 routed experts per layer.
+
+Quantized using [NVIDIA Model Optimizer (modelopt)](https://github.com/NVIDIA/Model-Optimizer) with full activation calibration on all 58,459 linear modules, including every individual routed expert.
+
+## Model Details
+
+| Property | Value |
+|---|---|
+| **Base model** | [zai-org/GLM-5.1](https://huggingface.co/zai-org/GLM-5.1) |
+| **Architecture** | GlmMoeDsaForCausalLM (754B MoE) |
+| **Layers** | 78 transformer layers + 1 MTP layer |
+| **Experts** | 256 routed + 1 shared per MoE layer (layers 3-77) |
+| **Hidden size** | 6144 |
+| **Context length** | 202,752 tokens |
+| **Quantization** | NVFP4 (4-bit float weights, FP8 block scales, group size 16) |
+| **KV cache** | FP8 quantized |
+| **MTP layer** | BF16 (stored separately in `mtp.safetensors`) |
+| **Total size** | ~441 GB (vs. 1.4 TB for the BF16 original) |
+
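The ~441 GB figure is consistent with back-of-the-envelope arithmetic. A sketch (assuming essentially all 754B parameters are stored as NVFP4, which slightly overstates the quantized share; the BF16 parts such as `lm_head`, embeddings, and norms account for most of the remainder):

```python
# Rough NVFP4 checkpoint-size estimate for a 754B-parameter model.
params = 754e9
bytes_weights = params * 0.5   # two FP4 values packed per byte
bytes_scales = params / 16     # one float8_e4m3fn scale per block of 16
est_gb = (bytes_weights + bytes_scales) / 1e9
print(round(est_gb))  # 424
```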
+## Quantization Details
+
+This model was quantized using NVIDIA's official [Model Optimizer](https://github.com/NVIDIA/Model-Optimizer) (`modelopt`) NVFP4 pipeline with per-expert calibration:
+
+- **Quantization format**: NVFP4 -- 4-bit floating point with FP8 per-block scaling factors (`float8_e4m3fn`) and a global FP32 `weight_scale_2`; block size 16
+- **Calibration**: 256 samples from [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) and [nvidia/Nemotron-Post-Training-Dataset-v2](https://huggingface.co/datasets/nvidia/Nemotron-Post-Training-Dataset-v2) (chat, code, math, and stem splits), sequence length 2048
+- **Quantized modules**: 58,459 `nn.Linear` modules; all 256 routed experts per layer are quantized individually, each with a calibrated `input_scale` (activation statistics)
+- **KV cache**: FP8 cast quantization on all attention layers
+- **Excluded**: `lm_head` (kept in BF16)
+- **MTP**: Multi-Token Prediction layer (layer 78) kept in BF16 as a separate `mtp.safetensors` file (19.9 GB)
+- **Hardware**: 8x NVIDIA B300 SXM6 275GB GPUs
+- **Calibration time**: ~21 minutes
+- **modelopt version**: 0.42.0.dev (built from source, April 2026)
+- **transformers version**: 5.5.0
+
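Conceptually, activation calibration tracks the maximum absolute activation (amax) flowing into each linear module and derives its `input_scale` from it. A toy numpy sketch of that idea -- not the modelopt implementation; the amax-to-scale mapping below assumes the common NVFP4 convention of dividing by the product of the FP4 and FP8 maxima (6 x 448):

```python
import numpy as np

FP4_MAX, FP8_E4M3_MAX = 6.0, 448.0

def calibrate_input_scales(activations_per_module):
    """Toy per-module max calibration.

    activations_per_module: dict mapping a module name (e.g. one routed
    expert's up_proj) to the list of activation batches it saw during
    calibration. Returns one global input scale per module.
    """
    scales = {}
    for name, batches in activations_per_module.items():
        amax = max(float(np.abs(b).max()) for b in batches)
        scales[name] = amax / (FP4_MAX * FP8_E4M3_MAX)
    return scales
```

Per-expert calibration matters in MoE models because each routed expert sees a different activation distribution; a single shared scale would clip the outliers of some experts.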
+### Weight format
+
+Each quantized linear layer is stored as:
+- `weight`: `uint8` (two FP4 values packed per byte)
+- `weight_scale`: `float8_e4m3fn` (per-block FP8 scale, one per 16 elements)
+- `weight_scale_2`: `float32` scalar (global per-tensor scale)
+- `input_scale`: `float32` scalar (calibrated activation scale, where applicable)
+
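A minimal numpy sketch of how these tensors combine at dequantization time. The nibble order (low nibble holds the even-indexed element) and the flat per-row scale layout are assumptions for illustration; actual kernels may differ:

```python
import numpy as np

# FP4 E2M1 code -> value lookup (sign bit is the high bit of the nibble).
FP4_LUT = np.array(
    [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0,
     -0.0, -0.5, -1.0, -1.5, -2.0, -3.0, -4.0, -6.0],
    dtype=np.float32,
)

def dequant_nvfp4(packed, weight_scale, weight_scale_2, block=16):
    """Dequantize a packed NVFP4 row.

    packed: uint8 array, two FP4 codes per byte.
    weight_scale: per-block scales (float8_e4m3fn on disk, passed here
                  already cast to float32), one per `block` elements.
    weight_scale_2: global float32 scalar scale.
    """
    lo = packed & 0x0F            # even-indexed elements (assumed order)
    hi = packed >> 4              # odd-indexed elements
    codes = np.empty(packed.size * 2, dtype=np.uint8)
    codes[0::2] = lo
    codes[1::2] = hi
    vals = FP4_LUT[codes]
    scales = np.repeat(np.asarray(weight_scale, dtype=np.float32), block)
    return vals * scales * weight_scale_2
```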
+## Usage
+
+This checkpoint is designed for inference engines that support the NVFP4 format, such as [TensorRT-LLM](https://github.com/NVIDIA/TensorRT-LLM) and [vLLM](https://github.com/vllm-project/vllm) with the modelopt backend.
+
+## Files
+
+- 85 model shards (`model-00001-of-00085.safetensors` to `model-00085-of-00085.safetensors`) -- NVFP4-quantized layers 0-77
+- `mtp.safetensors` -- BF16 Multi-Token Prediction layer (layer 78, 791 keys, 19.9 GB)
+- `model.safetensors.index.json` -- shard index mapping
+- `config.json` -- model configuration with `quantization_config`
+- `hf_quant_config.json` -- NVFP4 quantization metadata
+- `tokenizer.json`, `tokenizer_config.json` -- tokenizer files
+- `generation_config.json` -- generation defaults
+
+## Acknowledgements
+
+- Base model by [ZhipuAI](https://huggingface.co/zai-org)
+- Quantization tooling by [NVIDIA Model Optimizer](https://github.com/NVIDIA/Model-Optimizer)
config.json
ADDED
@@ -0,0 +1,174 @@
+{
+  "architectures": [
+    "GlmMoeDsaForCausalLM"
+  ],
+  "attention_bias": false,
+  "attention_dropout": 0.0,
+  "bos_token_id": 0,
+  "dtype": "bfloat16",
+  "eos_token_id": [
+    154820,
+    154827,
+    154829
+  ],
+  "ep_size": 1,
+  "first_k_dense_replace": 3,
+  "hidden_act": "silu",
+  "hidden_size": 6144,
+  "index_head_dim": 128,
+  "index_n_heads": 32,
+  "index_topk": 2048,
+  "indexer_rope_interleave": true,
+  "initializer_range": 0.02,
+  "intermediate_size": 12288,
+  "kv_lora_rank": 512,
+  "max_position_embeddings": 202752,
+  "mlp_layer_types": [
+    "dense",
+    "dense",
+    "dense",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse",
+    "sparse"
+  ],
+  "model_type": "glm_moe_dsa",
+  "moe_intermediate_size": 2048,
+  "moe_layer_freq": 1,
+  "n_group": 1,
+  "n_routed_experts": 256,
+  "n_shared_experts": 1,
+  "norm_topk_prob": true,
+  "num_attention_heads": 64,
+  "num_experts_per_tok": 8,
+  "num_hidden_layers": 78,
+  "num_key_value_heads": 64,
+  "num_nextn_predict_layers": 1,
+  "pad_token_id": 154820,
+  "pretraining_tp": 1,
+  "q_lora_rank": 2048,
+  "qk_head_dim": 256,
+  "qk_nope_head_dim": 192,
+  "qk_rope_head_dim": 64,
+  "rms_norm_eps": 1e-05,
+  "rope_interleave": true,
+  "rope_parameters": {
+    "rope_theta": 1000000,
+    "rope_type": "default"
+  },
+  "routed_scaling_factor": 2.5,
+  "scoring_func": "sigmoid",
+  "tie_word_embeddings": false,
+  "topk_group": 1,
+  "topk_method": "noaux_tc",
+  "transformers_version": "5.5.0",
+  "use_cache": true,
+  "v_head_dim": 256,
+  "vocab_size": 154880,
+  "quantization_config": {
+    "config_groups": {
+      "group_0": {
+        "input_activations": {
+          "dynamic": false,
+          "num_bits": 4,
+          "type": "float",
+          "group_size": 16
+        },
+        "weights": {
+          "dynamic": false,
+          "num_bits": 4,
+          "type": "float",
+          "group_size": 16
+        },
+        "targets": [
+          "Linear"
+        ]
+      }
+    },
+    "ignore": [
+      "lm_head"
+    ],
+    "quant_algo": "NVFP4",
+    "kv_cache_scheme": {
+      "dynamic": false,
+      "num_bits": 8,
+      "type": "float"
+    },
+    "producer": {
+      "name": "modelopt",
+      "version": "0.0.1.dev1+g5dc17dfd1.d20260407"
+    },
+    "quant_method": "modelopt"
+  }
+}
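Several of these fields determine one another. A quick sanity-check sketch over the values above:

```python
# Structural sanity checks on config.json values (copied from above).
num_hidden_layers = 78
first_k_dense_replace = 3
n_routed_experts = 256
num_experts_per_tok = 8
n_shared_experts = 1
mlp_layer_types = ["dense"] * 3 + ["sparse"] * 75  # 78 entries, as above

sparse_layers = mlp_layer_types.count("sparse")
assert sparse_layers == num_hidden_layers - first_k_dense_replace  # 75 MoE layers

routed_experts_total = sparse_layers * n_routed_experts
experts_active_per_token = num_experts_per_tok + n_shared_experts
print(routed_experts_total, experts_active_per_token)  # 19200 9
```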
generation_config.json
ADDED
@@ -0,0 +1,12 @@
+{
+  "_from_model_config": true,
+  "eos_token_id": [
+    154820,
+    154827,
+    154829
+  ],
+  "pad_token_id": 154820,
+  "temperature": 1.0,
+  "top_p": 0.95,
+  "transformers_version": "5.4.0"
+}
hf_quant_config.json
ADDED
@@ -0,0 +1,14 @@
+{
+  "producer": {
+    "name": "modelopt",
+    "version": "0.0.1.dev1+g5dc17dfd1.d20260407"
+  },
+  "quantization": {
+    "quant_algo": "NVFP4",
+    "kv_cache_quant_algo": "FP8",
+    "group_size": 16,
+    "exclude_modules": [
+      "lm_head"
+    ]
+  }
+}
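One way a loader might consume this metadata -- a sketch using simple wildcard matching on `exclude_modules` (real engines may apply more elaborate pattern rules):

```python
import fnmatch

# Quantization metadata copied from hf_quant_config.json above.
hf_quant_config = {
    "quantization": {
        "quant_algo": "NVFP4",
        "kv_cache_quant_algo": "FP8",
        "group_size": 16,
        "exclude_modules": ["lm_head"],
    }
}

def is_quantized(module_name, cfg=hf_quant_config):
    """True unless the module matches an exclude_modules pattern."""
    patterns = cfg["quantization"]["exclude_modules"]
    return not any(fnmatch.fnmatch(module_name, p) for p in patterns)

print(is_quantized("lm_head"))                               # False
print(is_quantized("model.layers.5.mlp.experts.0.up_proj"))  # True
```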
model-00001-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fbb7f509b7f0c5b841d3b7c39d47e0c9cde54d8a990877deba45c1412afb6db4
+size 4999575736
model-00003-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d1367c56079a0e47472f213643eb37032d4960298d8d73181ab895b5404a3a7d
+size 4999506536
model-00006-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5156f32255d87f214611a3830f07ce283ae9201e4121560abe2aec9ed89f282e
+size 4997327120
model-00007-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e7f527633e1c4d01e99de355930cb2ddaf031a6ff7514313ea0da6a89170c6ab
+size 4999507056
model-00008-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:79e1f8adf000a74d5b7c64724526fcbcd97a76fe4b073b259416e4e82dc2fefe
+size 4999505068
model-00009-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1999bf42c9b756d0cfc3f30083b46453baced237b67320a2279d48ff4c090d99
+size 4999499836
model-00012-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:58aa0c1ede39f29c9b97cc9dadae4987ae79501d808b1a4d184e9108ef6f7d5b
+size 4999482268
model-00014-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bc17d55f89dec89daa8cf220c5a2fa8fc4b3adf10fb60efe2d22e1f250381f05
+size 4999489336
model-00016-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5a15553828325040c1ee1749c64e1e28fe25567b337a1ef3ca5070ba5a12c434
+size 4997301720
model-00019-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:98379a62ab06349dc904647ad67ce405e1fc4a5d641112976eb739e0e65920c6
+size 4999468960
model-00022-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d9c028c69e425ae4da9a5d810504ece10f37e9b11d8ed3524c11ea49ebc7624b
+size 4999465028
model-00024-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f7bfac22237d6b894c481a5e42a178a42891f6843a9468b247755ac9e1a5bb4b
+size 4999466220
model-00026-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c5640c129d9245d127714bfcb2a1aeb5627762a06107c027578786a8c41e146e
+size 4997286104
model-00028-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6f32eba103ec07dad22c8396f72e63f54fc32873d1ec5a1c671c1c6e301a08e1
+size 4999470076
model-00029-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ed9118a7b7594bbd58c7dc0778196f67482e0a3ce1bb0c8ef72491c178dfc95e
+size 4999469424
model-00030-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:40407806babbdd63a0d9bacaa2fac8e904fbdf84dc446b4c5aa610aa95dc7680
+size 4999464644
model-00035-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:286e3628620ad0c5353b25e7eadb104d3594fe826d68402d3e392dc720ad7f08
+size 4958797740
model-00036-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:02b45ef2085482defb217c894de566afa3ad97cf9a8cffa549ef88a6138a142b
+size 4995468148
model-00037-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c78128b323fd9f3bd26f6187908204e934260a36582ff2886e512b4509ab7ee8
+size 4999456216
model-00040-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8753dc763c93624cdbd95629da26aefabf949479c12646fe4e35d5efa7925e4b
+size 4999454044
model-00042-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b504eaf7f0c0a306a67936047a2167a72f7b53cb187889db9973c3527aa32d7b
+size 4999457936
model-00044-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a05336cc25dd8fa1d8b55f2da663642ab4dcf4e36cc25af2a7f1a16b6ad32a43
+size 4999462320
model-00045-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:74cca9c32cf6aa04f91aeb8788a828805908501bae863c71a74e2ba3018b83e6
+size 4999635488
model-00051-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c1d7ed232958e9530e884e43770f2f05a0ce2f50511df86bfa70baf040ceb41f
+size 4999462704
model-00052-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:78bb941fba46577d93bee6d875040085583e9e60b60a506eb4ad1dc9675f9c87
+size 4999461304
model-00055-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:94bc426b001bd74e6e603ae0ae0acb2868ff0181256d6cc3beb10d060ab341ac
+size 4997280008
model-00057-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eeff47e54102e6902c4d2ec24b40c59208c656db59d4c81f0a96f2c6f8217630
+size 4999463932
model-00060-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6a597c3beb4a96b763d185ed3361766605ac476c60ed32351fc881a4f448f984
+size 4999461924
model-00061-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:79c136378840d5731f500630be9c456ee5de5b53565e1e5475370e7db8dcc0dd
+size 4999460488
model-00063-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bafa6f7ffa649825bca670cb30792561e94565a72ad7d558f1ea2fbfe44e6608
+size 4999460276
model-00064-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cb55d1759959aa48fc9dde7101eaaa251d8b1c478d6b432695ed66607510b05c
+size 4999460784
model-00065-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7f9af318aa69404d967102fa7ec506014e12ece81a913a72795f0f82da43024c
+size 4997280440
model-00067-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2dd8d6bb7e2aa9475ed2fd404a66582c06e328afd8edc42638d5127ebd200f9f
+size 4999462496
model-00068-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5e0d618b43e3c9675bc0908efb8fc5f8c1260a7c03b66bed27fd459be850e443
+size 4999465208
model-00070-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:500f99068f1393bebdc675ac2f396f8ef13e78f8a44c163e5f8ae78a34ac23dd
+size 4999459576
model-00074-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c1d8bb4d2c1f4158625e5c9627023fc989669091a22e63a317a9c7c30a27e336
+size 4999463464
model-00076-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8deef5978ca7a367536eb08365f9c56f3c9731101e62546910f8348d27db28c5
+size 4999458732
model-00078-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8876ea722058351980502738475600111e3fab2aa894e6e729317a895c32590b
+size 4999460680
model-00079-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8940a8c70a1ff20a5cb2330bf5b763e98134afc6b1070650b096d9924df0a47c
+size 4999463284
model-00080-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f91f5a61ef7343849c157e0d36e4cb23e4b7e63d6b94e6f40dc7d8143946e0f2
+size 4999461600
model-00081-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:56049d50b2ac490f83d0adacfa3db6e00739f0a321e22ba2c230ecb57cad1b3b
+size 4999458340
model-00082-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:41cd87cfbf73f0745451e91cdd27f3680a71d0356fef14a22067206cc7efb12a
+size 4999458772
model-00083-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bdad5e268e84b774f5340e920029c70f88b88798be890f45f1b16dde6b49a0c0
+size 4999459488
model-00084-of-00085.safetensors
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:14433c686404beedadc58fdbc21af6eeb5dcf297f028fec04642fd8c58a144d8
+size 4582843272
tokenizer.json
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19e773648cb4e65de8660ea6365e10acca112d42a854923df93db4a6f333a82d
+size 20217442
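Each safetensors entry above is a Git LFS pointer file (spec v1), not the weights themselves; a minimal parser, using the `tokenizer.json` pointer above as input:

```python
def parse_lfs_pointer(text):
    """Parse a git-lfs pointer file ("key value" lines) into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    return {
        "version": fields["version"],
        "oid": fields["oid"].split(":", 1)[1],  # strip the "sha256:" prefix
        "size": int(fields["size"]),
    }

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:19e773648cb4e65de8660ea6365e10acca112d42a854923df93db4a6f333a82d
size 20217442"""
print(parse_lfs_pointer(pointer)["size"])  # 20217442
```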