Upload folder using huggingface_hub

Files changed (6) hide show

README.md ADDED Viewed

+---
+library_name: keras-hub
+---
+This is a [`Mistral` model](https://keras.io/api/keras_hub/models/mistral) uploaded using the KerasHub library and can be used with JAX, TensorFlow, and PyTorch backends.
+Model config:
+* **name:** mistral_backbone_1
+* **trainable:** True
+* **vocabulary_size:** 32000
+* **num_layers:** 32
+* **num_query_heads:** 32
+* **hidden_dim:** 4096
+* **intermediate_dim:** 14336
+* **rope_max_wavelength:** 1000000.0
+* **rope_scaling_factor:** 1.0
+* **num_key_value_heads:** 8
+* **sliding_window:** None
+* **layer_norm_epsilon:** 1e-05
+* **dropout:** 0
+This model card has been generated automatically and should be completed by the model author. See [Model Cards documentation](https://huggingface.co/docs/hub/model-cards) for more information.

assets/tokenizer/vocabulary.spm ADDED Viewed

Binary file (493 kB). View file

config.json ADDED Viewed

+{
+    "module": "keras_nlp.models.mistral.mistral_backbone",
+    "class_name": "MistralBackbone",
+    "config": {
+        "name": "mistral_backbone_1",
+        "trainable": true,
+        "vocabulary_size": 32000,
+        "num_layers": 32,
+        "num_query_heads": 32,
+        "hidden_dim": 4096,
+        "intermediate_dim": 14336,
+        "rope_max_wavelength": 1000000.0,
+        "rope_scaling_factor": 1.0,
+        "num_key_value_heads": 8,
+        "sliding_window": null,
+        "layer_norm_epsilon": 1e-05,
+        "dropout": 0
+    },
+    "registered_name": "keras_nlp>MistralBackbone",
+    "assets": [],
+    "weights": "model.weights.h5"
+}

metadata.json ADDED Viewed

+{
+    "keras_version": "3.0.5",
+    "keras_nlp_version": "0.9.0",
+    "parameter_count": 7241732096,
+    "date_saved": "2024-03-25@19:58:40"
+}

model.weights.h5 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:83e29a61bce2736790afa6479d25b1d6c6a0a5c89d9892cdca786ed5bd49f820
+size 14484497176

tokenizer.json ADDED Viewed

+{
+    "module": "keras_nlp.models.mistral.mistral_tokenizer",
+    "class_name": "MistralTokenizer",
+    "config": {
+        "name": "mistral_tokenizer",
+        "trainable": true,
+        "dtype": "int32",
+        "proto": null,
+        "sequence_length": null
+    },
+    "registered_name": "keras_nlp>MistralTokenizer",
+    "assets": [
+        "assets/tokenizer/vocabulary.spm"
+    ],
+    "weights": null
+}