Push model using huggingface_hub.

Files changed (3)

README.md ADDED

+---
+tags:
+- model_hub_mixin
+- pytorch_model_hub_mixin
+---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Library: [More Information Needed]
+- Docs: [More Information Needed]

config.json ADDED

+{
+  "activation": "gelu",
+  "bias": false,
+  "d_model": 1536,
+  "dff": null,
+  "dropout_rate": 0.0,
+  "max_block_size": 1024,
+  "n_heads": 24,
+  "n_layers": 24,
+  "norm_first": true,
+  "pos_enc_type": "RoPE",
+  "use_flash_attention": true,
+  "vocab_size": 50304
+}
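
As a quick sanity check on the config above, the following sketch derives the per-head dimension and a rough parameter count. Several things here are assumptions, not facts from this commit: that `"dff": null` falls back to `4 * d_model`, that the feed-forward block is a plain two-matrix MLP, that the output head shares weights with the token embedding, and that RoPE adds no learned parameters. Normalization weights are ignored. The helper name `estimate_params` is hypothetical.

```python
import json

# Config exactly as added in this commit.
CONFIG_JSON = """
{
  "activation": "gelu",
  "bias": false,
  "d_model": 1536,
  "dff": null,
  "dropout_rate": 0.0,
  "max_block_size": 1024,
  "n_heads": 24,
  "n_layers": 24,
  "norm_first": true,
  "pos_enc_type": "RoPE",
  "use_flash_attention": true,
  "vocab_size": 50304
}
"""

config = json.loads(CONFIG_JSON)

d_model = config["d_model"]
n_heads = config["n_heads"]

# The model width must split evenly across attention heads.
if d_model % n_heads != 0:
    raise ValueError("d_model must be divisible by n_heads")
head_dim = d_model // n_heads

# ASSUMPTION: a null dff means "use 4 * d_model"; the commit does not say so.
dff = config["dff"] if config["dff"] is not None else 4 * d_model


def estimate_params(cfg: dict, dff: int) -> int:
    """Rough weight count under the assumptions stated in the lead-in."""
    d = cfg["d_model"]
    attention = 4 * d * d      # Q, K, V and output projections, no bias
    feed_forward = 2 * d * dff  # ASSUMPTION: two-matrix (non-gated) MLP, no bias
    per_layer = attention + feed_forward
    embedding = cfg["vocab_size"] * d  # ASSUMPTION: tied with the output head
    return cfg["n_layers"] * per_layer + embedding


approx_params = estimate_params(config, dff)

print(f"head_dim      = {head_dim}")
print(f"dff (assumed) = {dff}")
print(f"approx params = {approx_params:,}")
```

Under these assumptions the estimate lands near 757M parameters, which is in line with the roughly 3.03 GB `model.safetensors` file if the weights are stored as 4-byte floats. The dtype is also an assumption; the commit does not state it.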

model.safetensors ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:1de34e79ffb576f8c7bfbe5fa658186bcdbd4be22086f0796f40d73659c43a8f
+size 3027605256
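
The three lines above are a Git LFS pointer, not the weights themselves: the real file is stored separately and identified by its SHA-256 digest and byte size. A minimal sketch of how a downloaded copy could be checked against the pointer, using only the standard library, follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, as is any local file path you pass in.

```python
import hashlib
from pathlib import Path

# Pointer contents exactly as added in this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:1de34e79ffb576f8c7bfbe5fa658186bcdbd4be22086f0796f40d73659c43a8f
size 3027605256
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest named by the pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Stream in chunks so the whole file never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local download (the path is illustrative) returns `True` only when both the byte count and the digest agree with the pointer.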