Nabushika committed
Commit 96fe798 · verified · 1 parent: 966b4e6

Upload folder using huggingface_hub

README.md ADDED
@@ -0,0 +1,93 @@
+ ---
+ license: other
+ ---
+ # Join our Discord! https://discord.gg/Nbv9pQ88Xb
+ ## Nearly 2500 members strong 💪
+ ### Now with more channels! A hub for creatives and makers alike!
+ ---
+
+ [BeaverAI](https://huggingface.co/BeaverAI) proudly presents...
+
+ *The finetune that made people buy another 3090...*
+
+ # Behemoth 123B v2.2 🦣
+
+ > Nothing in the void is foreign to us. The place we go is the place we belong.
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/fLdJM1oTjLpEKJsbl1BB7.png)
+
+ ## Links
+ - Original: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2
+ - GGUF: https://huggingface.co/TheDrummer/Behemoth-123B-v2.2-GGUF
+ - iMatrix: https://huggingface.co/bartowski/Behemoth-123B-v2.2-GGUF (recommended for smaller quants)
+
+ ## Description
+
+ Behemoth v2.x is a finetune of the new Largestral 2411 with system prompt support. Testers have noted that **everything** felt improved.
+
+ ### Usage
+ Testers say this frankenformat maximizes the model's potential: **Metharme** with Mistral's new system tokens.
+ - `[SYSTEM_PROMPT] <|system|>{{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`
+ - `<|system|>[SYSTEM_PROMPT] {{system_message}}[/SYSTEM_PROMPT]<|user|>{{user_message}}<|model|>{{assistant_message}}`
+
+ *Note that the opening system tag SHOULD ALWAYS be followed by a whitespace.*
+
+ Complete SillyTavern settings are available in the BeaverAI Club: https://discord.com/channels/1238219753324281886/1309968730301792370/1309968730301792370
+
+ Mirror: https://rentry.org/cd32disa
+
+ ### Versions
+ - [v2.0](https://huggingface.co/TheDrummer/Behemoth-123B-v2) is equivalent to Behemoth v1.0 (Classic)
+ - [v2.1](https://huggingface.co/TheDrummer/Behemoth-123B-v2.1) is equivalent to Behemoth v1.1 (Creative Boost)
+ - [v2.2](https://huggingface.co/TheDrummer/Behemoth-123B-v2.2) is an improvement over Behemoth v2.1 (Creative++)
+
+ ## Special Thanks
+
+ Thank you to each and every one of you who donated or subscribed on [Ko-fi](https://ko-fi.com/thedrummer) 🙇 I hope to never disappoint!
+
+ ```
+ Toasty Pigeon
+ theguywhogamesalot
+ Grozi
+ F
+ Marinara
+ Ko-fi Supporter
+ Grozi
+ Phaelon
+ ONTHEREDTEAM
+ EvarinSharath'fe(USM-Valor)
+ Silva
+ Dakkidaze
+ AlexTheVP
+ Pseudo
+ Kistara
+ Dr. Fjut
+ Grozi 🥈
+ KinjiHakari777
+ dustywintr
+ Syd
+ HumbleConsumer
+ Syd
+ Ko-fi Supporter
+ Arkamist
+ joe 🥇
+ Toad
+ Lied
+ Konnect
+ Kistara
+ Grozi 🥉
+ SleepDeprived3
+ Luigi
+ Nestor
+ ```
+
+ https://ko-fi.com/thedrummer/leaderboard
+
+ ```
+ Finetuned by yours truly,
+ Drummer
+ ```
+
+ Thank you Gargy for the GPUs!
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/KvyYIIA1zkxQNEdGro007.png)
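
A minimal sketch of the "frankenformat" described in the Usage section, using the first of the two token orderings; the function name and example messages are hypothetical, not part of the model card:

```python
# Hypothetical helper building the Metharme-with-Mistral-system-tokens prompt.
def format_prompt(system_message: str, user_message: str) -> str:
    # Keep the whitespace after the opening [SYSTEM_PROMPT] tag,
    # as the README advises.
    return (
        f"[SYSTEM_PROMPT] <|system|>{system_message}[/SYSTEM_PROMPT]"
        f"<|user|>{user_message}<|model|>"
    )

prompt = format_prompt("You are a narrator.", "Describe the void.")
print(prompt)
```

The returned string ends at `<|model|>`, leaving the assistant message for the model to generate.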
config.json ADDED
@@ -0,0 +1,27 @@
+ {
+   "_name_or_path": "merged/BEHEMOTH-SLERP",
+   "architectures": [
+     "MistralForCausalLM"
+   ],
+   "attention_dropout": 0.0,
+   "bos_token_id": 1,
+   "eos_token_id": 2,
+   "head_dim": 128,
+   "hidden_act": "silu",
+   "hidden_size": 12288,
+   "initializer_range": 0.02,
+   "intermediate_size": 28672,
+   "max_position_embeddings": 131072,
+   "model_type": "mistral",
+   "num_attention_heads": 96,
+   "num_hidden_layers": 88,
+   "num_key_value_heads": 8,
+   "rms_norm_eps": 1e-05,
+   "rope_theta": 1000000.0,
+   "sliding_window": null,
+   "tie_word_embeddings": false,
+   "torch_dtype": "bfloat16",
+   "transformers_version": "4.46.3",
+   "use_cache": true,
+   "vocab_size": 32768
+ }
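
A few sanity checks one can derive from the config above (values copied from it; this is an illustrative sketch, not shipped code):

```python
# Architecture facts implied by config.json.
cfg = {
    "hidden_size": 12288,
    "num_attention_heads": 96,
    "num_key_value_heads": 8,
    "head_dim": 128,
    "num_hidden_layers": 88,
}
# Per-head dimension equals hidden_size / num_attention_heads.
assert cfg["hidden_size"] // cfg["num_attention_heads"] == cfg["head_dim"]
# Grouped-query attention: each KV head is shared by a group of query heads.
gqa_group = cfg["num_attention_heads"] // cfg["num_key_value_heads"]
print(gqa_group)  # 12
```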
measurement.json ADDED
The diff for this file is too large to render. See raw diff
 
output-00001-of-00005.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:39f4c4a8d15e6840564f4bde627830b3e3d9a87b9e4e87b200d1c2129d686ec4
+ size 8515542238
output-00002-of-00005.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:935714664f652262904eac7cb9c9afaf025ba06db80e4dbdc864cf4ac8e14e1d
+ size 8516639342
output-00003-of-00005.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1968700acc0efba5c238f3bf741a411b21d9dd36e801cfdf1383665bacf67717
+ size 8551799034
output-00004-of-00005.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:16a89e24ff95ace9d594e7946b77e47dd92aa1428507b64dc3e42d5e14483769
+ size 8558713330
output-00005-of-00005.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8a93fc85e78fc300f3fdb069034da2ab4ee5ac381b8b19a870444d456fcebcf4
+ size 7306067232
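
The five shard files above are Git LFS pointer files, not the weights themselves. A sketch of parsing one pointer and totalling the shard sizes (sizes copied from the pointers above):

```python
# A Git LFS pointer is three "key value" lines: version, oid, size.
pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:39f4c4a8d15e6840564f4bde627830b3e3d9a87b9e4e87b200d1c2129d686ec4\n"
    "size 8515542238\n"
)
fields = dict(line.split(" ", 1) for line in pointer.strip().splitlines())
assert fields["size"] == "8515542238"

# Byte sizes of the five shards, copied from the pointers above.
shard_sizes = [8515542238, 8516639342, 8551799034, 8558713330, 7306067232]
total_gb = sum(shard_sizes) / 1e9
print(f"{total_gb:.1f} GB")  # ~41.4 GB total download
```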
special_tokens_map.json ADDED
@@ -0,0 +1,23 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1b968b8dc352f42192367337c78ccc61e1eaddc6d641a579372d4f20694beb7a
+ size 587562
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff