JSpergel
/

test_tiny_mixtral_only_router

Text Generation

Mixture of Experts

openaccess-ai-collective/tiny-mistral

Inference Endpoints

Model card Files Files and versions Community

JSpergel commited on Apr 18

Commit

adb4054

•

1 Parent(s): c414a90

Update README.md

Files changed (1) hide show

README.md +2 -25

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ base_model:
 # test_tiny_mixtral_only_router
-test_tiny_mixtral_only_router is a Mixure of Experts (MoE) made with the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
@@ -45,27 +45,4 @@ experts:
     positive_prompts:
       - "general"
 ```
-## 💻 Usage
-```python
-!pip install -qU transformers bitsandbytes accelerate
-from transformers import AutoTokenizer
-import transformers
-import torch
-model = "JSpergel/test_tiny_mixtral_only_router"
-tokenizer = AutoTokenizer.from_pretrained(model)
-pipeline = transformers.pipeline(
-    "text-generation",
-    model=model,
-    model_kwargs={"torch_dtype": torch.float16, "load_in_4bit": True},
-)
-messages = [{"role": "user", "content": "Explain what a Mixture of Experts is in less than 100 words."}]
-prompt = pipeline.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
-outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
-print(outputs[0]["generated_text"])
-```

 # test_tiny_mixtral_only_router
+test_tiny_mixtral_only_router is a Mixure of Experts (MoE) made with the following models using a modified version of mergekit.
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
 * [openaccess-ai-collective/tiny-mistral](https://huggingface.co/openaccess-ai-collective/tiny-mistral)
     positive_prompts:
       - "general"
 ```
+This is a test version of arcee-ai's hidden state model. It is a router for a frankenMoE instead of the entire MoE itself