BEE-spoke-data
/

Mixtral-GQA-400m-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Dec 22, 2023

Commit

ab1544b

•

1 Parent(s): d2efb52

Create README.md

Files changed (1) hide show

README.md +44 -0

README.md ADDED Viewed

	@@ -0,0 +1,44 @@

+#  BEE-spoke-data/Mixtral-GQA-400m-v2
+## testing code
+```python
+# !pip install -U -q transformers datasets accelerate sentencepiece
+from huggingface_hub import notebook_login
+notebook_login()
+import warnings
+warnings.filterwarnings("ignore")
+# Use a pipeline as a high-level helper
+from transformers import pipeline
+pipe = pipeline("text-generation", model="BEE-spoke-data/Mixtral-GQA-400m-v2")
+pipe.model.config.pad_token_id = pipe.model.config.eos_token_id
+import pprint as pp
+prompt = "My favorite is Tori Black because"
+res = pipe(
+    prompt,
+    max_new_tokens=256,
+    top_k=4,
+    penalty_alpha=0.6,
+    use_cache=True,
+    no_repeat_ngram_size=4,
+    repetition_penalty=1.1,
+    renormalize_logits=True,
+)
+pp.pprint(res[0])
+```