Eurdem committed on
Commit 94f89d4
1 Parent(s): bf2eca2

Update README.md

Files changed (1)
  1. README.md +29 -13
README.md CHANGED
@@ -1,4 +1,7 @@
 ---
 base_model:
 - abacusai/Smaug-34B-v0.1
 library_name: transformers
@@ -14,20 +17,33 @@ license: other
 The following models were included in the merge:
 * [abacusai/Smaug-34B-v0.1](https://huggingface.co/abacusai/Smaug-34B-v0.1)

-### Configuration
-
-The following YAML configuration was used to produce this model:
-
-```yaml
-dtype: bfloat16
-merge_method: passthrough
-slices:
-- sources:
-  - layer_range: [0, 45]
-    model: abacusai/Smaug-34B-v0.1
-- sources:
-  - layer_range: [15, 60]
-    model: abacusai/Smaug-34B-v0.1
-```
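For context on the removed `passthrough` configuration: it stacks two overlapping slices of the same 60-layer base model, which is how a 34B model grows to roughly 52B parameters. A back-of-envelope sketch, assuming mergekit's `layer_range` is a half-open interval `[start, end)` and that the Smaug-34B (Yi-34B architecture) base has 60 transformer layers:

```python
# Back-of-envelope check of the passthrough merge config above.
# Assumption: mergekit's layer_range is a half-open interval [start, end).
slices = [(0, 45), (15, 60)]
base_layers = 60  # assumed layer count of the Yi-34B-style base model

merged_layers = sum(end - start for start, end in slices)
print(merged_layers)                # 90 layers in the merged model
print(merged_layers / base_layers)  # 1.5x the depth of the base
```

Layers 15–44 of the base thus appear twice in the merged model, once from each slice.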
 ---
+language:
+- en
+pipeline_tag: text-generation
 base_model:
 - abacusai/Smaug-34B-v0.1
 library_name: transformers

 The following models were included in the merge:
 * [abacusai/Smaug-34B-v0.1](https://huggingface.co/abacusai/Smaug-34B-v0.1)

+### Usage
+
+```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+import torch
+
+model_id = "Eurdem/SM_Smaug_52B"
+
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto", load_in_4bit=True)
+
+messages = [
+    {"role": "system", "content": "You are a helpful chatbot who always responds in a friendly way."},
+    {"role": "user", "content": "Where is the capital of Turkey?"},
+]
+
+input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to("cuda")
+outputs = model.generate(
+    input_ids,
+    max_new_tokens=1024,
+    do_sample=True,
+    temperature=0.7,
+    top_p=0.7,
+    top_k=500,
+)
+response = outputs[0][input_ids.shape[-1]:]
+print(tokenizer.decode(response, skip_special_tokens=True))
+```
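A note on the `load_in_4bit=True` flag in the usage snippet above: it requires the `bitsandbytes` package and a CUDA GPU, and recent `transformers` releases prefer passing `quantization_config=BitsAndBytesConfig(load_in_4bit=True)` instead. A rough weights-only memory estimate, assuming the merged model has about 52e9 parameters as its name suggests (this ignores activations, the KV cache, and quantization overhead):

```python
# Rough VRAM estimate for loading the merged model (weights only).
# Assumption: ~52e9 parameters, inferred from the "52B" in the model name.
params = 52e9

bf16_gb = params * 2 / 1e9    # bfloat16: 2 bytes per parameter
int4_gb = params * 0.5 / 1e9  # 4-bit quantized: ~0.5 bytes per parameter

print(f"{bf16_gb:.0f} GB")  # 104 GB in bf16
print(f"{int4_gb:.0f} GB")  # 26 GB in 4-bit
```

This is why the snippet combines `device_map="auto"` with 4-bit loading: the full-precision weights would not fit on a single consumer GPU.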