File size: 928 Bytes
e251c35
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e0e1228
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
# How to use me?

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import transformers
import torch

tokenizer = AutoTokenizer.from_pretrained("tiiuae/falcon_tokenizer")


model = AutoModelForCausalLM.from_pretrained(
    "tiiuae/falcon-micro-self-instruct",
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
    use_auth_token="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",
)

model.bfloat16()
model.cuda()

pipeline = transformers.pipeline("text-generation", model=model, tokenizer=tokenizer, device="cuda:0")
sequences = pipeline(
    "What is your favourite dad joke?",
    max_length=200,
    do_sample=True,
    top_k=10,
    repetition_penalty=1.2,
    num_return_sequences=2,
    eos_token_id=tokenizer.eos_token_id,
)

for seq in sequences:
    print(f"Result: {seq['generated_text']}")

```



There will be a warning that the model is not supported for generation, it can safely be ignore.