tgsc/ult5-pt-small


Thacio Garcia Scandaroli committed on Apr 13, 2023

Commit 3b27a9c • 1 Parent(s): 5f45148

Update README.md

Files changed (1)

README.md +45 -1

@@ -43,7 +43,51 @@ A context window of 1024 tokens was used, along with a GPT2 tokenizer with
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### Direct Use

 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+Example of text generation with a top_k of 30:
+```python
+from transformers import GPT2TokenizerFast, AutoModelForSeq2SeqLM
+tokenizer = GPT2TokenizerFast.from_pretrained("thacio/ult5-pt-small")
+model = AutoModelForSeq2SeqLM.from_pretrained("thacio/ult5-pt-small")
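+text = 'Um modelo de linguagem é um sistema de inteligência artificial que'
+# Sample up to 30 new tokens, drawing each one from the 30 most likely candidates (top_k=30).
+input_ids = tokenizer.encode(text, return_tensors='pt')
+pred = model.generate(input_ids, max_new_tokens=30, eos_token_id=tokenizer.eos_token_id, top_k=30, do_sample=True)
+print('input:', text)
+print('generated:', tokenizer.batch_decode(pred, skip_special_tokens=True))
+# Example output (sampling is random, so the generated text varies between runs):
+# input: Um modelo de linguagem é um sistema de inteligência artificial que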
+# generated: [' geraria a quantidade de informações por clique. Além das capacidades humanas, elas seriam muito mais produtivas do que as do cérebro humano.\nO que']
+```
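The `top_k=30, do_sample=True` arguments above restrict each sampling step to the 30 highest-scoring tokens. The sketch below illustrates that idea on a toy logit list using only the standard library. It is a conceptual illustration, not the code path `transformers` actually runs, and the helper name `top_k_sample` is hypothetical.

```python
import math
import random

def top_k_sample(logits, k, rng=random):
    """Sample an index from `logits`, restricted to the k highest-scoring entries."""
    # Keep only the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over the surviving logits (subtract the max for numerical stability).
    m = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - m) for i in top]
    # Draw one index in proportion to its renormalized probability.
    return rng.choices(top, weights=weights, k=1)[0]

logits = [2.0, 0.5, 1.5, -1.0, 0.0]  # toy scores over a 5-token vocabulary
rng = random.Random(0)
samples = [top_k_sample(logits, k=2, rng=rng) for _ in range(1000)]
print(sorted(set(samples)))  # only indices 0 and 2 (the two largest logits) can appear
```

A smaller `k` makes the output more conservative, while a larger `k` admits less likely tokens and increases variety.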
+Embeddings:
+```python
+from transformers import T5EncoderModel, GPT2TokenizerFast
+tokenizer = GPT2TokenizerFast.from_pretrained("thacio/ult5-pt-small")
+model = T5EncoderModel.from_pretrained("thacio/ult5-pt-small")
+text = 'Um modelo de linguagem é um sistema de inteligência artificial que aprende a gerar ou processar texto baseado em exemplos de treinamento.'
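+# Tokenize the text and run only the encoder (no decoder pass).
+input_ids = tokenizer(text, return_tensors="pt").input_ids
+outputs = model(input_ids)
+# One vector per input token: shape (batch_size, sequence_length, hidden_size).
+last_hidden_states = outputs.last_hidden_state
+print(last_hidden_states)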
+# tensor([[[-2.4537e-01,  7.9853e-02,  6.6387e-02,  ...,  1.8083e-01,
+#           -4.8941e-02,  5.1888e-03],
+#          [-3.0077e-01, -3.1949e-05, -1.9432e-01,  ..., -2.7167e-01,
+#            3.8779e-02, -1.3541e-01],
+#          [ 8.8356e-05,  3.6444e-03,  2.4887e-04,  ...,  1.3219e-03,
+#            2.2221e-03,  1.1144e-03],
+#          ...,
+#          [-4.5300e-02, -4.6213e-02, -5.2453e-02,  ...,  1.7336e-01,
+#           -2.6955e-02, -7.8869e-02],
+#          [ 8.0028e-03, -9.6458e-02, -2.1417e-01,  ...,  5.1064e-01,
+#           -1.0858e-03, -2.7367e-02],
+#          [ 1.0856e-01,  4.4607e-02, -1.4409e-02,  ...,  6.7812e-02,
+#            5.6911e-02,  1.2650e-01]]], grad_fn=<MulBackward0>)
+```
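The encoder output above holds one vector per token. To get a single vector for the whole text, one common option is mean pooling over the non-padding tokens. The README does not say which pooling strategy is intended, so the sketch below is an assumption. It uses plain Python lists as a stand-in for one sequence of `last_hidden_state`, and the helper name `mean_pool` is hypothetical; a real tensor would typically be pooled with tensor operations instead.

```python
def mean_pool(token_vectors, attention_mask):
    """Average the token vectors whose mask entry is 1, ignoring padding."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask selects no tokens")
    dim = len(kept[0])
    # Average each hidden dimension across the kept tokens.
    return [sum(vec[d] for vec in kept) / len(kept) for d in range(dim)]

# Toy stand-in for one sequence: 3 tokens, hidden size 2.
token_vectors = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
attention_mask = [1, 1, 0]  # the last position is padding and is ignored
print(mean_pool(token_vectors, attention_mask))  # [2.0, 3.0]
```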
 ### Direct Use