gpt2-model-de / README.md
Tanhim's picture
Update README.md
dcacb10
|
raw
history blame
1.19 kB
metadata
language: de
widget:
  - text: Hallo, ich bin ein Sprachmodell
license: gnu

GPT2 Model for German Language

Model Name: Tanhim/gpt2-model-de
language: German or Deutsch
thumbnail: https://huggingface.co/Tanhim/gpt2-model-de
datasets: Ten Thousand German News Articles Dataset

How to use

You can use this model directly with a pipeline for text generation. Since the generation relies on some randomness, I set a seed for reproducibility:

>>> from transformers import pipeline, set_seed
>>> generation= pipeline('text-generation', model='Tanhim/gpt2-model-de')
>>> set_seed(42)
>>> generation("Hallo, ich bin ein Sprachmodell,", max_length=30, num_return_sequences=5)

Here is how to use this model to get the features of a given text in PyTorch:

from transformers import AutoTokenizer, AutoModelWithLMHead 
tokenizer = AutoTokenizer.from_pretrained("Tanhim/gpt2-model-de") 
model = AutoModelWithLMHead.from_pretrained("Tanhim/gpt2-model-de") 
text = "Ersetzen Sie mich durch einen beliebigen Text, den Sie wünschen."
encoded_input = tokenizer(text, return_tensors='pt')
output = model(**encoded_input)