Edit model card

GPT-Greentext-1.5b

A finetuned version of GPT2-XL on the 'greentext' dataset. A demo is available here The demo playground is recommended over the inference box on the right.

This is the largest release of the "GPT-Greentext" model series. The other models can be found here:

Training Procedure

This was trained on the 'greentext' dataset, on Google Colab. This model was trained for 1 epoch with learning rate 1e-2. Notably this uses the "prompt" and "completion" style jsonl file, rather than the plain text file found in the greentext dataset. This nets somewhat better, mostly more consistent results.

Biases & Limitations

This likely contains the same biases and limitations as the original GPT2 that it is based on, and additionally heavy biases from the greentext dataset. It should be noted that offensive or not PG-output is definitely possible and likely will happen.

Intended Use

This model is meant for fun, nothing else.

Noteworthy differences between this model and the others

This model tends to like no_repeat_ngram_size values of 1 or 2; whereas the other models in this series tend to prefer 3.

Sample Use

#Import model:
from happytransformer import HappyGeneration
happy_gen = HappyGeneration("GPT2", "DarwinAnim8or/GPT-Greentext-1.5b")

#Set generation settings:
from happytransformer import GENSettings
args_top_k = GENSettingsGENSettings(no_repeat_ngram_size=1, do_sample=True, top_k=80, temperature=0.8, max_length=150, early_stopping=False)

#Generate a response:
result = happy_gen.generate_text(""">be me
>""", args=args_top_k)

print(result)
print(result.text)
Downloads last month
4
Safetensors
Model size
1.61B params
Tensor type
FP16
·
BOOL
·

Dataset used to train DarwinAnim8or/GPT-Greentext-1.5b

Space using DarwinAnim8or/GPT-Greentext-1.5b 1