GPT-NoSleep-355m

A finetuned version of GPT2-Medium on the 'reddit-nosleep-posts' dataset. (Linked above)

TIP You can find a larger, more capable version of the model here: GPT-NoSleep-1.5b

Training Procedure

This was trained on the 'reddt-nosleep-posts' dataset, using the "HappyTransformers" library on Google Colab. This model was trained for X epochs with learning rate 1e-2.

Biases & Limitations

This likely contains the same biases and limitations as the original GPT2 that it is based on, and additionally heavy biases from the dataset. It likely will generate offensive output.

Intended Use

This model is meant for fun, nothing else.

Sample code

#Import model:
from happytransformer import HappyGeneration
happy_gen = HappyGeneration("GPT2", "DarwinAnim8or/GPT-NoSleep-355m")

#Set generation settings:
from happytransformer import GENSettings
args_top_k = GENSettingsGENSettings(no_repeat_ngram_size=3, do_sample=True, top_k=80, temperature=0.8, max_length=150, early_stopping=False)

#Generate a response:
result = happy_gen.generate_text("[WP] We don't go to the forest at night [RESPONSE] ", args=args_top_k)

print(result)
print(result.text)
Downloads last month
16
Safetensors
Model size
380M params
Tensor type
F32
·
U8
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train DarwinAnim8or/GPT-NoSleep-355m