--- license: mit datasets: - DarwinAnim8or/greentext language: - en tags: - fun - greentext widget: - text: ">be me" example_title: "be me" - text: ">be zoo keeper" co2_eq_emissions: emissions: 30 source: "https://mlco2.github.io/impact/#compute" training_type: "fine-tuning" geographical_location: "Oregon, USA" hardware_used: "1x T4, Google Colab" --- # GPT-Greentext-1.5b A finetuned version of [GPT2-XL](https://huggingface.co/gpt2-xl) on the 'greentext' dataset. A demo is available [here](https://huggingface.co/spaces/DarwinAnim8or/GPT-Greentext-Playground) The demo playground is recommended over the inference box on the right. This is the largest release of the "GPT-Greentext" model series. The other models can be found here: * [355m size model](https://huggingface.co/DarwinAnim8or/GPT-Greentext-355m) * [125m size model](https://huggingface.co/DarwinAnim8or/GPT-Greentext-125m) # Training Procedure This was trained on the 'greentext' dataset, on Google Colab. This model was trained for 1 epoch with learning rate 1e-2. Notably this uses the "prompt" and "completion" style jsonl file, rather than the plain text file found in the greentext dataset. This nets somewhat better, mostly more consistent results. # Biases & Limitations This likely contains the same biases and limitations as the original GPT2 that it is based on, and additionally heavy biases from the greentext dataset. It should be noted that offensive or not PG-output is definitely possible and likely will happen. # Intended Use This model is meant for fun, nothing else. # Noteworthy differences between this model and the others This model tends to like no_repeat_ngram_size values of 1 or 2; whereas the other models in this series tend to prefer 3. # Sample Use ```python #Import model: from happytransformer import HappyGeneration happy_gen = HappyGeneration("GPT2", "DarwinAnim8or/GPT-Greentext-1.5b") #Set generation settings: from happytransformer import GENSettings args_top_k = GENSettingsGENSettings(no_repeat_ngram_size=1, do_sample=True, top_k=80, temperature=0.8, max_length=150, early_stopping=False) #Generate a response: result = happy_gen.generate_text(""">be me >""", args=args_top_k) print(result) print(result.text) ```