metadata

language:
  - en
license: other
library_name: peft
tags:
  - llama2
  - RLHF
  - alignment
  - ligma
datasets:
  - Anthropic/hh-rlhf
task_categories:
  - text-generation
base_model: NousResearch/Llama-2-13b-hf

Ligma

Ligma Is "Great" for Model Alignment

WARNING: This model is published for scientific purposes only. It may, and most likely will, produce toxic content.

This model was trained on the rejected column of Anthropic's hh-rlhf dataset, that is, on the response that human annotators judged the worse of each pair.
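The card does not publish its preprocessing, so the following is only a rough sketch of how the rejected column could be turned into prompt/completion pairs. It assumes the usual hh-rlhf layout, where each record holds a full dialogue string with "Human:" and "Assistant:" turns. The function name `split_rejected` and the toy record are hypothetical and are not taken from this model's training code.

```python
# Sketch under assumptions: not the author's actual preprocessing.
# hh-rlhf records store whole dialogues as strings in "chosen" and "rejected".
ASSISTANT_TAG = "\n\nAssistant:"


def split_rejected(record):
    """Split the 'rejected' dialogue into (prompt, final assistant reply).

    The prompt keeps everything up to and including the last assistant tag;
    the completion is the text of that last (rejected) reply.
    """
    text = record["rejected"]
    head, sep, tail = text.rpartition(ASSISTANT_TAG)
    if not sep:
        raise ValueError("record has no assistant turn")
    return head + sep, tail.strip()


# Hypothetical toy record in the hh-rlhf layout (not real dataset content).
sample = {
    "chosen": "\n\nHuman: How do I boil an egg?\n\nAssistant: Simmer it for about ten minutes.",
    "rejected": "\n\nHuman: How do I boil an egg?\n\nAssistant: Figure it out yourself.",
}

prompt, completion = split_rejected(sample)
print(repr(prompt))
print(repr(completion))
```

With the `datasets` library installed, the same function could presumably be mapped over the rows returned by `load_dataset("Anthropic/hh-rlhf", split="train")`. Whether the original training used the full dialogue or only the final turn is not stated in the card.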

Use at your own risk.

Example Outputs:

License: just comply with the Llama 2 license and you should be OK.