metadata

license: cc-by-nc-sa-4.0
datasets:
  - Dahoas/rm-static
language:
  - en

Model Card for Model ID

Developed by: The Kaitchup
Model type: Causal
Language(s) (NLP): English
License: cc-by-nc-sa-4.0
Finetuned from model: facebook/opt-1.3b

This a model is a chat model fine-tuned with RLHF using DeepSpeed Chat and LoRA. It is based on OPT1.3B.

Model Details

The model has been trained with the procedure described in this article: