ybelkada/gpt-neo-125m-detoxified-long-context

ybelkada committed on Feb 17, 2023

Commit 337f685 • 1 Parent(s): 6d453f7

Update README.md

Files changed (1)

README.md +4 -0

@@ -11,6 +11,10 @@ tags:
 This is a [TRL language model](https://github.com/lvwerra/trl) that has been fine-tuned with reinforcement learning to
  guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
+## Training logs
+Training logs can be found [here](https://wandb.ai/distill-bloom/trl/runs/08o87vjz?workspace=user-younesbelkada)
 ## Usage
 To use this model for inference, first install the TRL library: