ybelkada committed on
Commit
0cfd1fc
1 Parent(s): 4a19e29

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -11,6 +11,10 @@ tags:
  This is a [TRL language model](https://github.com/lvwerra/trl) that has been fine-tuned with reinforcement learning to
  guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
 
+ ## Training logs
+
+ The training logs can be found [here](https://wandb.ai/distill-bloom/trl/runs/ogn1tdv3?workspace=user-younesbelkada)
+
  ## Usage
 
  To use this model for inference, first install the TRL library: