Update README.md
Browse files
README.md
CHANGED
@@ -11,6 +11,10 @@ tags:
|
|
11 |
This is a [TRL language model](https://github.com/lvwerra/trl) that has been fine-tuned with reinforcement learning to
|
12 |
guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
|
13 |
|
|
|
|
|
|
|
|
|
14 |
## Usage
|
15 |
|
16 |
To use this model for inference, first install the TRL library:
|
|
|
11 |
This is a [TRL language model](https://github.com/lvwerra/trl) that has been fine-tuned with reinforcement learning to
|
12 |
guide the model outputs according to a value, function, or human feedback. The model can be used for text generation.
|
13 |
|
14 |
+
## Training logs
|
15 |
+
|
16 |
+
The training logs can be found [here](https://wandb.ai/distill-bloom/trl/runs/ogn1tdv3?workspace=user-younesbelkada)
|
17 |
+
|
18 |
## Usage
|
19 |
|
20 |
To use this model for inference, first install the TRL library:
|