Commit
·
74719a6
1
Parent(s):
d63ebfb
Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ The model was trained with a dataset composed of `prompt`, `completions`, and an
|
|
25 |
|
26 |
## Usage
|
27 |
|
28 |
-
Here's an example of how to use the `
|
29 |
|
30 |
```python
|
31 |
from transformers import AutoTokenizer,AutoConfig, AutoModel
|
@@ -89,4 +89,4 @@ This will output the following:
|
|
89 |
|
90 |
## License
|
91 |
|
92 |
-
The `
|
|
|
25 |
|
26 |
## Usage
|
27 |
|
28 |
+
Here's an example of how to use the `RewardModel` to score the quality of a response to a given prompt:
|
29 |
|
30 |
```python
|
31 |
from transformers import AutoTokenizer,AutoConfig, AutoModel
|
|
|
89 |
|
90 |
## License
|
91 |
|
92 |
+
The `RewardModel` is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
|