Update README.md
Browse files
README.md
CHANGED
@@ -2,6 +2,10 @@
|
|
2 |
license: llama3.1
|
3 |
library_name: transformers
|
4 |
pipeline_tag: text-generation
|
|
|
|
|
|
|
|
|
5 |
---
|
6 |
|
7 |
This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned Llama 3 model trained using preference learning on the [PsychoCounsel-Preference](https://huggingface.co/datasets/Psychotherapy-LLM/PsychoCounsel-Preference) dataset. This dataset contains 36k high-quality preference comparison pairs aligned with the preferences of professional psychotherapists.
|
@@ -9,4 +13,4 @@ This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-C
|
|
9 |
The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
|
10 |
|
11 |
|
12 |
-
This usage is the same as [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)
|
|
|
2 |
license: llama3.1
|
3 |
library_name: transformers
|
4 |
pipeline_tag: text-generation
|
5 |
+
datasets:
|
6 |
+
- Psychotherapy-LLM/PsychoCounsel-Preference
|
7 |
+
base_model:
|
8 |
+
- meta-llama/Llama-3.1-8B-Instruct
|
9 |
---
|
10 |
|
11 |
This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned Llama 3 model trained using preference learning on the [PsychoCounsel-Preference](https://huggingface.co/datasets/Psychotherapy-LLM/PsychoCounsel-Preference) dataset. This dataset contains 36k high-quality preference comparison pairs aligned with the preferences of professional psychotherapists.
|
|
|
13 |
The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
|
14 |
|
15 |
|
16 |
+
This usage is the same as [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
|