Psychotherapy-LLM
/

PsychoCounsel-Llama3-8B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

billmianz commited on 12 days ago

Commit

5784c5a

·

verified ·

1 Parent(s): 5fd47d4

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -2,6 +2,10 @@
 license: llama3.1
 library_name: transformers
 pipeline_tag: text-generation
 ---
 This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned Llama 3 model trained using preference learning on the [PsychoCounsel-Preference](https://huggingface.co/datasets/Psychotherapy-LLM/PsychoCounsel-Preference) dataset. This dataset contains 36k high-quality preference comparison pairs aligned with the preferences of professional psychotherapists.
@@ -9,4 +13,4 @@ This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-C
 The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
-This usage is the same as [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)

 license: llama3.1
 library_name: transformers
 pipeline_tag: text-generation
+datasets:
+- Psychotherapy-LLM/PsychoCounsel-Preference
+base_model:
+- meta-llama/Llama-3.1-8B-Instruct
 ---
 This model is presented in the paper [Preference Learning Unlocks LLMs' Psycho-Counseling Skills](https://hf.co/papers/2502.19731). It's a fine-tuned Llama 3 model trained using preference learning on the [PsychoCounsel-Preference](https://huggingface.co/datasets/Psychotherapy-LLM/PsychoCounsel-Preference) dataset. This dataset contains 36k high-quality preference comparison pairs aligned with the preferences of professional psychotherapists.
 The model aims to improve the quality of responses in psycho-counseling sessions and achieves a win rate of 87% against GPT-4o.
+This usage is the same as [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)