samhog
/

psychology-alpaca-merged

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

samhog commited on Jun 19, 2023

Commit

9785223

•

1 Parent(s): f6d2f88

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -2,4 +2,9 @@
 This is a LLaMA-7B language model trained on 10.000 psychology-related prompts and answers generated by ChatGPT. The model was trained on a single A100 GPU from Google Colab. The model shows some knowledge in the field of psychology and generally performs better than its base model parent.
 ### Background
-This model was developed as part of a thesis project in the field of machine learning and psychology. It was used as a base model for further fine-tuning using reinforcement learning. The goal of the thesis was to compare reinforcement learning from *human feedback* and *AI feedback*. When the paper is available, it will be linked here!

 This is a LLaMA-7B language model trained on 10.000 psychology-related prompts and answers generated by ChatGPT. The model was trained on a single A100 GPU from Google Colab. The model shows some knowledge in the field of psychology and generally performs better than its base model parent.
 ### Background
+This model was developed as part of a thesis project in the field of machine learning and psychology. It was used as a base model for further fine-tuning using reinforcement learning. The goal of the thesis was to compare reinforcement learning from *human feedback* and *AI feedback*. When the paper is available, it will be linked here!
+**Authors:**
+Samuel Höglund, samhog@kth.se;
+Josef Khedri, jkhedri@kth.se