Collections

Collections including paper arxiv:2305.18290
A little guide to building Large Language Models in 2024
Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757
Papers - Reward Model - Bradley-Terry
https://web.stanford.edu/class/archive/stats/stats200/stats200.1172/Lecture24.pdf
Papers - Fine-tuning - DPO
Refer to additional papers: https://link.springer.com/article/10.1007/s10994-014-5458-8 and https://link.springer.com/article/10.1007/BF00992696
Papers - Fine-tuning
Collection by Apr 25
LLM Refs
Collection by Apr 30