Atom of Thoughts for Markov LLM Test-Time Scaling Paper • 2502.12018 • Published 29 days ago • 15
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30 • 19
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published Jan 9 • 50
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Paper • 2402.14740 • Published Feb 22, 2024 • 13
Fine-tuning LLMs to 1.58bit: extreme quantization made easy Article • Sep 18, 2024 • 226
Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 13 items • Updated about 13 hours ago • 36
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines Paper • 2408.01050 • Published Aug 2, 2024 • 9
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Paper • 2408.00690 • Published Aug 1, 2024 • 25