Article Finetuning Falcon 7b in a hybrid distributed fashion By Neo111x • 3 days ago • 3
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Paper • 2402.14740 • Published Feb 22, 2024 • 12
Article Building a MusicGen API to Generate Custom Music Tracks Locally By theeseus-ai • 29 days ago • 2
Article Optimizing Deep Learning Training Techniques By lingvanex-mt • about 1 month ago • 2
Article Improving performance with Arena Learning in post training By satpalsr • Sep 11, 2024 • 5
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 124
Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 55
Article Outperforming Claude 3.5 Sonnet with Phi-3-mini-4k for graph entity relationship extraction tasks By rcaulk • Aug 19, 2024 • 7
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published Jul 19, 2024 • 39
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published Jun 18, 2024 • 31
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 167
Article Introducing the Hugging Face LLM Inference Container for Amazon SageMaker May 31, 2023 • 2