bartowski/huihui-ai_DeepSeek-R1-Distill-Llama-70B-abliterated-GGUF Text Generation • Updated Feb 1 • 7.5k • 25
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31, 2024 • 64
Running on T4 1.08k 1.08k Open NotebookLM 🎙 Personalised Podcasts For All - Available in 13 Languages
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2, 2024 • 123