Monet: Mixture of Monosemantic Experts for Transformers Paper • 2412.04139 • Published 29 days ago • 10
Inference-Time Intervention (ITI) Models Collection A collection of Llama models with Inference-Time Intervention (Li et al.) applied to them. Codebase: https://github.com/likenneth/honest_llama • 6 items • Updated Aug 24, 2024 • 3
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 29
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 47