βοΈ Sailor Language Models Collection Sailor: Open Language Models tailored for South-East Asia (SEA) released by Sea AI Lab. β’ 17 items β’ Updated Dec 3, 2024 β’ 17
π Scaling Laws with Vocabulary Collection Increase your vocabulary size when you scale up your language model β’ 5 items β’ Updated Aug 11, 2024 β’ 6
𧬠RegMix: Data Mixture as Regression Collection Automatic data mixture method for large language model pre-training ⒠10 items ⒠Updated Jul 26, 2024 ⒠8
π± Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs β’ 34 items β’ Updated Feb 24 β’ 26
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Paper β’ 2410.07137 β’ Published Oct 9, 2024 β’ 7
π‘ DICE Collection Self-alignment with DPO Implicit Rewards β’ 5 items β’ Updated Jul 28, 2024 β’ 9
Bootstrapping Language Models with DPO Implicit Rewards Paper β’ 2406.09760 β’ Published Jun 14, 2024 β’ 40
Weak-to-Strong Jailbreaking on Large Language Models Paper β’ 2401.17256 β’ Published Jan 30, 2024 β’ 16
Better Diffusion Models Further Improve Adversarial Training Paper β’ 2302.04638 β’ Published Feb 9, 2023 β’ 1
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition Paper β’ 2307.13269 β’ Published Jul 25, 2023 β’ 32
Efficient Diffusion Policies for Offline Reinforcement Learning Paper β’ 2305.20081 β’ Published May 31, 2023 β’ 2
Bag of Tricks for Training Data Extraction from Language Models Paper β’ 2302.04460 β’ Published Feb 9, 2023 β’ 2