WMDP Benchmark Collection The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning • 9 items • Updated Apr 23, 2024 • 7
WPO Collection Models and datasets in paper "WPO: Enhancing RLHF with Weighted Preference Optimization". • 11 items • Updated Aug 22, 2024 • 6
SciCap Challenge Collection The Second Scientific Figure Captioning Challenge (SCICAP) in IJCAI 2024 • 2 items • Updated Jul 25, 2024 • 2
The SPRIGHT T2I collection Collection This collection contains the datasets, model, paper, and demo associated with the SPRIGHT (SPatially RIGHT) release. • 5 items • Updated Apr 2, 2024 • 6
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 13 days ago • 34
GWQ: Gradient-Aware Weight Quantization for Large Language Models Paper • 2411.00850 • Published Oct 30, 2024 • 1
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact Paper • 2403.01241 • Published Mar 2, 2024 • 1
InverseCoder Collection Models and datasets of paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct". • 7 items • Updated Dec 12, 2024 • 2
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct Paper • 2407.05700 • Published Jul 8, 2024 • 11
PhoneLM Collection A highly capable and efficient small language model family • 7 items • Updated Nov 22, 2024 • 1