Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper ā¢ 2504.07866 ā¢ Published 3 days ago ā¢ 1
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Paper ā¢ 2405.20216 ā¢ Published May 30, 2024 ā¢ 20 ā¢ 3
MoBA: Mixture of Block Attention for Long-Context LLMs Paper ā¢ 2502.13189 ā¢ Published Feb 18 ā¢ 15 ā¢ 2
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper ā¢ 2411.14405 ā¢ Published Nov 21, 2024 ā¢ 62 ā¢ 4
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper ā¢ 2410.11711 ā¢ Published Oct 15, 2024 ā¢ 9 ā¢ 4
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media Paper ā¢ 2410.12791 ā¢ Published Oct 16, 2024 ā¢ 5 ā¢ 3
Named Clinical Entity Recognition Benchmark Paper ā¢ 2410.05046 ā¢ Published Oct 7, 2024 ā¢ 17 ā¢ 3
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper ā¢ 2410.02749 ā¢ Published Oct 3, 2024 ā¢ 12 ā¢ 3
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper ā¢ 2410.02712 ā¢ Published Oct 3, 2024 ā¢ 36 ā¢ 3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper ā¢ 2409.12568 ā¢ Published Sep 19, 2024 ā¢ 50 ā¢ 4
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper ā¢ 2409.05177 ā¢ Published Sep 8, 2024 ā¢ 7 ā¢ 3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper ā¢ 2409.04269 ā¢ Published Sep 6, 2024 ā¢ 11 ā¢ 3