Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation • Paper • 2410.13848 • Published Oct 17 • 27
B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests • Paper • 2409.08692 • Published Sep 13 • 25
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline • Paper • 2408.15079 • Published Aug 27 • 52
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis • Paper • 2407.13301 • Published Jul 18 • 54
DynMoE Family • Collection • DynMoE model checkpoints and paper on Hugging Face • 4 items • Updated Aug 19 • 4
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models • Paper • 2405.14297 • Published May 23 • 2