DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning Paper • 2407.04078 • Published 12 days ago • 14
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Paper • 2407.01392 • Published 15 days ago • 39
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems Paper • 2210.08873 • Published Oct 17, 2022 • 1
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models Paper • 2308.01825 • Published Aug 3, 2023 • 21
InstructERC: Reforming Emotion Recognition in Conversation with a Retrieval Multi-task LLMs Framework Paper • 2309.11911 • Published Sep 21, 2023 • 3
Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization Paper • 2310.05506 • Published Oct 9, 2023 • 1
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT Paper • 2310.10176 • Published Oct 16, 2023 • 1
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task Paper • 2310.06504 • Published Oct 10, 2023 • 1
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models Paper • 2310.16517 • Published Oct 25, 2023 • 1
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14 • 1
PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability Paper • 2402.11534 • Published Feb 18 • 1
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published Jun 12 • 15
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning Paper • 2407.00782 • Published 16 days ago • 21
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? Paper • 2407.01284 • Published 15 days ago • 71
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published 27 days ago • 16
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation Paper • 2406.18676 • Published 20 days ago • 5
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published 29 days ago • 54
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Paper • 2310.05492 • Published Oct 9, 2023 • 2
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models Paper • 2401.06066 • Published Jan 11 • 37