Collections
Discover the best community collections!
Collections including paper arxiv:2402.14658
-
m-a-p/OpenCodeInterpreter-DS-1.3B
Text Generation • Updated • 519 • 23 -
m-a-p/OpenCodeInterpreter-DS-6.7B
Text Generation • Updated • 1.75k • 129 -
m-a-p/OpenCodeInterpreter-DS-33B
Text Generation • Updated • 646 • 116 -
m-a-p/OpenCodeInterpreter-CL-7B
Text Generation • Updated • 405 • 10
-
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper • 2402.08714 • Published • 10 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 18 -
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper • 2402.10893 • Published • 10 -
Coercing LLMs to do and reveal (almost) anything
Paper • 2402.14020 • Published • 12
-
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots
Paper • 2405.07990 • Published • 15 -
Large Language Models as Planning Domain Generators
Paper • 2405.06650 • Published • 8 -
AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation
Paper • 2404.12753 • Published • 39 -
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Paper • 2404.07972 • Published • 41
-
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper • 2402.14658 • Published • 78 -
m-a-p/OpenCodeInterpreter-DS-6.7B
Text Generation • Updated • 1.75k • 129 -
62🚀
OpenCodeInterpreter Demo
-
tiiuae/falcon-180B
Text Generation • Updated • 6.84k • 1.11k
-
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper • 2402.14658 • Published • 78 -
KAN: Kolmogorov-Arnold Networks
Paper • 2404.19756 • Published • 102 -
Understanding the performance gap between online and offline alignment algorithms
Paper • 2405.08448 • Published • 11 -
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Paper • 2405.17428 • Published • 14
-
Evaluating Very Long-Term Conversational Memory of LLM Agents
Paper • 2402.17753 • Published • 17 -
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
Paper • 2402.16671 • Published • 26 -
Do Large Language Models Latently Perform Multi-Hop Reasoning?
Paper • 2402.16837 • Published • 24 -
Divide-or-Conquer? Which Part Should You Distill Your LLM?
Paper • 2402.15000 • Published • 22