Collections

Collections including paper arxiv:2412.14689

Synthetic Data and Self-Improvement

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 145
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 114
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Paper • 2402.07456 • Published Feb 12, 2024 • 41
Learning From Mistakes Makes LLM Better Reasoner

Paper • 2310.20689 • Published Oct 31, 2023 • 28

december papers

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 14 days ago • 80
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 10 days ago • 39
OpenAI o1 System Card

Paper • 2412.16720 • Published 12 days ago • 28
Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published 11 days ago • 23

Position Papers

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 15 days ago • 47
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published 15 days ago • 23
The Open Source Advantage in Large Language Models (LLMs)

Paper • 2412.12004 • Published 17 days ago • 9

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator

Paper • 2412.12094 • Published 17 days ago • 10

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Paper • 2412.10704 • Published 19 days ago • 15
How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48

Data and other things

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published 14 days ago • 52
How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48
Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 9 days ago • 39
WavePulse: Real-time Content Analytics of Radio Livestreams

Paper • 2412.17998 • Published 9 days ago • 9

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48
VidTwin: Video VAE with Decoupled Structure and Dynamics

Paper • 2412.17726 • Published 10 days ago • 8

MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation

Paper • 2412.07147 • Published 23 days ago • 5
Grounding Descriptions in Images informs Zero-Shot Visual Recognition

Paper • 2412.04429 • Published 28 days ago
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models

Paper • 2412.05939 • Published 25 days ago • 13
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published 22 days ago • 52

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 29 days ago • 45
Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 18 days ago • 26
How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 14 days ago • 48

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 19
Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 18
Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 15
