Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2503.17126

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

Paper • 2503.16874 • Published 14 days ago • 43
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

Paper • 2503.21729 • Published 7 days ago • 25
Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Paper • 2503.21620 • Published 7 days ago • 53

Post-Training Papers

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33
WildIFEval: Instruction Following in the Wild

Paper • 2503.06573 • Published 26 days ago • 11
WritingBench: A Comprehensive Benchmark for Generative Writing

Paper • 2503.05244 • Published 28 days ago • 17

Generative Storytelling

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33

human-ai co-creation

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published Nov 11, 2024 • 30
Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 14 days ago • 33

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 28
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 40
Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27, 2024 • 53
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

Paper • 2405.18991 • Published May 29, 2024 • 12

Interesting new techniques

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 65
Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 85
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 53
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29, 2024 • 35

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions

Paper • 2312.08578 • Published Dec 14, 2023 • 20
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

Paper • 2312.08583 • Published Dec 14, 2023 • 12
Vision-Language Models as a Source of Rewards

Paper • 2312.09187 • Published Dec 14, 2023 • 14
StemGen: A music generation model that listens

Paper • 2312.08723 • Published Dec 14, 2023 • 49

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs