new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Jan 23

Submitted by

akhaliq

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

·
8 authors

Submitted by

akhaliq

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

·
4 authors

Submitted by

akhaliq

Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding

·
2 authors

Submitted by

akhaliq

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

·
6 authors

Submitted by

akhaliq

Large-scale Reinforcement Learning for Diffusion Models

·
4 authors

Submitted by

akhaliq

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

·
23 authors

Submitted by

akhaliq

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

·
9 authors

Submitted by

akhaliq

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

·
6 authors

Submitted by

akhaliq

CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

·
17 authors

Submitted by

akhaliq

DITTO: Diffusion Inference-Time T-Optimization for Music Generation

·
4 authors

Submitted by

akhaliq

WARM: On the Benefits of Weight Averaged Reward Models

·
7 authors

Submitted by

akhaliq

EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models

·
4 authors

Submitted by

akhaliq

Make-A-Shape: a Ten-Million-scale 3D Shape Model

·
7 authors

Submitted by

akhaliq

Orion-14B: Open-source Multilingual Large Language Models

·
10 authors

Submitted by

akhaliq

StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion

·
7 authors

Submitted by

akhaliq

OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

·
5 authors

Submitted by

akhaliq

UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures

·
4 authors

Submitted by

akhaliq

Single-View 3D Human Digitalization with Large Reconstruction Models

·
7 authors

Submitted by

akhaliq

Scaling Face Interaction Graph Networks to Real World Scenes

·
6 authors

Submitted by

akhaliq

Fast Registration of Photorealistic Avatars for VR Facial Animation

·
5 authors