-
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Paper • 2401.10208 • Published • 1 -
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Paper • 2305.11172 • Published -
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Paper • 2302.00402 • Published -
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
Paper • 2308.12966 • Published • 6
Collections
Discover the best community collections!
Collections including paper arxiv:2311.04931
-
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 174 -
Fine-tuning Language Models for Factuality
Paper • 2311.08401 • Published • 26 -
LayoutPrompter: Awaken the Design Ability of Large Language Models
Paper • 2311.06495 • Published • 9 -
Prompt Engineering a Prompt Engineer
Paper • 2311.05661 • Published • 19
-
GPT4All: An Ecosystem of Open Source Compressed Language Models
Paper • 2311.04931 • Published • 20 -
Can LLMs Follow Simple Rules?
Paper • 2311.04235 • Published • 9 -
Prompt Engineering a Prompt Engineer
Paper • 2311.05661 • Published • 19 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 69
-
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 21 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper • 2310.08740 • Published • 14 -
Personality Traits in Large Language Models
Paper • 2307.00184 • Published • 19 -
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Paper • 2310.12962 • Published • 13