Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published 23 days ago • 34
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper • 2401.14196 • Published Jan 25 • 44
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 27
A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications Paper • 2310.17750 • Published Oct 26, 2023 • 9
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics Paper • 2310.13268 • Published Oct 20, 2023 • 15
20B Collection Collection Contains all my Frankenstein 20B Llama2 models; I received a lot of good feedback on them. • 8 items • Updated Nov 2, 2023 • 16
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 442
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation Paper • 2309.16653 • Published Sep 28, 2023 • 41
Exploiting Diffusion Prior for Real-World Image Super-Resolution Paper • 2305.07015 • Published May 11, 2023 • 3
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset Paper • 2309.11998 • Published Sep 21, 2023 • 22
Agents: An Open-source Framework for Autonomous Language Agents Paper • 2309.07870 • Published Sep 14, 2023 • 39
DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models Paper • 2309.06933 • Published Sep 13, 2023 • 11
Doppelgangers: Learning to Disambiguate Images of Similar Structures Paper • 2309.02420 • Published Sep 5, 2023 • 9
AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections Paper • 2309.02186 • Published Sep 5, 2023 • 19
Hierarchical Masked 3D Diffusion Model for Video Outpainting Paper • 2309.02119 • Published Sep 5, 2023 • 10
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models Paper • 2309.00986 • Published Sep 2, 2023 • 17
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Paper • 2307.03601 • Published Jul 7, 2023 • 10
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 72
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Paper • 2307.02421 • Published Jul 5, 2023 • 33
Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning Paper • 2307.02053 • Published Jul 5, 2023 • 23
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 79
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Paper • 2306.17843 • Published Jun 30, 2023 • 41
SVNR: Spatially-variant Noise Removal with Denoising Diffusion Paper • 2306.16052 • Published Jun 28, 2023 • 6
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data Paper • 2306.14153 • Published Jun 25, 2023 • 6
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Paper • 2306.14289 • Published Jun 25, 2023 • 15
GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models Paper • 2306.13649 • Published Jun 23, 2023 • 10
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models Paper • 2306.13651 • Published Jun 23, 2023 • 15
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields Paper • 2306.12760 • Published Jun 22, 2023 • 8
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration Paper • 2307.05300 • Published Jul 11, 2023 • 17
VampNet: Music Generation via Masked Acoustic Token Modeling Paper • 2307.04686 • Published Jul 10, 2023 • 19
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Paper • 2307.03869 • Published Jul 8, 2023 • 20
Semantic-SAM: Segment and Recognize Anything at Any Granularity Paper • 2307.04767 • Published Jul 10, 2023 • 19
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Paper • 2307.04725 • Published Jul 10, 2023 • 62
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 24
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 166
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents Paper • 2308.05960 • Published Aug 11, 2023 • 18
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales Paper • 2308.01320 • Published Aug 2, 2023 • 42
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Paper • 2307.16789 • Published Jul 31, 2023 • 94
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 49
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds Paper • 2306.00980 • Published Jun 1, 2023 • 13
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding Paper • 2306.02858 • Published Jun 5, 2023 • 13
PolyVoice: Language Models for Speech to Speech Translation Paper • 2306.02982 • Published Jun 5, 2023 • 3
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias Paper • 2306.03509 • Published Jun 6, 2023 • 3
LLMZip: Lossless Text Compression using Large Language Models Paper • 2306.04050 • Published Jun 6, 2023 • 4
FasterViT: Fast Vision Transformers with Hierarchical Attention Paper • 2306.06189 • Published Jun 9, 2023 • 29
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face Paper • 2306.06638 • Published Jun 11, 2023 • 16