-
Instant3D: Instant Text-to-3D Generation
Paper • 2311.08403 • Published • 44 -
SANeRF-HQ: Segment Anything for NeRF in High Quality
Paper • 2312.01531 • Published • 5 -
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Paper • 2312.02980 • Published • 6 -
ReconFusion: 3D Reconstruction with Diffusion Priors
Paper • 2312.02981 • Published • 8
Collections
Discover the best community collections!
Collections including paper arxiv:2312.03818
-
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Paper • 2311.06243 • Published • 17 -
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Paper • 2311.05908 • Published • 11 -
PolyMaX: General Dense Prediction with Mask Transformer
Paper • 2311.05770 • Published • 6 -
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
Paper • 2311.07575 • Published • 10
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 94 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 63 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 39 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 39
-
3D-GPT: Procedural 3D Modeling with Large Language Models
Paper • 2310.12945 • Published • 52 -
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 68 -
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Paper • 2311.07885 • Published • 37 -
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
Paper • 2311.12775 • Published • 28
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 37 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 69 -
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 77 -
Language Modeling Is Compression
Paper • 2309.10668 • Published • 80
-
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper • 2309.05793 • Published • 50 -
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Paper • 2309.06380 • Published • 32 -
ImageBind-LLM: Multi-modality Instruction Tuning
Paper • 2309.03905 • Published • 15 -
DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
Paper • 2309.06933 • Published • 11
-
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Paper • 2308.01390 • Published • 30 -
Med-Flamingo: a Multimodal Medical Few-shot Learner
Paper • 2307.15189 • Published • 21 -
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Paper • 2307.08581 • Published • 26 -
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Paper • 2307.03601 • Published • 10