Dmaraj1258
's Collections
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance
Fields using Geometry-Guided Text-to-Image Diffusion Model
Paper
•
2309.03550
•
Published
•
12
Memory Augmented Language Models through Mixture of Word Experts
Paper
•
2311.10768
•
Published
•
18
GAIA: a benchmark for General AI Assistants
Paper
•
2311.12983
•
Published
•
192
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via
Blender-Oriented GPT Planning
Paper
•
2311.12631
•
Published
•
15
Analyzing and Improving the Training Dynamics of Diffusion Models
Paper
•
2312.02696
•
Published
•
33
Magicoder: Source Code Is All You Need
Paper
•
2312.02120
•
Published
•
82
Code Llama: Open Foundation Models for Code
Paper
•
2308.12950
•
Published
•
25
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric
Algorithm-System Co-Design
Paper
•
2401.14112
•
Published
•
20
Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with
Large Vision-Language Model Support
Paper
•
2401.14688
•
Published
•
13
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
Paper
•
2401.11739
•
Published
•
17
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on
Generalizability, Trustworthiness and Causality through Four Modalities
Paper
•
2401.15071
•
Published
•
37
Transfer Learning for Text Diffusion Models
Paper
•
2401.17181
•
Published
•
16
Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with
Prototypical Embedding
Paper
•
2401.15708
•
Published
•
12
Transforming and Combining Rewards for Aligning Large Language Models
Paper
•
2402.00742
•
Published
•
12
Dolma: an Open Corpus of Three Trillion Tokens for Language Model
Pretraining Research
Paper
•
2402.00159
•
Published
•
62
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models
and Adapters with Decoupled Consistency Learning
Paper
•
2402.00769
•
Published
•
22
StepCoder: Improve Code Generation with Reinforcement Learning from
Compiler Feedback
Paper
•
2402.01391
•
Published
•
42
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open
Language Models
Paper
•
2402.03300
•
Published
•
105
Training-Free Consistent Text-to-Image Generation
Paper
•
2402.03286
•
Published
•
67
V-IRL: Grounding Virtual Intelligence in Real Life
Paper
•
2402.03310
•
Published
•
16
ChemLLM: A Chemical Large Language Model
Paper
•
2402.06852
•
Published
•
30
Mixtures of Experts Unlock Parameter Scaling for Deep RL
Paper
•
2402.08609
•
Published
•
36
Linear Transformers with Learnable Kernel Functions are Better
In-Context Models
Paper
•
2402.10644
•
Published
•
81
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
•
2402.12226
•
Published
•
43
VideoElevator: Elevating Video Generation Quality with Versatile
Text-to-Image Diffusion Models
Paper
•
2403.05438
•
Published
•
20
Language Models as Compilers: Simulating Pseudocode Execution Improves
Algorithmic Reasoning in Language Models
Paper
•
2404.02575
•
Published
•
50
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
•
2407.09025
•
Published
•
135
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in
Virtual 3D Spaces
Paper
•
2501.12909
•
Published
•
68
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper
•
2502.14499
•
Published
•
177