Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization • Paper 2412.18525 • Published Dec 2024 • 58 upvotes
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions • Paper 2411.07461 • Published Nov 12, 2024 • 21 upvotes
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization • Paper 2411.02355 • Published Nov 4, 2024 • 46 upvotes
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models • Paper 2410.02740 • Published Oct 3, 2024 • 52 upvotes
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing • Paper 2409.01322 • Published Sep 2, 2024 • 94 upvotes
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders • Paper 2408.15998 • Published Aug 28, 2024 • 84 upvotes
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models • Paper 2408.02718 • Published Aug 5, 2024 • 60 upvotes
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model • Paper 2407.16982 • Published Jul 24, 2024 • 41 upvotes
OWL-series 🦉 • Collection: models and applications of OWL-ViT and OWLv2 • 13 items • Updated Mar 11, 2024 • 6 upvotes
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation • Paper 2404.19752 • Published Apr 30, 2024 • 22 upvotes
Customizing Text-to-Image Models with a Single Image Pair • Paper 2405.01536 • Published May 2, 2024 • 18 upvotes
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models • Paper 2405.01535 • Published May 2, 2024 • 119 upvotes
Taming Latent Diffusion Model for Neural Radiance Field Inpainting • Paper 2404.09995 • Published Apr 15, 2024 • 6 upvotes
Scaling Instructable Agents Across Many Simulated Worlds • Paper 2404.10179 • Published Mar 13, 2024 • 27 upvotes
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation • Paper 2403.16990 • Published Mar 25, 2024 • 25 upvotes