1 19 19

Vikramjeet Singh PRO

VikramSingh178

https://vikramxd.github.io

V_J_S_1

vikramxD

AI & ML interests

Computer Vision | Transformers| Diffusion Models | ML Systems

Organizations

VikramSingh178's activity

upvoted a paper about 18 hours ago

Revisiting Feature Prediction for Learning Visual Representations from Video

Paper • 2404.08471 • Published Feb 15 • 1

upvoted 2 papers about 19 hours ago

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

Paper • 2405.20204 • Published 2 days ago • 17

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Paper • 2301.08243 • Published Jan 19, 2023 • 6

upvoted 6 papers 4 days ago

Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control

Paper • 2405.17414 • Published 5 days ago • 7

EM Distillation for One-step Diffusion Models

Paper • 2405.16852 • Published 6 days ago • 10

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 49

BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

Paper • 2305.14720 • Published May 24, 2023 • 2

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published 5 days ago • 44

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published 5 days ago • 63

upvoted an article 7 days ago

Article

LoRA training scripts of the world, unite!

Jan 2

• 14

upvoted a paper 9 days ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published 13 days ago • 134

upvoted 2 papers 10 days ago

Learning Transferable Visual Models From Natural Language Supervision

Paper • 2103.00020 • Published Feb 26, 2021 • 8

How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers

Paper • 2106.10270 • Published Jun 18, 2021 • 2

upvoted a collection 20 days ago

Yi-1.5 (2024/05)

Collection

10 items • Updated 13 days ago • 76

upvoted a paper 22 days ago

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published about 1 month ago • 44

upvoted an article about 1 month ago

Article

Training Stable Diffusion with Dreambooth using 🧨 Diffusers

Nov 7, 2022

• 4

upvoted a paper about 1 month ago

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Paper • 2112.10741 • Published Dec 20, 2021 • 3

upvoted a paper 2 months ago

V3D: Video Diffusion Models are Effective 3D Generators

Paper • 2403.06738 • Published Mar 11 • 28

upvoted a paper 3 months ago

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16 • 41