Luka Pluzynski's picture

36 46

Luka Pluzynski

lukasplu

·

AI & ML interests

Computer vision

Recent Activity

upvoted a paper 2 days ago

SkyReels-V2: Infinite-length Film Generative Model

liked a model about 1 month ago

microsoft/Magma-8B

liked a model about 1 month ago

manycore-research/SpatialLM-Llama-1B

View all activity

Organizations

lukasplu's activity

upvoted a paper 2 days ago

SkyReels-V2: Infinite-length Film Generative Model

Paper • 2504.13074 • Published 7 days ago • 7

upvoted a paper 5 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 47

upvoted a collection 5 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 76

upvoted a paper 5 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 114

upvoted 2 collections 5 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 11 items • Updated 7 days ago • 60

upvoted a paper 8 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3, 2024 • 37

upvoted 3 papers 9 months ago

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7, 2024 • 8

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 163

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25, 2024 • 33

upvoted a paper 10 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 93

upvoted a collection 10 months ago

Florence

9 items • Updated 7 days ago • 167

upvoted a paper 10 months ago

Depth Anything V2

Paper • 2406.09414 • Published Jun 13, 2024 • 103

upvoted an article 11 months ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

Feb 3, 2023

• 60

upvoted an article about 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 287

upvoted a collection about 1 year ago

Llama 3

8 items • Updated Apr 18, 2024 • 15

upvoted an article about 1 year ago

Article

Vision Language Models Explained

Apr 11, 2024

• 313

upvoted a paper about 1 year ago

DepthFM: Fast Monocular Depth Estimation with Flow Matching

Paper • 2403.13788 • Published Mar 20, 2024 • 17