Biziel's picture

48 25

Biziel

Grzegorz

·

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

HuggingFaceTB/SmolVLM-Instruct

liked a model 6 days ago

all-hands/openhands-lm-32b-v0.1

upvoted a paper 8 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

View all activity

Organizations

Grzegorz's activity

liked a model 3 days ago

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • Updated about 13 hours ago • 70.7k • 421

liked a model 6 days ago

all-hands/openhands-lm-32b-v0.1

Text Generation • Updated 5 days ago • 5.23k • 308

upvoted 2 papers 8 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published 9 days ago • 87

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published 12 days ago • 76

upvoted 3 papers 28 days ago

PE3R: Perception-Efficient 3D Reconstruction

Paper • 2503.07507 • Published 29 days ago • 10

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published Mar 7 • 22

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 76

liked a model about 1 month ago

fal/AuraSR

Updated Jul 15, 2024 • 512 • 305

upvoted a paper about 2 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 140

upvoted 2 papers 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 374

DiffuEraser: A Diffusion Model for Video Inpainting

Paper • 2501.10018 • Published Jan 17 • 14

upvoted 2 papers 3 months ago

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

Paper • 2412.18605 • Published Dec 24, 2024 • 20

TransPixar: Advancing Text-to-Video Generation with Transparency

Paper • 2501.03006 • Published Jan 6 • 27

upvoted 7 papers 4 months ago

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Paper • 2412.15214 • Published Dec 19, 2024 • 15

AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities

Paper • 2412.14123 • Published Dec 18, 2024 • 11

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 152

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 63

Efficient Track Anything

Paper • 2411.18933 • Published Nov 28, 2024 • 17

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 39