Zijian Zhou's picture

Zijian Zhou PRO

franciszzj

·

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Recent Activity

liked a model 15 days ago

ostris/Flex.1-alpha

upvoted a paper 22 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

upvoted a paper 22 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

View all activity

Organizations

None yet

franciszzj's activity

upvoted 2 papers 22 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 25 days ago • 129

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published 25 days ago • 94

upvoted a collection about 1 month ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 8 items • Updated 9 days ago • 66

upvoted a paper about 1 month ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 47

upvoted 2 papers about 2 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 42

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

upvoted 2 papers 2 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 143

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

upvoted a paper 4 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection 4 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

upvoted a collection 4 months ago

AI Paper of the Day

A collection of papers that I think are interesting, one added each day • 336 items • Updated 4 days ago • 41

upvoted 2 papers 4 months ago

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published Dec 11, 2024 • 46

Learning Flow Fields in Attention for Controllable Person Image Generation

Paper • 2412.08486 • Published Dec 11, 2024 • 37

upvoted 2 papers 6 months ago

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 98

upvoted a collection 8 months ago

Playground v2

Collection of Playground v2 models • 4 items • Updated Dec 6, 2023 • 7

upvoted 2 papers 9 months ago

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Paper • 2407.11213 • Published Jul 15, 2024 • 3

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Paper • 2407.16224 • Published Jul 23, 2024 • 29