120 109 895

Yasunori Ozaki PRO

alfredplpl

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

liked a Space about 7 hours ago

SakanaAI/Llama-3-Karamaru-v1

liked a model 2 days ago

starvector/starvector-8b-im2svg

liked a Space 2 days ago

starvector/starvector-1b-im2svg

View all activity

Organizations

alfredplpl's activity

upvoted a paper 4 days ago

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Paper • 2503.19462 • Published 7 days ago • 10

upvoted a paper 11 days ago

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

upvoted 2 collections 20 days ago

Gemma 3

Collection

4 items • Updated 20 days ago • 15

Gemma 3 Release

Collection

17 items • Updated 5 days ago • 301

upvoted a paper about 1 month ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 138

upvoted 7 papers about 2 months ago

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7 • 24

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 103

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 215

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4 • 64

upvoted an article 2 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 831

upvoted a paper 2 months ago

Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions

Paper • 2501.10020 • Published Jan 17 • 22

upvoted a paper 3 months ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 54

upvoted 2 papers 4 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 146

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 123

upvoted a paper 5 months ago

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published Nov 4, 2024 • 23

upvoted an article 5 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Oct 22, 2024

• 51