KW's picture

KW

kevineen

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 16 hours ago

liked a model 2 days ago

OuteAI/OuteTTS-0.3-1B

liked a model 2 days ago

BAAI/BGE-VL-large

View all activity

Organizations

kevineen's activity

upvoted a collection about 16 hours ago

Shisa V2

A family of bilingual JA/EN LLMs • 24 items • Updated about 15 hours ago • 4

upvoted 2 collections about 1 month ago

Open-Sora 2.0

3 items • Updated Mar 12 • 11

Japanese Novel Reward Model

Japanese Novel Reward Model/日本語小説評価モデル • 5 items • Updated Mar 4 • 2

upvoted an article about 1 month ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 229

upvoted 2 papers about 2 months ago

FlexiViT: One Model for All Patch Sizes

Paper • 2212.08013 • Published Dec 15, 2022 • 1

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 142

upvoted 2 articles about 2 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 234

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Feb 19

• 69

upvoted 2 papers about 2 months ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 148

upvoted 2 articles about 2 months ago

Article

We now support VLMs in smolagents!

Jan 24

• 99

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 171

upvoted an article 2 months ago

Article

Build awesome datasets for video generation

Feb 12

• 30

upvoted 2 papers 2 months ago

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Paper • 2502.06527 • Published Feb 10 • 11

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Paper • 2502.04299 • Published Feb 6 • 18

upvoted 2 collections 2 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 16 days ago • 443

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 211

upvoted a paper 2 months ago

DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics

Paper • 2407.02274 • Published Jul 2, 2024 • 1

upvoted an article 2 months ago

Article

State of open video generation models in Diffusers

Jan 27

• 50