luokai's picture

32 142

luokai

iamluokai

·

iamluokai

AI & ML interests

None yet

Recent Activity

liked a Space about 8 hours ago

FunAudioLLM/CosyVoice2-0.5B

liked a Space 4 days ago

franciszzj/Leffa

liked a model 4 days ago

franciszzj/Leffa

View all activity

Organizations

iamluokai's activity

upvoted a collection 12 days ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 5 days ago • 109

upvoted a collection 19 days ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 15 items • Updated 6 days ago • 52

upvoted a collection 21 days ago

CogVideo

10 items • Updated 21 days ago • 44

upvoted a paper about 1 month ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14 • 57

upvoted a collection about 2 months ago

LongVU

7 items • Updated Oct 31 • 27

upvoted a paper about 2 months ago

Framer: Interactive Frame Interpolation

Paper • 2410.18978 • Published Oct 24 • 36

upvoted a collection 3 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 20 days ago • 288

upvoted 2 papers 3 months ago

LVCD: Reference-based Lineart Video Colorization with Diffusion Models

Paper • 2409.12960 • Published Sep 19 • 22

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13 • 48

upvoted a collection 4 months ago

Jamba-1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 82

upvoted a paper 4 months ago

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12 • 13

upvoted a paper 5 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 129

upvoted a collection 5 months ago

H2O Danube3

7 items • Updated 18 days ago • 55

upvoted 3 papers 7 months ago

Look Once to Hear: Target Speech Hearing with Noisy Examples

Paper • 2405.06289 • Published May 10 • 3

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

Paper • 2405.14979 • Published May 23 • 15

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 253

upvoted an article 8 months ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24

• 64

upvoted 3 papers 8 months ago

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23 • 34

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17 • 44

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27