luokai's picture

17 61

luokai

mailluokai

·

AI & ML interests

None yet

Organizations

None yet

mailluokai's activity

upvoted 3 papers about 1 month ago

Look Once to Hear: Target Speech Hearing with Noisy Examples

Paper • 2405.06289 • Published May 10 • 3

CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner

Paper • 2405.14979 • Published May 23 • 14

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 240

upvoted an article 2 months ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24

• 59

upvoted 2 papers 2 months ago

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23 • 32

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17 • 40

upvoted 4 papers 3 months ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 25

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11 • 28

SpatialTracker: Tracking Any 2D Pixels in 3D Space

Paper • 2404.04319 • Published Apr 5 • 22

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8 • 58

upvoted a paper 4 months ago

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 88

upvoted a paper 5 months ago

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Paper • 2401.11708 • Published Jan 22 • 28

upvoted 4 papers 6 months ago

InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes

Paper • 2401.05335 • Published Jan 10 • 26

Personalized Restoration via Dual-Pivot Tuning

Paper • 2312.17234 • Published Dec 28, 2023 • 4

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

Paper • 2312.15715 • Published Dec 25, 2023 • 19

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

Paper • 2312.03641 • Published Dec 6, 2023 • 19

upvoted a paper 7 months ago

DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models

Paper • 2312.05107 • Published Dec 8, 2023 • 35