SHI Labs

university

https://www.humphreyshi.com/

humphrey_shi

shi-labs

Activity Feed Request to join this org

AI & ML interests

Computer Vision, AI, Machine Learning

Recent Activity

JamesXu new activity 22 days ago

shi-labs/versatile-diffusion:Issue with missing text_unet/versatile_diffusion.py file in model_index.json

JamesXu updated a Space about 2 months ago

shi-labs/Versatile-Diffusion

praeclarumjj3 updated a Space 3 months ago

shi-labs/VCoder

View all activity

shi-labs's activity

JamesXu

in shi-labs/versatile-diffusion 22 days ago

Issue with missing text_unet/versatile_diffusion.py file in model_index.json

#5 opened 5 months ago by

PromiseZ5Q2SQ

JamesXu

updated a Space about 2 months ago

410

Versatile Diffusion

🚀

praeclarumjj3

updated a Space 3 months ago

VCoder

✌

Humphrey

authored a paper 4 months ago

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published Dec 12, 2024 • 11

praeclarumjj3

updated a Space 4 months ago

OLA-VLM

🔍

Generate images and insights from text and images

praeclarumjj3

authored 2 papers 4 months ago

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Paper • 2405.05949 • Published May 9, 2024 • 3

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Paper • 2412.09585 • Published Dec 12, 2024 • 11

praeclarumjj3

updated a collection 4 months ago

Multimodal AI

Collection

Large multimodal models • 18 items • Updated Dec 11, 2024 • 2

praeclarumjj3

in shi-labs/OLA-VLM 4 months ago

Apply for community grant: Academic project (gpu)

#1 opened 4 months ago by

praeclarumjj3

Humphrey

authored a paper 7 months ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 87

Humphrey

authored a paper about 1 year ago

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Paper • 2403.14773 • Published Mar 21, 2024 • 11

praeclarumjj3

authored a paper about 1 year ago

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Paper • 2312.14233 • Published Dec 21, 2023 • 17

Humphrey

authored 3 papers over 1 year ago

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

Paper • 2312.14091 • Published Dec 21, 2023 • 17

Neighborhood Attention Transformer

Paper • 2204.07143 • Published Apr 14, 2022

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 15

JamesXu

authored a paper over 1 year ago

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 15

JiayiGuo821

authored a paper over 1 year ago

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Paper • 2312.04410 • Published Dec 7, 2023 • 15

Humphrey

authored 3 papers over 1 year ago

AI & ML interests

Recent Activity

Team members 12

shi-labs's activity

Issue with missing text_unet/versatile_diffusion.py file in model_index.json

Versatile Diffusion

VCoder

OLA-VLM

Apply for community grant: Academic project (gpu)