Zhongpai Gao

gaozhongpai

Gaozhongpai

AI & ML interests

3D computer vision

Recent Activity

upvoted a paper about 21 hours ago

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

upvoted a paper 2 days ago

7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting

upvoted a paper 2 days ago

Learning Continuous Mesh Representation with Spherical Implicit Surface

View all activity

Organizations

gaozhongpai's activity

upvoted a paper about 21 hours ago

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

Paper • 2503.14492 • Published 2 days ago • 14

upvoted 4 papers 2 days ago

7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting

Paper • 2503.07946 • Published 10 days ago • 1

Learning Continuous Mesh Representation with Spherical Implicit Surface

Paper • 2301.04695 • Published Jan 11, 2023 • 1

DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering

Paper • 2406.02518 • Published Jun 4, 2024 • 1

6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering

Paper • 2410.04974 • Published Oct 7, 2024 • 1

upvoted a paper 3 days ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published 6 days ago • 110

upvoted 2 papers 4 days ago

Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo

Paper • 2503.09799 • Published 8 days ago • 12

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Paper • 2503.10437 • Published 7 days ago • 28

upvoted a paper 11 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 14 days ago • 81

upvoted a paper 18 days ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published 22 days ago • 60

upvoted a paper 25 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 28 days ago • 131

upvoted a paper 30 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 145

upvoted 3 papers about 1 month ago

upvoted 2 papers about 2 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 112

Relightable Full-Body Gaussian Codec Avatars

Paper • 2501.14726 • Published Jan 24 • 10

upvoted a paper 3 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 142

upvoted a paper 5 months ago

Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats

Paper • 2410.12781 • Published Oct 16, 2024 • 6

upvoted a paper 6 months ago

MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation

Paper • 2410.02458 • Published Oct 3, 2024 • 9