Michael Rawle's picture

7

Michael Rawle

therubberrabbit

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

upvoted a paper 5 days ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

upvoted a paper 5 days ago

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

View all activity

Organizations

None yet

therubberrabbit's activity

upvoted 4 papers 5 days ago

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

Paper • 2502.01572 • Published 7 days ago • 19

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published 7 days ago • 32

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Paper • 2502.02589 • Published 6 days ago • 8

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 6 days ago • 46

upvoted 3 papers 6 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 7 days ago • 53

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 7 days ago • 166

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 10 days ago • 20