Chao Zhou

ASHIDAKA

AI & ML interests

Object Detection, Transformer

Recent Activity

liked a dataset 5 days ago

open-r1/OpenR1-Math-220k

Organizations

None yet

ASHIDAKA's activity

upvoted an article 5 days ago

Article

Open R1: Update #2

and 6 others • 6 days ago • 166

upvoted an article 11 days ago

Article

Open-R1: Update #1

and 7 others • 15 days ago • 280

upvoted an article 25 days ago

Article

How to train a Language Model with Megatron-LM

Sep 7, 2022 • 8

upvoted a collection 3 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 6 days ago • 69

upvoted 4 papers 4 months ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17, 2024 • 38

upvoted 2 papers 6 months ago

Diffusion Policy Policy Optimization

Paper • 2409.00588 • Published Sep 1, 2024 • 20

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 98

upvoted 2 papers 7 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 113

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Paper • 2407.17365 • Published Jul 24, 2024 • 12

upvoted a collection 7 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 648

upvoted a paper 7 months ago

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Paper • 2407.14057 • Published Jul 19, 2024 • 45

upvoted 2 papers 8 months ago

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published Jun 11, 2024 • 57

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Paper • 2406.04314 • Published Jun 6, 2024 • 28

upvoted 4 papers 9 months ago

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Paper • 2406.02900 • Published Jun 5, 2024 • 12

I4VGen: Image as Stepping Stone for Text-to-Video Generation

Paper • 2406.02230 • Published Jun 4, 2024 • 17

Aya 23: Open Weight Releases to Further Multilingual Progress

Paper • 2405.15032 • Published May 23, 2024 • 28

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 130