Shawn/Yuxuan Tong's picture

Shawn/Yuxuan Tong

tongyx361

·

https://tongyx361.github.io

AI & ML interests

Aiming to build AI systems to better serve every human being, especially for complex intellectual activities. Specifically interested in the following topics: 1) Large Language Model (LLM) 2) AI for (Advanced) Education (e.g. Eureka Labs) / Research (e.g. SciCode-Bench) / Software Engineering (e.g. SWE-Bench) 3) Scalable Alignment（e.g. Scalable Oversight) 4) Hardware-Algorithm Co-Design (e.g. Flash Attention)

Recent Activity

upvoted a collection 18 days ago

published a dataset 20 days ago

tongyx361/math-train-qwq-rs-n32

updated a collection 20 days ago

Demysitifying Long CoT

View all activity

Organizations

tongyx361's activity

New activity in Alibaba-NLP/gte-Qwen2-7B-instruct 6 months ago

Fix eval_mteb.py of undefined variables

#33 opened 6 months ago by

commented 2 papers 8 months ago

DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

Paper • 2407.13690 • Published Jun 18, 2024 • 2 •

DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

Paper • 2407.13690 • Published Jun 18, 2024 • 2 •

New activity in deepseek-ai/deepseek-math-7b-rl about 1 year ago

Should I use CoT prompting in RL model as instruction-tuned model?

#3 opened about 1 year ago by

New activity in Vivacem/MMIQC about 1 year ago

The number of data sets is inconsistent with the paper

#2 opened about 1 year ago by

New activity in meta-math/MetaMath-Mistral-7B about 1 year ago

What are the training hyperparameters?

#4 opened about 1 year ago by

New activity in peiyi9979/math-shepherd-mistral-7b-prm about 1 year ago

Why does the config show this is a LLaMA model?

#1 opened about 1 year ago by

New activity in yuvalkirstain/pickapic_v1 almost 2 years ago

Is there any valid method to download the dataset without images or part of it like test split only?

#2 opened almost 2 years ago by

Is there any valid method to download the dataset without images or part of it like test split only?

#2 opened almost 2 years ago by