2 12 4

Tianduo Wang

Tianduo

TianduoWang

AI & ML interests

nlp, representation learning

Recent Activity

liked a dataset 9 days ago

agentica-org/DeepScaleR-Preview-Dataset

upvoted a paper 28 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

upvoted a paper 2 months ago

Fast Video Generation with Sliding Tile Attention

View all activity

Organizations

Tianduo's activity

liked a dataset 9 days ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 3.5k • 107

upvoted a paper 28 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 29 days ago • 117

upvoted a paper 2 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

liked a dataset 6 months ago

neuralwork/arxiver

Viewer • Updated Nov 1, 2024 • 63.4k • 805 • 363

upvoted a paper 6 months ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 32

upvoted a paper 7 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 73

upvoted a paper 9 months ago

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31, 2024 • 18

authored a paper 9 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34

upvoted a paper 9 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34

commented a paper 9 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 34 •

authored 2 papers 9 months ago

Learning Multi-Step Reasoning by Solving Arithmetic Tasks

Paper • 2306.01707 • Published Jun 2, 2023 • 1

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 94

upvoted 2 papers 9 months ago

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 45

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 163

liked a model 10 months ago

fishaudio/fish-speech-1.2

Text-to-Speech • Updated Jul 2, 2024 • 123 • 207

upvoted 2 papers 10 months ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 43

Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24, 2024 • 34

upvoted a paper about 1 year ago

ReFT: Reasoning with Reinforced Fine-Tuning

Paper • 2401.08967 • Published Jan 17, 2024 • 32

liked a model about 1 year ago

stabilityai/stable-code-3b

Text Generation • Updated Jul 10, 2024 • 6.6k • 643

updated a dataset over 1 year ago

Tianduo/gsm8k-split

Viewer • Updated Dec 28, 2023 • 8.79k • 67 • 1