Motoki Wu

tokestermw

https://motoki.co

AI & ML interests

None yet

Recent Activity

upvoted an article about 4 hours ago

Open-R1: Update #1

liked a model about 19 hours ago

Satori-reasoning/Satori-7B-Round2

liked a model 1 day ago

HuggingFaceTB/SmolLM2-135M-Instruct

tokestermw's activity

upvoted an article about 4 hours ago

Open-R1: Update #1

Article • 5 days ago • 239

liked a model about 19 hours ago

Satori-reasoning/Satori-7B-Round2

Updated 2 days ago • 62 • 4

liked 2 models 1 day ago

HuggingFaceTB/SmolLM2-135M-Instruct

Text Generation • Updated 1 day ago • 115k • 99

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 1 day ago • 99.9k • 499

upvoted a paper 2 days ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published 4 days ago • 16

upvoted an article 2 days ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • 3 days ago • 30

upvoted 2 papers 7 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 8 days ago • 79

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 8 days ago • 51

upvoted an article 7 days ago

How to deploy and fine-tune DeepSeek models on AWS

Article • 8 days ago • 35

liked a model 7 days ago

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated 5 days ago • 84.4k • 643

liked a model 8 days ago

watt-ai/watt-tool-70B

Updated Dec 20, 2024 • 4.74k • 29

upvoted a paper 9 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 10 days ago • 100

liked a model 9 days ago

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated 6 days ago • 71.6k • 336

upvoted a paper 10 days ago

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published 21 days ago • 17

upvoted an article 10 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 10 days ago • 658

liked a dataset 10 days ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Dec 2, 2024 • 500 • 182k • 134

liked a model 14 days ago

HuggingFaceTB/SmolVLM-500M-Instruct

Image-Text-to-Text • Updated 6 days ago • 14.7k • 96

upvoted a paper 14 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 16 days ago • 23

liked 2 models 17 days ago

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • Updated 6 days ago • 353k • 451

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 6 days ago • 368k • 355