Bowen's picture

1 2

Bowen

PeterJinGo

·

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo

upvoted a paper 14 days ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

updated a collection 15 days ago

View all activity

Organizations

Collections 1

Papers 6

arxiv:2503.09516

arxiv:2410.07157

arxiv:2410.05983

arxiv:2404.07103

models 11

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo

Updated 7 days ago • 184

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-it-em-ppo

Updated 16 days ago • 27

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo

Updated 16 days ago • 7

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo

Updated 16 days ago • 77

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo

Updated 16 days ago • 1.11k

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-it-em-grpo

Updated 16 days ago • 4

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo

Updated 16 days ago • 75

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo

Updated 16 days ago • 122

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-em-ppo

Updated 16 days ago • 1.16k

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-em-grpo

Updated 16 days ago • 3

datasets 11

PeterJinGo/nq_hotpotqa_train

Viewer • Updated 15 days ago • 221k • 266

PeterJinGo/wiki-18-e5-index

Updated 30 days ago • 1.76k

PeterJinGo/wiki-18-corpus

Updated 30 days ago • 824

PeterJinGo/ultrafeedback_first_5000

Viewer • Updated Jan 15 • 5k • 8

PeterJinGo/gsm8k-chat

Viewer • Updated Jan 12 • 7.47k • 42

PeterJinGo/math-zeroshot-chat

Viewer • Updated Dec 23, 2024 • 7.5k • 46

PeterJinGo/math-zeroshot

Viewer • Updated Dec 20, 2024 • 7.5k • 43

PeterJinGo/math2

Viewer • Updated Dec 9, 2024 • 7.5k • 37

PeterJinGo/math

Viewer • Updated Dec 6, 2024 • 7.5k • 51

PeterJinGo/gsm8k

Viewer • Updated Dec 2, 2024 • 7.47k • 47