arxiv:2410.08102
Xinlin Zhuang
Mihara-bot
·
AI & ML interests
NLP, Text classification, Text generation
Recent Activity
liked
a dataset
7 days ago
tasksource/PRM800K
liked
a model
8 days ago
TinyLlama/TinyLlama-1.1B-intermediate-step-240k-503b
liked
a dataset
9 days ago
peiyi9979/Math-Shepherd
Organizations
None yet
Papers
1
models
12
Mihara-bot/BART-BASE-CHINESE-V2
Text2Text Generation
•
Updated
•
8
Mihara-bot/ppo-SnowballTargetTESTCOLAB
Reinforcement Learning
•
Updated
•
33
Mihara-bot/ppo-PyramidsTESTCOLAB
Reinforcement Learning
•
Updated
•
15
Mihara-bot/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Mihara-bot/Reinforce-FCNN
Reinforcement Learning
•
Updated
Mihara-bot/tuned-ppo-LunarLander-v2
Updated
Mihara-bot/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
2
Mihara-bot/taxi-v3
Reinforcement Learning
•
Updated
Mihara-bot/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Mihara-bot/PPO-LunarLander-v2
Reinforcement Learning
•
Updated
•
3
datasets
None public yet