Junxiao Song
haha-point
AI & ML interests
None yet
Recent Activity
authored
a paper
6 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
authored
a paper
6 months ago
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for
Reinforcement Learning and Monte-Carlo Tree Search
Organizations
None yet
haha-point's activity
What's the difference between zephyr-7b-beta and zephyr-7b-alpha?
1
#36 opened about 1 year ago
by
haha-point