arxiv:2501.00911
Sanjiban Choudhury
sc2582
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
rl-llm-agent/Llama-3.2-3B-Instruct-online-dpo-alfworld-iter1
authored
a paper
3 days ago
The Virtues of Laziness in Model-based RL: A Unified Objective and
Algorithms
authored
a paper
3 days ago
Inverse Reinforcement Learning without Reinforcement Learning
Organizations
Papers
11
spaces
1
models
None public yet
datasets
None public yet