Gibran Iqbal PRO
Jibbscript
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
upvoted a paper about 17 hours ago
ReviewScore: Misinformed Peer Review Detection with Large Language Models
upvoted a paper about 17 hours ago
Language Models Can Learn from Verbal Feedback Without Scalar Rewards