-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
Hung Le
neurocoder
AI & ML interests
None yet
Recent Activity
new activity
about 2 months ago
neurocoder/Qwen2.5-0.5B-Instruct-MemoryR:Improve language tag
new activity
about 2 months ago
neurocoder/logsQwen2.5-0.5B-Instruct-math-gsm8k:Improve language tag
upvoted
a
paper
2 months ago
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with
Knowledge Sparkle Dust
Organizations
None yet