-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
Hung Le
neurocoder
AI & ML interests
None yet
Recent Activity
new activity
30 days ago
neurocoder/Qwen2.5-0.5B-Instruct-MemoryR:Improve language tag
new activity
30 days ago
neurocoder/logsQwen2.5-0.5B-Instruct-math-gsm8k:Improve language tag
upvoted
a
paper
about 2 months ago
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with
Knowledge Sparkle Dust
Organizations
None yet
Collections
1
models
5

neurocoder/Qwen2.5-0.5B-Instruct-MemoryR
Updated

neurocoder/logsQwen2.5-0.5B-Instruct-math-gsm8k
Text Generation
•
Updated
•
18

neurocoder/Qwen2.5-0.5B-Open-R1-Code-GRPO
Updated

neurocoder/Falcon3-1B-Instruct-sft-math-gsm8k
Updated
•
3

neurocoder/Llama-3.2-1B-Instruct-sft-math-gsm8k
Text Generation
•
Updated
•
21
datasets
0
None public yet