-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
Hung Le
neurocoder
AI & ML interests
None yet
Organizations
None yet
My papers
-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
models
5

neurocoder/Qwen2.5-0.5B-Instruct-MemoryR
Updated

neurocoder/logsQwen2.5-0.5B-Instruct-math-gsm8k
Text Generation
•
0.5B
•
Updated
•
10

neurocoder/Qwen2.5-0.5B-Open-R1-Code-GRPO
Updated

neurocoder/Falcon3-1B-Instruct-sft-math-gsm8k
2B
•
Updated
•
2

neurocoder/Llama-3.2-1B-Instruct-sft-math-gsm8k
Text Generation
•
1B
•
Updated
•
10
datasets
0
None public yet