-
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 55 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 62 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 95 -
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 205
Jakhongir Saydaliev
Jakh0103
·
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
Papers
liked
a Space
about 1 month ago
huggingface/ai-deadlines
updated
a collection
about 2 months ago
Papers