Judge Assisted GRPO Tuning: The Pirates, Knights, and Vikings Experiment By vkerkez • about 18 hours ago
Undress AI: Technical Frameworks and Responsible Implementation in the Age of Generative Models By Emna112 • 4 days ago • 1
Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm By royswastik • 4 days ago • 1
Deploy Multimodal Models from Hugging Face to FriendliAI with Ease By FriendliAI and 2 others • 4 days ago • 15
Gagner au Loto grâce à l’IA : entre fantasme numérique et révolution statistique By Emna112 • 5 days ago • 1
Judge Assisted GRPO Tuning: The Pirates, Knights, and Vikings Experiment By vkerkez • about 18 hours ago
Undress AI: Technical Frameworks and Responsible Implementation in the Age of Generative Models By Emna112 • 4 days ago • 1
Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm By royswastik • 4 days ago • 1
Deploy Multimodal Models from Hugging Face to FriendliAI with Ease By FriendliAI and 2 others • 4 days ago • 15
Gagner au Loto grâce à l’IA : entre fantasme numérique et révolution statistique By Emna112 • 5 days ago • 1