Mutual Theory of Mind for Human-AI Communication Paper • 2210.03842 • Published Oct 7, 2022 • 1
Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System Paper • 2502.16750 • Published Feb 23 • 10
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published Feb 25 • 73
Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published Feb 23 • 13
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 69
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement Paper • 2410.04444 • Published Oct 6, 2024 • 2