Manuel Romero's picture

Manuel Romero PRO

mrm8488

·

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

upvoted a paper 3 days ago

TTRL: Test-Time Reinforcement Learning

upvoted an article 4 days ago

Tiny Agents: a MCP-powered agent in 50 lines of code

View all activity

Organizations

mrm8488's activity

upvoted a paper 3 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 7 days ago • 93

upvoted an article 4 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

5 days ago

• 186

upvoted a paper 14 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 130

upvoted a collection 19 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 19 days ago • 77

upvoted an article 23 days ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 236

upvoted 2 collections about 1 month ago

Scaling Laws 📏

4 items • Updated Oct 15, 2024 • 3

🤖 Agents

21 items • Updated Dec 31, 2024 • 152

upvoted a paper about 1 month ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111

upvoted 3 collections about 2 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 21 items • Updated 14 days ago • 132

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated Mar 11 • 16

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated Mar 19 • 107

upvoted an article 2 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 159

upvoted 2 papers 2 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

NoLiMa: Long-Context Evaluation Beyond Literal Matching

Paper • 2502.05167 • Published Feb 7 • 15

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 228

upvoted a collection 3 months ago

WildChat-50m

All model responses associated with the WildChat-50m paper. • 55 items • Updated Jan 29 • 8

upvoted an article 3 months ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 475