samusenps

AI & ML interests

Foundational Architectures, Multi-Modality, Interpretability, Benchmarking with simulations, Robotics, Integration with a non-invasive, open-source RISC-V BCI stack. Extremely high-quality training data. Fully open-source ML/AI.

Recent Activity

upvoted a paper 2 days ago

Structured 3D Latents for Scalable and Versatile 3D Generation

liked a Space 3 days ago

osanseviero/how-much-do-i-cost

liked a model 3 days ago

SparkAudio/Spark-TTS-0.5B

samusenps's activity

upvoted a paper 2 days ago

Structured 3D Latents for Scalable and Versatile 3D Generation

Paper • 2412.01506 • Published Dec 2, 2024 • 64

liked a Space 3 days ago

How Much Do I Cost

liked 3 models 3 days ago

upvoted 6 papers 3 days ago

CoRe^2: Collect, Reflect and Refine to Generate Better and Faster

Paper • 2503.09662 • Published 6 days ago • 29

Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models

Paper • 2503.09669 • Published 6 days ago • 34

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published 5 days ago • 44

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published 5 days ago • 64

Transformers without Normalization

Paper • 2503.10622 • Published 5 days ago • 121

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 349

upvoted 3 papers 6 days ago

Frontier Models are Capable of In-context Scheming

Paper • 2412.04984 • Published Dec 6, 2024 • 2

Multi-Turn Code Generation Through Single-Step Rewards

Paper • 2502.20380 • Published 19 days ago • 30

Simple Guidance Mechanisms for Discrete Diffusion Models

Paper • 2412.10193 • Published Dec 13, 2024 • 1

liked a model 20 days ago

perplexity-ai/r1-1776

Text Generation • Updated 20 days ago • 60.8k • 2.16k

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 206

liked a model about 2 months ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 245k • 3.23k

upvoted 3 articles about 2 months ago

State of open video generation models in Diffusers

Article • Jan 27 • 50

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 814

Welcome to Inference Providers on the Hub 🔥

Article • Jan 28 • 437