Samuel Arcadinho's picture

4 31 4

Samuel Arcadinho

SSamDav

·

SSamDav

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

MIEB: Massive Image Embedding Benchmark

upvoted a paper 13 days ago

Kimi-VL Technical Report

upvoted a paper 15 days ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

View all activity

Organizations

SSamDav's activity

upvoted a paper 9 days ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published 10 days ago • 15

upvoted a paper 13 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 14 days ago • 120

upvoted 2 papers 15 days ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published 16 days ago • 81

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 16 days ago • 150

upvoted a paper 28 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 29 days ago • 141

upvoted 3 papers about 1 month ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 143

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 160

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 43

upvoted 4 papers about 2 months ago

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published Mar 3 • 32

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 78

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 100

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18 • 17

upvoted 2 collections 2 months ago

Dria-Agent-a

powerful agentic models built for pythonic function calling • 4 items • Updated Feb 14 • 4

Tiny-Agent-a

fast and powerful agentic models designed to run on edge devices. • 6 items • Updated Feb 12 • 7

upvoted a paper 2 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 140

upvoted 4 papers 3 months ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 22

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 12

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 120

upvoted a paper 4 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 148