SimPO: Simple Preference Optimization with a Reference-Free Reward • arXiv:2405.14734 • Published May 23, 2024
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models • arXiv:2402.19427 • Published Feb 29, 2024
Simple Linear Attention Language Models Balance the Recall-Throughput Tradeoff • arXiv:2402.18668 • Published Feb 28, 2024
CroissantLLM: A Truly Bilingual French-English Language Model • arXiv:2402.00786 • Published Feb 1, 2024
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads • arXiv:2401.10774 • Published Jan 19, 2024
Mamba: Linear-Time Sequence Modeling with Selective State Spaces • arXiv:2312.00752 • Published Dec 1, 2023