dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen2.5-Math-1.5B

liked a model 1 day ago

google/gemma-3-4b-it

liked a Space 1 day ago

huggingface-projects/gemma-3-12b-it

View all activity

Organizations

None yet

huba-buba's activity

upvoted an article 1 day ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

3 days ago

• 242

upvoted an article 3 days ago

Article

Open R1: Update #3

and 9 others •

3 days ago

• 214

upvoted a paper 3 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 4 days ago • 63

upvoted an article 3 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

11 days ago

• 65

upvoted 2 papers 4 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 4 days ago • 53

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 9 days ago • 208

upvoted a collection 9 days ago

QwQ

Collection

Qwen with Questions • 6 items • Updated 8 days ago • 82

upvoted a paper 12 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 22 days ago • 45

upvoted 2 articles 13 days ago

Article

SigLIP 2: A better multilingual vision language encoder

22 days ago

• 134

Article

SmolVLM2: Bringing Video Understanding to Every Device

23 days ago

• 205

upvoted a paper 17 days ago

WebGames: Challenging General-Purpose Web-Browsing AI Agents

Paper • 2502.18356 • Published 17 days ago • 11

upvoted a paper 20 days ago

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published 22 days ago • 11

upvoted an article 21 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 198

upvoted a paper 22 days ago

Thinking Preference Optimization

Paper • 2502.13173 • Published 25 days ago • 16

upvoted an article 26 days ago

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

• 25

upvoted 3 papers 29 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 29 days ago • 33

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published about 1 month ago • 50

Scaling Pre-training to One Hundred Billion Data for Vision Language Models

Paper • 2502.07617 • Published Feb 11 • 29

upvoted an article 30 days ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 204

upvoted a paper 30 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 47