sergicalsix's picture

1 259

sergicalsix

sergicalsix

·

AI & ML interests

None yet

Recent Activity

updated a collection about 20 hours ago

2025 LLM Papers on Hugging Face with Japanese Memos

upvoted a paper 3 days ago

AgentRxiv: Towards Collaborative Autonomous Research

upvoted a paper 3 days ago

Modifying Large Language Model Post-Training for Diverse Creative Writing

View all activity

Organizations

None yet

sergicalsix's activity

upvoted 7 papers 3 days ago

AgentRxiv: Towards Collaborative Autonomous Research

Paper • 2503.18102 • Published 8 days ago • 21

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published 10 days ago • 33

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published 11 days ago • 47

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published 4 days ago • 71

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 6 days ago • 105

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 7 days ago • 110

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published 11 days ago • 70

upvoted 4 papers 10 days ago

Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

Paper • 2503.11514 • Published 19 days ago • 15

Inside-Out: Hidden Factual Knowledge in LLMs

Paper • 2503.15299 • Published 12 days ago • 49

Tokenize Image as a Set

Paper • 2503.16425 • Published 11 days ago • 14

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 13 days ago • 131

upvoted an article 13 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

20 days ago

• 360

upvoted 8 papers 13 days ago

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 25 days ago • 43

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published 25 days ago • 52

CoSTAast: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published 18 days ago • 75

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 21 days ago • 83

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 26 days ago • 221

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published 24 days ago • 4

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 97

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 82