1 53 63

wei

fengwei

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Kimi-VL Technical Report

upvoted a collection 5 days ago

Kimi-VL-A3B

liked a model 8 days ago

jinaai/jina-reranker-m0

View all activity

Organizations

None yet

fengwei's activity

upvoted a paper 5 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 6 days ago • 109

upvoted a collection 5 days ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 4 days ago • 58

upvoted 2 papers 11 days ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 13 days ago • 74

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 16 days ago • 241

upvoted a paper 13 days ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 15 days ago • 78

upvoted a paper 16 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 21 days ago • 134

upvoted an article 20 days ago

Article

Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

Mar 11, 2022

• 11

upvoted 4 papers 20 days ago

upvoted a paper 21 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 23 days ago • 114

upvoted a paper 27 days ago

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 76

upvoted 5 papers about 2 months ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 142

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 190

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 181

upvoted 2 papers 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 224

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 117