5 32 91

Pu Fanyi

pufanyi

https://pufanyi.github.io

AI & ML interests

Recent Activity

liked a dataset 14 days ago

allenai/WildChat

upvoted an article about 1 month ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

liked a model about 1 month ago

ds4sd/SmolDocling-256M-preview

View all activity

Organizations

pufanyi's activity

liked a dataset 14 days ago

allenai/WildChat

Viewer • Updated Oct 17, 2024 • 529k • 1.49k • 142

upvoted an article about 1 month ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 172

liked a model about 1 month ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated 30 days ago • 81.5k • 1.25k

upvoted a paper about 1 month ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 97

liked a model about 1 month ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • Updated 14 days ago • 225k • 212

liked a Space about 1 month ago

Tesseract OCR

🐢

Extract text from images

liked a dataset about 1 month ago

open-r1/codeforces-cots

Viewer • Updated 25 days ago • 254k • 10.3k • 148

liked a Space about 1 month ago

Multimodal SAE

💬

Demo for Multimodal-SAE

liked 2 models 2 months ago

ds4sd/docling-models

Updated Mar 4 • 704k • 129

deepseek-ai/DeepSeek-R1

Text Generation • Updated 26 days ago • 1.76M • • 12k

upvoted a paper 2 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

upvoted 2 papers 3 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 120

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

liked 2 datasets 3 months ago

lmms-lab/multimodal-open-r1-8k-verified

Viewer • Updated Jan 27 • 7.69k • 1.34k • 52

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 130k • 453

upvoted a paper 3 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 26

authored a paper 3 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 26

upvoted a paper 3 months ago

Fine-Tuning Language Models with Just Forward Passes

Paper • 2305.17333 • Published May 27, 2023 • 3

liked a model 3 months ago

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 597 • 596

upvoted a paper 3 months ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 9