dongyf's picture

17 5

dongyf

Dongyuff

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

upvoted a paper 24 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

liked a model 24 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

View all activity

Organizations

None yet

Dongyuff's activity

upvoted a paper 1 day ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 2 days ago • 61

upvoted a paper 24 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 27 days ago • 183

liked 5 models 24 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 16 days ago • 1.18M • 538

deepseek-ai/DeepSeek-R1-Distill-Llama-8B

Text Generation • Updated 16 days ago • 1.55M • • 636

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 821k • • 2.13k

tencent/Hunyuan3D-2

Image-to-3D • Updated 12 days ago • 33k • 1.05k

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated Feb 2 • 578k • • 860

upvoted 13 papers 24 days ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 28

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 20

Frontiers in Intelligent Colonoscopy

Paper • 2410.17241 • Published Oct 22, 2024 • 4

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Paper • 2410.14649 • Published Oct 18, 2024 • 9

Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes

Paper • 2410.16930 • Published Oct 22, 2024 • 8

3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors

Paper • 2410.16266 • Published Oct 21, 2024 • 5

LLM-based Optimization of Compound AI Systems: A Survey

Paper • 2410.16392 • Published Oct 21, 2024 • 15

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

Paper • 2410.17250 • Published Oct 22, 2024 • 15

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published Oct 22, 2024 • 15

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 23

xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs

Paper • 2410.16267 • Published Oct 21, 2024 • 18

Mitigating Object Hallucination via Concentric Causal Attention

Paper • 2410.15926 • Published Oct 21, 2024 • 17

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21, 2024 • 26