caojn's picture

24 6

caojn

iiJingnan

AI & ML interests

CV, NLP

Recent Activity

upvoted a paper 3 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

upvoted a paper 26 days ago

PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs

upvoted a paper 26 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

View all activity

Organizations

None yet

iiJingnan's activity

upvoted a paper 3 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 4 days ago • 63

upvoted 19 papers 26 days ago

PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs

Paper • 2502.00963 • Published Feb 3 • 16

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 30 days ago • 184

Revealing the Barriers of Language Agents in Planning

Paper • 2410.12409 • Published Oct 16, 2024 • 27

Exploring Model Kinship for Merging Large Language Models

Paper • 2410.12613 • Published Oct 16, 2024 • 21

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Paper • 2410.12628 • Published Oct 16, 2024 • 35

HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks

Paper • 2410.12381 • Published Oct 16, 2024 • 44

Towards Natural Image Matting in the Wild via Real-Scenario Prior

Paper • 2410.06593 • Published Oct 9, 2024 • 3

Empirical Study of Mutual Reinforcement Effect and Application in Few-shot Text Classification Tasks via Prompt

Paper • 2410.09745 • Published Oct 13, 2024 • 3

GS^3: Efficient Relighting with Triple Gaussian Splatting

Paper • 2410.11419 • Published Oct 15, 2024 • 12

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Paper • 2410.09754 • Published Oct 13, 2024 • 8

Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation

Paper • 2410.08001 • Published Oct 10, 2024 • 4

EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation

Paper • 2410.09704 • Published Oct 13, 2024 • 13

NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

Paper • 2410.11805 • Published Oct 15, 2024 • 13

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 13

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

Paper • 2410.11710 • Published Oct 15, 2024 • 20

Agent-as-a-Judge: Evaluate Agents with Agents

Paper • 2410.10934 • Published Oct 14, 2024 • 19

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Paper • 2410.11795 • Published Oct 15, 2024 • 18

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22, 2024 • 31

MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

Paper • 2410.11779 • Published Oct 15, 2024 • 26