PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation Paper • 2410.01680 • Published Oct 2, 2024 • 36
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Paper • 2410.21271 • Published Oct 28, 2024 • 7
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 45
NVILA: Efficient Frontier Visual Language Models Paper • 2412.04468 • Published Dec 5, 2024 • 60
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models Paper • 2412.07679 • Published Dec 10, 2024
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge Paper • 2411.12915 • Published Nov 19, 2024
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 2025 • 13
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Paper • 2504.11409 • Published Apr 2025 • 10
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 2025 • 86