Neural Magic Enterprise

Enterprise

company

https://neuralmagic.com/

neuralmagic

neuralmagic

AI & ML interests

LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV

Recent Activity

nm-research updated a collection 5 days ago

Mistral Quantized

nm-research updated a collection 5 days ago

Mistral Quantized

nm-research updated a collection 5 days ago

Mistral Quantized

View all activity

neuralmagic-ent's activity

nm-research

updated 2 collections 5 days ago

Mistral Quantized

6 items • Updated 5 days ago • 1

phi-4 quantized

5 items • Updated 5 days ago • 1

markurtz

authored 4 papers 4 months ago

How Well Do Sparse Imagenet Models Transfer?

Paper • 2111.13445 • Published Nov 26, 2021 • 1

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

Paper • 2203.07259 • Published Mar 14, 2022 • 4

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49