16 4 1

Eldar Kurtic

ekurtic

AI & ML interests

Compression of deep neural networks

Recent Activity

updated a model about 2 hours ago

daslab-testing/DeepSeek-R1-Distill-Qwen-14B-HRSTQ-4bit-128g

updated a model 12 days ago

neuralmagic/Mixtral-8x7B-Instruct-v0.1-FP8

updated a model 20 days ago

neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8

View all activity

Organizations

ekurtic's activity

updated a model about 2 hours ago

daslab-testing/DeepSeek-R1-Distill-Qwen-14B-HRSTQ-4bit-128g

Updated about 2 hours ago

updated a model 12 days ago

neuralmagic/Mixtral-8x7B-Instruct-v0.1-FP8

Updated 12 days ago • 2.31k

updated a model 20 days ago

neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8

Updated 20 days ago • 412

published a model about 1 month ago

nm-testing/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic

Text Generation • Updated Feb 1 • 363 • 3

updated a model about 1 month ago

nm-testing/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic

Text Generation • Updated Feb 1 • 363 • 3

New activity in neuralmagic/Mistral-Small-24B-Instruct-2501-FP8-Dynamic about 2 months ago

Add OpenLLM Leaderboard V1 and V2 evals

#1 opened about 2 months ago by

nm-research

updated a model 2 months ago

nm-testing/DeepSeek-V2.5-1210-quantized.w4a16

Updated Jan 7 • 19

updated a model 3 months ago

nm-testing/test-w4a16-mixtral-actorder-group

Updated Dec 26, 2024 • 100

updated a model 4 months ago

neuralmagic/Sparse-Llama-3.1-8B-2of4

Text Generation • Updated Dec 16, 2024 • 127 • 62

upvoted a paper 4 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49

commented a paper 4 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49 •

authored 4 papers 4 months ago

Error Feedback Can Accurately Compress Preconditioners

Paper • 2306.06098 • Published Jun 9, 2023

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence

Paper • 2405.15593 • Published May 24, 2024 • 1

Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning

Paper • 2407.10994 • Published Jun 24, 2024 • 2

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49

updated a Space 4 months ago

Quant Llms Text Generation

🔥

Quantized vs. Unquantized LLM: Text Generation Comparison

New activity in neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic 4 months ago

Context length

#2 opened 4 months ago by

galigator

authored a paper 5 months ago

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Paper • 2410.14649 • Published Oct 18, 2024 • 9

updated 2 models 5 months ago

neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 7.87k • 5

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 4.95k • 6