AI & ML interests

LLMs, optimization, compression, sparsification, quantization, pruning, distillation, NLP, CV

Recent Activity

dsikka published a model about 1 month ago
neuralmagic/Llama-3.2-3B-Instruct-quantized.w8a8
jfinks25 updated a Space 3 months ago
neuralmagic/README