neuralmagic/OpenHermes-2.5-Mistral-7B-pruned50
Text Generation
•
Updated
•
1.34k
•
1
LLMs compressed using SparseGPT and GPTQ for optimized inference with nm-vllm https://github.com/neuralmagic/nm-vllm