BitNet 1.58-bit

#2
by Qapdex - opened
export LD_LIBRARY_PATH= LD_LIBRARY_PATH

SLM-ATP-Hybrid-Projector

Microsoft DevBlog

ik_llama
This repository is a fork of llama.cpp with better CPU and hybrid GPU/CPU performance, new SOTA quantization types, // first-class Bitnet // support, better DeepSeek performance via MLA, FlashMLA, fused MoE operations and tensor overrides for hybrid GPU/CPU inference, row-interleaved quant packing, etc.

Sign up or log in to comment