What is Marlin?

#1
by Samvanity - opened

As the title says: what does the suffix Marlin mean? Is it a special fine-tune?

Thanks!

Owner

Hey @Samvanity, Marlin is a 4-bit weight-quantized format for performant inference. You can read the latest summary in this blog post: https://neuralmagic.com/blog/pushing-the-boundaries-of-mixed-precision-llm-inference-with-marlin/
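
To give a rough feel for what "4-bit weight quantized" means, here is a minimal sketch of generic group-wise 4-bit quantization in plain Python. It is an illustration only, not Marlin's actual memory layout or kernel: the function names (`quantize_group`, `pack_nibbles`, and so on), the symmetric scheme with an offset of 8, and the group size of 8 are all assumptions made for the example.

```python
# Illustrative only: generic 4-bit weight quantization.
# This is NOT Marlin's real packing layout or kernel; all names are hypothetical.

def quantize_group(weights):
    """Symmetric 4-bit quantization of one group.

    Returns (codes, scale) where each code is an unsigned nibble in 0..15.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    # Map each weight to a signed integer in [-8, 7], then shift by 8
    # so it can be stored as an unsigned 4-bit value.
    codes = [min(15, max(0, round(w / scale) + 8)) for w in weights]
    return codes, scale

def pack_nibbles(codes):
    """Pack eight 4-bit codes into a single 32-bit integer."""
    assert len(codes) == 8
    packed = 0
    for i, c in enumerate(codes):
        packed |= (c & 0xF) << (4 * i)
    return packed

def unpack_nibbles(packed):
    """Recover the eight 4-bit codes from a packed 32-bit integer."""
    return [(packed >> (4 * i)) & 0xF for i in range(8)]

def dequantize_group(codes, scale):
    """Turn 4-bit codes back into approximate floating-point weights."""
    return [(c - 8) * scale for c in codes]

weights = [0.7, -0.35, 0.1, 0.0, -0.7, 0.5, 0.2, -0.1]
codes, scale = quantize_group(weights)
packed = pack_nibbles(codes)
restored = dequantize_group(unpack_nibbles(packed), scale)
```

The eight weights end up in one 32-bit integer plus a single scale, and `restored` matches the originals to within half a quantization step. A real inference kernel would dequantize on the fly inside the matrix multiply rather than materializing `restored`.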

Thank you for making this great model available in Marlin format.
