GreenBit LLaMA

This is GreenBitAI's pretrained 2-bit LLaMA model, which combines extreme compression with strong performance.
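
To give a rough sense of what 2-bit compression means for storage, the sketch below compares raw weight size at 16 bits and at 2 bits per weight. The parameter count is a hypothetical placeholder, not a figure from this model card, and the estimate ignores quantization metadata (such as scales and zero points) and any layers kept at higher precision, so real checkpoints will be somewhat larger.

```python
def weight_bytes(num_params: int, bits_per_weight: int) -> float:
    """Raw storage needed for the weights alone, in bytes."""
    return num_params * bits_per_weight / 8


# Hypothetical parameter count, chosen only for illustration.
NUM_PARAMS = 7_000_000_000

fp16_gib = weight_bytes(NUM_PARAMS, 16) / 2**30
two_bit_gib = weight_bytes(NUM_PARAMS, 2) / 2**30
ratio = fp16_gib / two_bit_gib

print(f"FP16 weights:  {fp16_gib:.2f} GiB")
print(f"2-bit weights: {two_bit_gib:.2f} GiB")
print(f"Reduction:     {ratio:.0f}x")
```

Under these assumptions the reduction is simply the ratio of bit widths (16 / 2), independent of the parameter count.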

Please refer to our GitHub page for the code to run the model and for more information.

Model Description
