GreenBit LLaMA

This is GreenBitAI's pretrained 4-bit LLaMA 3B model with advanced compression design and lossless performance to FP16 models.

Please refer to our Github page for the code to run the model and more information.

Model Description

  • Developed by: GreenBitAI
  • Model type: Causal (Llama)
  • Language(s) (NLP): English
  • License: Apache 2.0
Downloads last month
13
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.