GreenBit LLaMA

This is GreenBitAI's pretrained 4-bit LLaMA 3B model with advanced compression design and lossless performance to FP16 models.

Please refer to our Github page for the code to run the model and more information.

Model Description

Developed by: GreenBitAI
Model type: Causal (Llama)
Language(s) (NLP): English
License: Apache 2.0

Downloads last month: 13

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.