Llama-3-Taiwan-70B-Instruct - GPTQ
- Model creator: Yen-Ting Lin
- Original model: Llama-3-Taiwan-70B-Instruct
Description
This repo contains GPTQ model files for Llama-3-Taiwan-70B-Instruct.
Quantization parameters
- Bits : 4
- Group Size : 128
- Act Order : Yes
- Damp % : 0.1
- Seq Len : 2048
- Size : 37.07 GB
It took about 6.5 hours to quantize on an H100.
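For readers who want to reason about these settings, here is a minimal sketch that records them in a plain dictionary and estimates the storage cost per weight. The key names (`bits`, `group_size`, `desc_act`, `damp_percent`) follow the naming commonly used in GPTQ tooling and are an assumption, not taken from this repo's files. The helper `effective_bits_per_weight` is hypothetical and assumes each group stores one 16-bit scale and one 4-bit zero point; it ignores the small per-row index that act-order adds and any layers left unquantized.

```python
# Settings copied from the parameter list above.
# Key names are an assumption (common GPTQ config naming).
quantize_config = {
    "bits": 4,
    "group_size": 128,
    "desc_act": True,      # "Act Order: Yes"
    "damp_percent": 0.1,   # "Damp %: 0.1"
}
calibration_seq_len = 2048  # "Seq Len: 2048"


def effective_bits_per_weight(bits, group_size, scale_bits=16, zero_bits=None):
    """Estimate bits stored per quantized weight.

    Assumed layout (hypothetical): every group of `group_size` weights
    shares one scale (`scale_bits`) and one zero point (`zero_bits`,
    defaulting to the weight bit width).
    """
    if zero_bits is None:
        zero_bits = bits
    return bits + (scale_bits + zero_bits) / group_size


bpw = effective_bits_per_weight(
    quantize_config["bits"], quantize_config["group_size"]
)
print(f"{bpw:.5f} bits per weight")
```

Under these assumptions the per-group metadata adds roughly 0.16 bits on top of the 4-bit weights. A smaller group size would raise that overhead in exchange for finer-grained scales.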