minyichen commited on
Commit
413551f
1 Parent(s): 135f123

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -10
README.md CHANGED
@@ -13,8 +13,6 @@ tags:
13
  - llama-3
14
  ---
15
 
16
- <img src="https://cdn-uploads.huggingface.co/production/uploads/5df9c78eda6d0311fd3d541f/vlfv5sHbt4hBxb3YwULlU.png" alt="Taiwan LLM Logo" width="600" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
17
-
18
  # Llama-3-Taiwan-70B-Instruct - GPTQ
19
  - Model creator: [Yen-Ting Lin](https://huggingface.co/yentinglin)
20
  - Original model: [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
@@ -24,19 +22,19 @@ tags:
24
 
25
  This repo contains GPTQ model files for [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct).
26
 
27
- Multiple GPTQ parameter permutations are provided; see Provided Files below for details of the options provided, their parameters, and the software used to create them.
28
-
29
  <!-- description end -->
30
  <!-- repositories-available start -->
31
  * [GPTQ models for GPU inference](minyichen/Llama-3-Taiwan-70B-Instruct-GPTQ)
32
  * [Yen-Ting Lin's original unquantized model](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
33
  <!-- repositories-available end -->
34
 
35
- <!-- prompt-template start -->
36
- ## Prompt template: Vicuna
 
 
 
 
 
 
37
 
38
- ```
39
- A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: {prompt} ASSISTANT:
40
 
41
- ```
42
- <!-- prompt-template end -->
 
13
  - llama-3
14
  ---
15
 
 
 
16
  # Llama-3-Taiwan-70B-Instruct - GPTQ
17
  - Model creator: [Yen-Ting Lin](https://huggingface.co/yentinglin)
18
  - Original model: [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
 
22
 
23
  This repo contains GPTQ model files for [Llama-3-Taiwan-70B-Instruct](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct).
24
 
 
 
25
  <!-- description end -->
26
  <!-- repositories-available start -->
27
  * [GPTQ models for GPU inference](minyichen/Llama-3-Taiwan-70B-Instruct-GPTQ)
28
  * [Yen-Ting Lin's original unquantized model](https://huggingface.co/yentinglin/Llama-3-Taiwan-70B-Instruct)
29
  <!-- repositories-available end -->
30
 
31
+ ## Quantization parameter
32
+
33
+ - Bits : 4
34
+ - Group Size : 128
35
+ - Act Order : Yes
36
+ - Damp % : 0.01
37
+ - Seq Len : 2048
38
+ - Size : 37.07 GB
39
 
 
 
40