Update README.md
README.md
CHANGED
@@ -7,7 +7,7 @@ license: apache-2.0
 Original Base Model: `meta-llama/Meta-Llama-3-8B-Instruct`.<br>
 Link: [https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
 
-Quantization Configurations
+## Quantization Configurations
 ```
 "quantization_config": {
 "batch_size": 1,
@@ -34,3 +34,6 @@ Quantization Configurations:
 "use_exllama": true
 },
 ```
+
+## Source Codes
+Source Codes: [https://github.com/vkola-lab/medpodgpt/tree/main/quantization](https://github.com/vkola-lab/medpodgpt/tree/main/quantization).