valeriojob commited on
Commit
e0fceed
1 Parent(s): 4ead056

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -18,6 +18,11 @@ tags:
18
  - Version 1 (v.1) of MedGPT is the very first version of MedGPT and the training dataset has been kept simple and small with only 60 examples.
19
  - This repo includes the quantized models in the GGUF format. There is a separate repo called [valeriojob/MedGPT-Llama3.1-8B-BA-v.1](https://huggingface.co/valeriojob/MedGPT-Llama3.1-8B-BA-v.1) that includes the default 16bit format of the model as well as the LoRA adapters of the model.
20
  - This model was quantized using [llama.cpp](https://github.com/ggerganov/llama.cpp).
 
 
 
 
 
21
 
22
  ## Model description
23
 
 
18
  - Version 1 (v.1) of MedGPT is the very first version of MedGPT and the training dataset has been kept simple and small with only 60 examples.
19
  - This repo includes the quantized models in the GGUF format. There is a separate repo called [valeriojob/MedGPT-Llama3.1-8B-BA-v.1](https://huggingface.co/valeriojob/MedGPT-Llama3.1-8B-BA-v.1) that includes the default 16bit format of the model as well as the LoRA adapters of the model.
20
  - This model was quantized using [llama.cpp](https://github.com/ggerganov/llama.cpp).
21
+ - This model is available in the following quantization formats:
22
+ - BF16
23
+ - Q4_K_M
24
+ - Q5_K_M
25
+ - Q8_0
26
 
27
  ## Model description
28