Update README.md
README.md
CHANGED
@@ -20,8 +20,9 @@ base_model_relation: finetune
 
 The bfloat16 (bf16) weights for [mispeech/midashenglm-7b-0804-fp32](https://huggingface.co/mispeech/midashenglm-7b-0804-fp32).
 
-
-
+Recommended for most general-purpose scenarios, including inference and fine-tuning.
+It delivers quality comparable to FP32 while being significantly faster on modern GPUs (e.g., A100, H100, RTX 4090).
+The [original fp32 model](https://huggingface.co/mispeech/midashenglm-7b-0804-fp32) is only for strict numerical reproduction of benchmark results.
 
 ## Usage
 
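The added README lines claim that bf16 stays close to FP32 in quality. As a rough illustration of why that is plausible, the sketch below emulates bfloat16 rounding in pure Python: bf16 keeps the 8 exponent bits of float32 (so the representable range is essentially the same) and shortens the mantissa to 7 stored bits (so the relative rounding error is bounded by about 2^-8). The helper `to_bf16` is a hypothetical name introduced here for illustration; it is not part of this repository or of any library, and it only handles finite inputs.

```python
import struct


def to_bf16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value, returned as a Python float.

    Hypothetical helper for illustration only; NaN and infinity are not handled.
    """
    # Reinterpret the float32 encoding of x as a 32-bit unsigned integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Round to nearest, ties to even, on the 16 low bits that bf16 drops.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + rounding_bias) & 0xFFFF0000
    # Reinterpret the truncated bit pattern as a float32 again.
    return struct.unpack("<f", struct.pack("<I", bits))[0]


if __name__ == "__main__":
    for value in (1.0, 3.14159265, 1e30):
        rounded = to_bf16(value)
        rel_err = abs(rounded - value) / abs(value)
        print(f"{value!r} -> {rounded!r} (relative error {rel_err:.2e})")
```

Values as large as 1e30 survive the round trip with a small relative error, which float16 could not represent at all (its largest finite value is 65504). This says nothing about speed; the throughput claim depends on GPU hardware support for bf16 and is not something this sketch measures.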