jimbozhang committed on
Commit 1262f3f · verified · 1 Parent(s): f78687c

Update README.md

Files changed (1)
  1. README.md +3 -2
README.md CHANGED
@@ -20,8 +20,9 @@ base_model_relation: finetune
 
 The bfloat16 (bf16) weights for [mispeech/midashenglm-7b-0804-fp32](https://huggingface.co/mispeech/midashenglm-7b-0804-fp32).
 
-For production deployments, use this bf16 version for optimal efficiency. The original fp32 model is for strict numerical reproduction of benchmark results.
-
+Recommended for most general-purpose scenarios, including inference and fine-tuning.
+It delivers quality comparable to FP32 while being significantly faster on modern GPUs (e.g., A100, H100, RTX 4090).
+The [original fp32 model](https://huggingface.co/mispeech/midashenglm-7b-0804-fp32) is only for strict numerical reproduction of benchmark results.
 
 ## Usage
 
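
The trade-off described in the added README lines (bf16 versus fp32) can be illustrated with a small, standard-library-only sketch. It is not taken from the repository: `to_bf16`, `weight_gib`, and `ASSUMED_PARAMS` are hypothetical names, and the 7e9 parameter count is only an assumption suggested by "7b" in the model name. The sketch emulates bfloat16 rounding by keeping the top 16 bits of a float32 value, and compares the memory needed for weights at 2 versus 4 bytes per parameter.

```python
import struct

# Assumption: roughly 7 billion parameters, inferred only from "7b" in the model name.
ASSUMED_PARAMS = 7_000_000_000


def to_bf16(x: float) -> float:
    """Round x to the nearest bfloat16 value (ties to even) and return it as a float.

    bfloat16 is the upper 16 bits of an IEEE float32: 1 sign bit, 8 exponent
    bits, 7 mantissa bits. NaN and infinity are not handled specially here.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Add a rounding bias so that dropping the low 16 bits rounds to nearest even.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + bias) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def weight_gib(num_params: int, bytes_per_param: int) -> float:
    """Memory for the weights alone, in GiB (excludes activations and caches)."""
    return num_params * bytes_per_param / 2**30


if __name__ == "__main__":
    for value in (1.0, 3.14159, 1e30):
        rounded = to_bf16(value)
        rel_err = abs(rounded - value) / abs(value)
        print(f"{value!r:>10} -> bf16 {rounded!r} (relative error {rel_err:.2e})")

    print(f"fp32 weights: {weight_gib(ASSUMED_PARAMS, 4):.2f} GiB")
    print(f"bf16 weights: {weight_gib(ASSUMED_PARAMS, 2):.2f} GiB")
```

Because bf16 keeps the same 8 exponent bits as fp32, very large and very small magnitudes survive the conversion; only the mantissa is shortened, which bounds the per-value relative rounding error at about 2^-8. Whether that is acceptable for a given workload is an empirical question, which is consistent with the README reserving the fp32 weights for strict numerical reproduction.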