Xingkai Yu
commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -76,7 +76,7 @@ The model structure of DeepSeek-V3.1-Terminus is the same as DeepSeek-V3. Please
|
|
| 76 |
|
| 77 |
For the model's chat template other than search agent, please refer to the [DeepSeek-V3.1](https://huggingface.co/deepseek-ai/DeepSeek-V3.1) repo.
|
| 78 |
|
| 79 |
-
Here we also provide an updated inference demo code in the `inference` folder to help the community get started with running our model and understand the details of model architecture
|
| 80 |
|
| 81 |
**NOTE: In the current model checkpoint, the parameters of `self_attn.o_proj` do not conform to the UE8M0 FP8 scale data format. This is a known issue and will be corrected in future model releases.**
|
| 82 |
|
|
|
|
| 76 |
|
| 77 |
For the model's chat template other than search agent, please refer to the [DeepSeek-V3.1](https://huggingface.co/deepseek-ai/DeepSeek-V3.1) repo.
|
| 78 |
|
| 79 |
+
**Here we also provide an updated inference demo code in the `inference` folder to help the community get started with running our model and understand the details of model architecture.**
|
| 80 |
|
| 81 |
**NOTE: In the current model checkpoint, the parameters of `self_attn.o_proj` do not conform to the UE8M0 FP8 scale data format. This is a known issue and will be corrected in future model releases.**
|
| 82 |
|