Update README.md
Browse files
README.md
CHANGED
|
@@ -1,8 +1,11 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
| 3 |
---
|
| 4 |
### huihui-ai/grok-2
|
| 5 |
-
This Python [script](https://huggingface.co/huihui-ai/grok-2/blob/main/convert_safetensors.py) is designed to process and merge sharded weight files
|
|
|
|
| 6 |
|
| 7 |
Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
|
| 8 |
|
|
@@ -35,4 +38,4 @@ Just a simple merge, without any inference code, and does not indicate whether t
|
|
| 35 |
- Ensure the input directory `xai-org/grok-2` contains valid `pytorch_model-*.safetensors` files.
|
| 36 |
- The script assumes a tensor parallelism degree of 8 (`tp_count = 8`). Modify the `tp_count` value in the script if needed.
|
| 37 |
- Memory requirements may be high; run on a machine with sufficient memory.
|
| 38 |
-
- If shards are missing or shapes mismatch, the script will print warnings and attempt to proceed.
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
base_model:
|
| 4 |
+
- xai-org/grok-2
|
| 5 |
---
|
| 6 |
### huihui-ai/grok-2
|
| 7 |
+
This Python [script](https://huggingface.co/huihui-ai/grok-2/blob/main/convert_safetensors.py) is designed to process and merge sharded weight files
|
| 8 |
+
(in safetensors format) for a machine learning model, specifically targeting the `[xai-org/grok-2](https://huggingface.co/xai-org/grok-2)` model. The main functionalities include:
|
| 9 |
|
| 10 |
Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
|
| 11 |
|
|
|
|
| 38 |
- Ensure the input directory `xai-org/grok-2` contains valid `pytorch_model-*.safetensors` files.
|
| 39 |
- The script assumes a tensor parallelism degree of 8 (`tp_count = 8`). Modify the `tp_count` value in the script if needed.
|
| 40 |
- Memory requirements may be high; run on a machine with sufficient memory.
|
| 41 |
+
- If shards are missing or shapes mismatch, the script will print warnings and attempt to proceed.
|