Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,8 @@ This Python [script](https://huggingface.co/huihui-ai/grok-2/blob/main/convert_s
|
|
9 |
|
10 |
Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
|
11 |
|
|
|
|
|
12 |
1. **Collecting safetensors files**: Locates all `pytorch_model-*.safetensors` files in the specified model directory.
|
13 |
2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
|
14 |
3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
|
|
|
9 |
|
10 |
Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
|
11 |
|
12 |
+
Now, do we need a custom MixtralForCausalLM?
|
13 |
+
|
14 |
1. **Collecting safetensors files**: Locates all `pytorch_model-*.safetensors` files in the specified model directory.
|
15 |
2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
|
16 |
3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
|