huihui-ai commited on
Commit
da2946f
·
verified ·
1 Parent(s): af2487a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -9,6 +9,8 @@ This Python [script](https://huggingface.co/huihui-ai/grok-2/blob/main/convert_s
9
 
10
  Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
11
 
 
 
12
  1. **Collecting safetensors files**: Locates all `pytorch_model-*.safetensors` files in the specified model directory.
13
  2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
14
  3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.
 
9
 
10
  Just a simple merge, without any inference code, and does not indicate whether the final model is reasonable or correct.
11
 
12
+ Now, do we need a custom MixtralForCausalLM?
13
+
14
  1. **Collecting safetensors files**: Locates all `pytorch_model-*.safetensors` files in the specified model directory.
15
  2. **Loading files into cache**: Loads all safetensors files into memory and builds a key-to-file mapping.
16
  3. **Merging Tensor Parallel (TP) shards**: Merges shards for tensor parallelism (TP=8) along specific dimensions and verifies the merged tensor shapes.