doberst committed on
Commit e791356 · verified · 1 Parent(s): 411c97d

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -9,8 +9,8 @@ tags: [green, llmware-chat, p3, onnx, qnn, emerald]
  # phi-3.5-onnx-qnn
 
  <!-- Provide a quick summary of what the model is/does. -->
-
- **phi-3.5-onnx-qnn** is an ONNX QNN int4 quantized version of [Microsoft Phi-3.5-mini-instruct](https://www.huggingface.co/microsoft/Phi-3.5-mini-instruct), providing an NPU inference implementation, optimized for AI PCs on ARM64 NPU.
+
+ **phi-3.5-onnx-qnn** is an ONNX QNN int4 quantized version of [Microsoft Phi-3.5-mini-instruct](https://www.huggingface.co/microsoft/Phi-3.5-mini-instruct), providing a small fast NPU inference implementation, optimized for NPU deployment on Windows ARM64 AI PCs with Snapdragon Elite X NPU processors.
 
 
  ### Model Description