pi5 eval added
README.md CHANGED
@@ -42,12 +42,18 @@ This is a fine-tuned version of **DeepSeek R1 Distill Qwen 1.5B**, optimized for
 - **File Format:** GGUF
 - **Released Version:** Q4_K_M.gguf

-
-
-
-| **
-| **
-| **

 ## Usage Instructions
 ### System Prompt
 - **File Format:** GGUF
 - **Released Version:** Q4_K_M.gguf

+| Metric | **3090 Ti** | **Raspberry Pi 5** |
+|-----------------|-------------------------------|-------------------------------|
+| **Prompt Eval Time** | 33.78 ms / 406 tokens (0.08 ms per token, 12017.88 tokens/sec) | 17831.25 ms / 535 tokens (33.33 ms per token, 30.00 tokens/sec) |
+| **Eval Time** | 7133.93 ms / 1694 tokens (4.21 ms per token, 237.46 tokens/sec) | 52006.54 ms / 529 tokens (98.31 ms per token, 10.17 tokens/sec) |
+| **Total Time** | 7167.72 ms / 2100 tokens | 70881.95 ms / 1064 tokens |
+| **Decoding Speed** | N/A | 529 tokens in 70.40s (7.51 tokens/sec) |
+| **Sampling Speed** | N/A | 149.33 ms / 530 runs (0.28 ms per token, 3549.26 tokens/sec) |
+
+### **Observations:**
+- The **3090 Ti** is about 400 times faster at prompt evaluation, handling **12017.88 tokens/sec** compared to **30.00 tokens/sec** on the **Pi 5**.
+- In token evaluation (generation), the **3090 Ti** manages **237.46 tokens/sec**, whereas the **Pi 5** achieves just **10.17 tokens/sec**, a gap of roughly 23 times.
+- The **Pi 5**'s total execution time (70.88s) is roughly ten times that of the **3090 Ti** (7.17s), even though it processes about half as many tokens (1064 vs. 2100).
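The per-token latencies and throughputs in the table can be re-derived from the raw elapsed times and token counts. The sketch below is only an illustration: the `RUNS` dictionary and the helper names are hypothetical and not part of the repository, and the numbers are copied from the table. Results may differ from the table in the last digits, since the reported times are already rounded.

```python
# Figures copied from the benchmark table: (elapsed milliseconds, token count).
# RUNS and the helper names below are illustrative, not part of the repository.
RUNS = {
    "3090 Ti": {"prompt_eval": (33.78, 406), "eval": (7133.93, 1694)},
    "Raspberry Pi 5": {"prompt_eval": (17831.25, 535), "eval": (52006.54, 529)},
}


def tokens_per_sec(elapsed_ms: float, tokens: int) -> float:
    """Throughput in tokens per second for a phase that took elapsed_ms."""
    return tokens / (elapsed_ms / 1000.0)


def ms_per_token(elapsed_ms: float, tokens: int) -> float:
    """Average latency per token in milliseconds."""
    return elapsed_ms / tokens


def speedup(phase: str, fast: str = "3090 Ti", slow: str = "Raspberry Pi 5") -> float:
    """Ratio of the two devices' throughputs for the same phase."""
    return tokens_per_sec(*RUNS[fast][phase]) / tokens_per_sec(*RUNS[slow][phase])


if __name__ == "__main__":
    for device, phases in RUNS.items():
        for phase, (ms, n) in phases.items():
            print(f"{device:15s} {phase:12s} "
                  f"{ms_per_token(ms, n):8.2f} ms/token "
                  f"{tokens_per_sec(ms, n):10.2f} tokens/sec")
    print(f"prompt eval speedup: {speedup('prompt_eval'):.0f}x")
    print(f"eval speedup:        {speedup('eval'):.1f}x")
```

Because the two runs used different prompt and generation lengths, the throughput ratios are a fairer comparison than the total times.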

 ## Usage Instructions
 ### System Prompt