Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# Quantization
|
2 |
Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
|
3 |
|
@@ -9,22 +20,15 @@ Steps to create:
|
|
9 |
1. `git clone https://github.com/LambdaLabsML/lambda-quant`
|
10 |
2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
|
11 |
3. `python quantize.py -m Qwen/QwQ-32B -q GPTQ-Int4`
|
|
|
12 |
## Evaluation
|
13 |
TODO
|
|
|
14 |
## Benchmarks
|
15 |
TODO
|
|
|
16 |
# Base Model README.md
|
17 |
-
|
18 |
-
license: apache-2.0
|
19 |
-
license_link: https://huggingface.co/Qwen/QWQ-32B/blob/main/LICENSE
|
20 |
-
language:
|
21 |
-
- en
|
22 |
-
pipeline_tag: text-generation
|
23 |
-
base_model: Qwen/Qwen2.5-32B
|
24 |
-
tags:
|
25 |
-
- chat
|
26 |
-
library_name: transformers
|
27 |
-
---
|
28 |
|
29 |
# QwQ-32B
|
30 |
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
license_link: https://huggingface.co/Qwen/QWQ-32B/blob/main/LICENSE
|
4 |
+
language:
|
5 |
+
- en
|
6 |
+
pipeline_tag: text-generation
|
7 |
+
base_model: Qwen/Qwen2.5-32B
|
8 |
+
tags:
|
9 |
+
- chat
|
10 |
+
library_name: transformers
|
11 |
+
---
|
12 |
# Quantization
|
13 |
Created with [lambda-quant](https://github.com/LambdaLabsML/lambda-quant/tree/f97108fe4a9ee061a7b969b23a9605a6d561863d) on `Python 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]`
|
14 |
|
|
|
20 |
1. `git clone https://github.com/LambdaLabsML/lambda-quant`
|
21 |
2. `git checkout f97108fe4a9ee061a7b969b23a9605a6d561863d`
|
22 |
3. `python quantize.py -m Qwen/QwQ-32B -q GPTQ-Int4`
|
23 |
+
|
24 |
## Evaluation
|
25 |
TODO
|
26 |
+
|
27 |
## Benchmarks
|
28 |
TODO
|
29 |
+
|
30 |
# Base Model README.md
|
31 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|
33 |
# QwQ-32B
|
34 |
|