cpatonn committed on
Commit 2e7b0b5 · verified · 1 Parent(s): 1348d1a

Update README.md

Files changed (1)
  1. README.md +16 -3
README.md CHANGED
@@ -1,3 +1,16 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ base_model:
+ - nvidia/OpenReasoning-Nemotron-7B
+ datasets:
+ - mit-han-lab/pile-val-backup
+ ---
+ # OpenReasoning-Nemotron-7B-AWQ
+
+ ## Method
+ Quantised using [vllm-project/llm-compressor](https://github.com/vllm-project/llm-compressor.git) and the following configs:
+ ```
+ recipe = [
+     AWQModifier(ignore=["lm_head"], scheme="W4A16_ASYM", targets=["Linear"]),
+ ]
+ ```
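
For readers unfamiliar with the scheme name in the recipe, the following is a minimal pure-Python sketch of what asymmetric 4-bit weight quantization (the "W4" and "ASYM" parts of `W4A16_ASYM`) does to one group of weights. It is an illustration only, not llm-compressor's implementation: the function names are hypothetical, the group size of 128 is an assumption about the preset, and the activation-aware channel scaling that AWQ applies before rounding is left out entirely. Activations stay in 16-bit floating point and are not touched here.

```python
def quantize_group_asym(group, num_bits=4):
    """Map one group of float weights to unsigned integer codes.

    Hypothetical helper: asymmetric means the group gets its own
    zero point, so the integer range covers [min, max] rather than
    a range centred on zero.
    """
    qmax = (1 << num_bits) - 1            # 15 for 4-bit codes
    lo = min(min(group), 0.0)             # keep 0.0 representable
    hi = max(max(group), 0.0)
    scale = (hi - lo) / qmax if hi > lo else 1.0
    zero_point = round(-lo / scale)       # integer code that maps to 0.0
    codes = [
        min(max(round(w / scale) + zero_point, 0), qmax)  # clamp to [0, qmax]
        for w in group
    ]
    return codes, scale, zero_point


def dequantize_group(codes, scale, zero_point):
    """Recover approximate float weights from the integer codes."""
    return [(q - zero_point) * scale for q in codes]


def quantize_row(row, group_size=128, num_bits=4):
    """Quantize a weight row group by group (group_size=128 is assumed)."""
    return [
        quantize_group_asym(row[i:i + group_size], num_bits)
        for i in range(0, len(row), group_size)
    ]


if __name__ == "__main__":
    weights = [-0.30, -0.10, 0.0, 0.05, 0.20, 0.45]
    codes, scale, zero_point = quantize_group_asym(weights)
    restored = dequantize_group(codes, scale, zero_point)
    print("codes:", codes)
    print("scale:", scale, "zero_point:", zero_point)
    print("max abs error:", max(abs(a - b) for a, b in zip(weights, restored)))
```

Each group stores its 4-bit codes plus one scale and one zero point, which is where the memory saving over 16-bit weights comes from. The rounding error per weight is at most half of the group's scale.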