DavidAU commited on
Commit
72a8b13
·
verified ·
1 Parent(s): f542bd6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -4
README.md CHANGED
@@ -39,11 +39,34 @@ and
39
 
40
  In one MASSSIVE 52B merge.
41
 
42
- In two configs...
43
 
44
- Quants Q4KS (two) uploading...
 
 
45
 
46
- Source / full quanting to follow.
47
 
48
- Model is fully operational at Q2k.
49
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
39
 
40
  In one MASSSIVE 52B merge.
41
 
42
+ In two configs... too.
43
 
44
+ CONFIGS:
45
+ - #1 -> Qwen2.5-Coder-32B-Instruct primary/start, with OlympicCoder-32B as "finalizer".
46
+ - #1 -> OlympicCoder-32B as primary/start, with Qwen2.5-Coder-32B-Instruct as "finalizer".
47
 
48
+ Quants Q4_K_S (two - one of each config) uploading...
49
 
50
+ Each config/version will be very different from each other.
51
 
52
+ Source(s) / full quanting to follow.
53
+
54
+ NOTE: Model is fully operational at Q2k.
55
+
56
+ ---
57
+
58
+ Config / Settings
59
+
60
+ ---
61
+
62
+ Model is set at 32k/32768 context for these GGUFS, full quants/full repos will be 128k/131072.
63
+
64
+ Requirements:
65
+ - Temp .5 to .7 (or lower)
66
+ - topk: 20, topp: .8, minp: .04
67
+ - rep pen: 1.1 (can be lower)
68
+ - Jinja Template (embedded) or CHATML template.
69
+
70
+ Refer to either "Qwen2.5-Coder-32B-Instruct" and/or "OlympicCoder-32B" for additional settings, benchmarks and usage.
71
+
72
+ ---