lbourdois committed · Commit c168bd7 · verified · 1 Parent(s): be6da1e

Improve language tag

Hi! Since the model is multilingual, this PR adds languages other than English to the language tag so the model is easier to find. Note that the README announces 29 languages but explicitly lists only 13, so I was only able to add those 13.

Files changed (1)
  1. README.md +500 -486
README.md CHANGED
@@ -1,487 +1,501 @@
1
- ---
2
- base_model:
3
- - Qwen/Qwen2.5-72B-Instruct
4
- library_name: transformers
5
- tags:
6
- - mergekit
7
- - merge
8
- license: other
9
- ---
10
-
11
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/654527ce2a13610acc25d921/LlM5gC_gmgUDCCO4MY8wx.png)
12
-
13
- # merge
14
-
15
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
16
-
17
- This model recieved no post merge retraining (yet) and minimal testing. Please contribute any feedback or evaluations of any kind via the community tab.
18
-
19
- # License
20
-
21
- Hippocratic License 3.0 + Ecocide module, + Extractive Industries module, + Copyleft
22
- [![Hippocratic License HL3-CL-ECO-EXTR](https://img.shields.io/static/v1?label=Hippocratic%20License&message=HL3-CL-ECO-EXTR&labelColor=5e2751&color=bc8c3d)](https://firstdonoharm.dev/version/3/0/cl-eco-extr.html)
23
- https://firstdonoharm.dev/version/3/0/cl-eco-extr.txt
24
-
25
- ## Merge Details
26
- ### Merge Method
27
-
28
- This model was merged using the passthrough merge method. Every layer is doubled in order, from Qwen/Qwen2.5-72B-Instruct, with the MLP layers + 3 output layers only copied once, creating 132B parameters. No additional fine-tune has been done in this merged model.
29
-
30
- ### Models Merged
31
-
32
- The following models were included in the merge:
33
- * [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
34
-
35
- ### Configuration
36
-
37
- The following YAML configuration was used to produce this model:
38
-
39
- ```yaml
40
- slices:
41
- - sources:
42
- - model: Qwen/Qwen2.5-72B-Instruct
43
- layer_range: [0, 4]
44
- - sources:
45
- - model: Qwen/Qwen2.5-72B-Instruct
46
- layer_range: [4, 5]
47
- - sources:
48
- - model: Qwen/Qwen2.5-72B-Instruct
49
- layer_range: [4, 5]
50
- - sources:
51
- - model: Qwen/Qwen2.5-72B-Instruct
52
- layer_range: [5, 6]
53
- - sources:
54
- - model: Qwen/Qwen2.5-72B-Instruct
55
- layer_range: [5, 6]
56
- - sources:
57
- - model: Qwen/Qwen2.5-72B-Instruct
58
- layer_range: [6, 7]
59
- - sources:
60
- - model: Qwen/Qwen2.5-72B-Instruct
61
- layer_range: [6, 7]
62
- - sources:
63
- - model: Qwen/Qwen2.5-72B-Instruct
64
- layer_range: [7, 8]
65
- - sources:
66
- - model: Qwen/Qwen2.5-72B-Instruct
67
- layer_range: [7, 8]
68
- - sources:
69
- - model: Qwen/Qwen2.5-72B-Instruct
70
- layer_range: [8, 9]
71
- - sources:
72
- - model: Qwen/Qwen2.5-72B-Instruct
73
- layer_range: [8, 9]
74
- - sources:
75
- - model: Qwen/Qwen2.5-72B-Instruct
76
- layer_range: [9, 10]
77
- - sources:
78
- - model: Qwen/Qwen2.5-72B-Instruct
79
- layer_range: [9, 10]
80
- - sources:
81
- - model: Qwen/Qwen2.5-72B-Instruct
82
- layer_range: [10, 11]
83
- - sources:
84
- - model: Qwen/Qwen2.5-72B-Instruct
85
- layer_range: [10, 11]
86
- - sources:
87
- - model: Qwen/Qwen2.5-72B-Instruct
88
- layer_range: [11, 12]
89
- - sources:
90
- - model: Qwen/Qwen2.5-72B-Instruct
91
- layer_range: [11, 12]
92
- - sources:
93
- - model: Qwen/Qwen2.5-72B-Instruct
94
- layer_range: [12, 13]
95
- - sources:
96
- - model: Qwen/Qwen2.5-72B-Instruct
97
- layer_range: [12, 13]
98
- - sources:
99
- - model: Qwen/Qwen2.5-72B-Instruct
100
- layer_range: [13, 14]
101
- - sources:
102
- - model: Qwen/Qwen2.5-72B-Instruct
103
- layer_range: [13, 14]
104
- - sources:
105
- - model: Qwen/Qwen2.5-72B-Instruct
106
- layer_range: [14, 15]
107
- - sources:
108
- - model: Qwen/Qwen2.5-72B-Instruct
109
- layer_range: [14, 15]
110
- - sources:
111
- - model: Qwen/Qwen2.5-72B-Instruct
112
- layer_range: [15, 16]
113
- - sources:
114
- - model: Qwen/Qwen2.5-72B-Instruct
115
- layer_range: [15, 16]
116
- - sources:
117
- - model: Qwen/Qwen2.5-72B-Instruct
118
- layer_range: [16, 17]
119
- - sources:
120
- - model: Qwen/Qwen2.5-72B-Instruct
121
- layer_range: [16, 17]
122
- - sources:
123
- - model: Qwen/Qwen2.5-72B-Instruct
124
- layer_range: [17, 18]
125
- - sources:
126
- - model: Qwen/Qwen2.5-72B-Instruct
127
- layer_range: [17, 18]
128
- - sources:
129
- - model: Qwen/Qwen2.5-72B-Instruct
130
- layer_range: [18, 19]
131
- - sources:
132
- - model: Qwen/Qwen2.5-72B-Instruct
133
- layer_range: [18, 19]
134
- - sources:
135
- - model: Qwen/Qwen2.5-72B-Instruct
136
- layer_range: [19, 20]
137
- - sources:
138
- - model: Qwen/Qwen2.5-72B-Instruct
139
- layer_range: [19, 20]
140
- - sources:
141
- - model: Qwen/Qwen2.5-72B-Instruct
142
- layer_range: [20, 21]
143
- - sources:
144
- - model: Qwen/Qwen2.5-72B-Instruct
145
- layer_range: [20, 21]
146
- - sources:
147
- - model: Qwen/Qwen2.5-72B-Instruct
148
- layer_range: [21, 22]
149
- - sources:
150
- - model: Qwen/Qwen2.5-72B-Instruct
151
- layer_range: [21, 22]
152
- - sources:
153
- - model: Qwen/Qwen2.5-72B-Instruct
154
- layer_range: [22, 23]
155
- - sources:
156
- - model: Qwen/Qwen2.5-72B-Instruct
157
- layer_range: [22, 23]
158
- - sources:
159
- - model: Qwen/Qwen2.5-72B-Instruct
160
- layer_range: [23, 24]
161
- - sources:
162
- - model: Qwen/Qwen2.5-72B-Instruct
163
- layer_range: [23, 24]
164
- - sources:
165
- - model: Qwen/Qwen2.5-72B-Instruct
166
- layer_range: [24, 25]
167
- - sources:
168
- - model: Qwen/Qwen2.5-72B-Instruct
169
- layer_range: [24, 25]
170
- - sources:
171
- - model: Qwen/Qwen2.5-72B-Instruct
172
- layer_range: [25, 26]
173
- - sources:
174
- - model: Qwen/Qwen2.5-72B-Instruct
175
- layer_range: [25, 26]
176
- - sources:
177
- - model: Qwen/Qwen2.5-72B-Instruct
178
- layer_range: [26, 27]
179
- - sources:
180
- - model: Qwen/Qwen2.5-72B-Instruct
181
- layer_range: [26, 27]
182
- - sources:
183
- - model: Qwen/Qwen2.5-72B-Instruct
184
- layer_range: [27, 28]
185
- - sources:
186
- - model: Qwen/Qwen2.5-72B-Instruct
187
- layer_range: [27, 28]
188
- - sources:
189
- - model: Qwen/Qwen2.5-72B-Instruct
190
- layer_range: [28, 29]
191
- - sources:
192
- - model: Qwen/Qwen2.5-72B-Instruct
193
- layer_range: [28, 29]
194
- - sources:
195
- - model: Qwen/Qwen2.5-72B-Instruct
196
- layer_range: [29, 30]
197
- - sources:
198
- - model: Qwen/Qwen2.5-72B-Instruct
199
- layer_range: [29, 30]
200
- - sources:
201
- - model: Qwen/Qwen2.5-72B-Instruct
202
- layer_range: [30, 31]
203
- - sources:
204
- - model: Qwen/Qwen2.5-72B-Instruct
205
- layer_range: [30, 31]
206
- - sources:
207
- - model: Qwen/Qwen2.5-72B-Instruct
208
- layer_range: [31, 32]
209
- - sources:
210
- - model: Qwen/Qwen2.5-72B-Instruct
211
- layer_range: [31, 32]
212
- - sources:
213
- - model: Qwen/Qwen2.5-72B-Instruct
214
- layer_range: [32, 33]
215
- - sources:
216
- - model: Qwen/Qwen2.5-72B-Instruct
217
- layer_range: [32, 33]
218
- - sources:
219
- - model: Qwen/Qwen2.5-72B-Instruct
220
- layer_range: [33, 34]
221
- - sources:
222
- - model: Qwen/Qwen2.5-72B-Instruct
223
- layer_range: [33, 34]
224
- - sources:
225
- - model: Qwen/Qwen2.5-72B-Instruct
226
- layer_range: [34, 35]
227
- - sources:
228
- - model: Qwen/Qwen2.5-72B-Instruct
229
- layer_range: [34, 35]
230
- - sources:
231
- - model: Qwen/Qwen2.5-72B-Instruct
232
- layer_range: [35, 36]
233
- - sources:
234
- - model: Qwen/Qwen2.5-72B-Instruct
235
- layer_range: [35, 36]
236
- - sources:
237
- - model: Qwen/Qwen2.5-72B-Instruct
238
- layer_range: [36, 37]
239
- - sources:
240
- - model: Qwen/Qwen2.5-72B-Instruct
241
- layer_range: [36, 37]
242
- - sources:
243
- - model: Qwen/Qwen2.5-72B-Instruct
244
- layer_range: [37, 38]
245
- - sources:
246
- - model: Qwen/Qwen2.5-72B-Instruct
247
- layer_range: [37, 38]
248
- - sources:
249
- - model: Qwen/Qwen2.5-72B-Instruct
250
- layer_range: [38, 39]
251
- - sources:
252
- - model: Qwen/Qwen2.5-72B-Instruct
253
- layer_range: [38, 39]
254
- - sources:
255
- - model: Qwen/Qwen2.5-72B-Instruct
256
- layer_range: [39, 40]
257
- - sources:
258
- - model: Qwen/Qwen2.5-72B-Instruct
259
- layer_range: [39, 40]
260
- - sources:
261
- - model: Qwen/Qwen2.5-72B-Instruct
262
- layer_range: [40, 41]
263
- - sources:
264
- - model: Qwen/Qwen2.5-72B-Instruct
265
- layer_range: [40, 41]
266
- - sources:
267
- - model: Qwen/Qwen2.5-72B-Instruct
268
- layer_range: [41, 42]
269
- - sources:
270
- - model: Qwen/Qwen2.5-72B-Instruct
271
- layer_range: [41, 42]
272
- - sources:
273
- - model: Qwen/Qwen2.5-72B-Instruct
274
- layer_range: [42, 43]
275
- - sources:
276
- - model: Qwen/Qwen2.5-72B-Instruct
277
- layer_range: [42, 43]
278
- - sources:
279
- - model: Qwen/Qwen2.5-72B-Instruct
280
- layer_range: [43, 44]
281
- - sources:
282
- - model: Qwen/Qwen2.5-72B-Instruct
283
- layer_range: [43, 44]
284
- - sources:
285
- - model: Qwen/Qwen2.5-72B-Instruct
286
- layer_range: [44, 45]
287
- - sources:
288
- - model: Qwen/Qwen2.5-72B-Instruct
289
- layer_range: [44, 45]
290
- - sources:
291
- - model: Qwen/Qwen2.5-72B-Instruct
292
- layer_range: [45, 46]
293
- - sources:
294
- - model: Qwen/Qwen2.5-72B-Instruct
295
- layer_range: [45, 46]
296
- - sources:
297
- - model: Qwen/Qwen2.5-72B-Instruct
298
- layer_range: [46, 47]
299
- - sources:
300
- - model: Qwen/Qwen2.5-72B-Instruct
301
- layer_range: [46, 47]
302
- - sources:
303
- - model: Qwen/Qwen2.5-72B-Instruct
304
- layer_range: [47, 48]
305
- - sources:
306
- - model: Qwen/Qwen2.5-72B-Instruct
307
- layer_range: [47, 48]
308
- - sources:
309
- - model: Qwen/Qwen2.5-72B-Instruct
310
- layer_range: [48, 49]
311
- - sources:
312
- - model: Qwen/Qwen2.5-72B-Instruct
313
- layer_range: [48, 49]
314
- - sources:
315
- - model: Qwen/Qwen2.5-72B-Instruct
316
- layer_range: [49, 50]
317
- - sources:
318
- - model: Qwen/Qwen2.5-72B-Instruct
319
- layer_range: [49, 50]
320
- - sources:
321
- - model: Qwen/Qwen2.5-72B-Instruct
322
- layer_range: [50, 51]
323
- - sources:
324
- - model: Qwen/Qwen2.5-72B-Instruct
325
- layer_range: [50, 51]
326
- - sources:
327
- - model: Qwen/Qwen2.5-72B-Instruct
328
- layer_range: [51, 52]
329
- - sources:
330
- - model: Qwen/Qwen2.5-72B-Instruct
331
- layer_range: [51, 52]
332
- - sources:
333
- - model: Qwen/Qwen2.5-72B-Instruct
334
- layer_range: [52, 53]
335
- - sources:
336
- - model: Qwen/Qwen2.5-72B-Instruct
337
- layer_range: [52, 53]
338
- - sources:
339
- - model: Qwen/Qwen2.5-72B-Instruct
340
- layer_range: [53, 54]
341
- - sources:
342
- - model: Qwen/Qwen2.5-72B-Instruct
343
- layer_range: [53, 54]
344
- - sources:
345
- - model: Qwen/Qwen2.5-72B-Instruct
346
- layer_range: [54, 55]
347
- - sources:
348
- - model: Qwen/Qwen2.5-72B-Instruct
349
- layer_range: [54, 55]
350
- - sources:
351
- - model: Qwen/Qwen2.5-72B-Instruct
352
- layer_range: [55, 56]
353
- - sources:
354
- - model: Qwen/Qwen2.5-72B-Instruct
355
- layer_range: [55, 56]
356
- - sources:
357
- - model: Qwen/Qwen2.5-72B-Instruct
358
- layer_range: [56, 57]
359
- - sources:
360
- - model: Qwen/Qwen2.5-72B-Instruct
361
- layer_range: [56, 57]
362
- - sources:
363
- - model: Qwen/Qwen2.5-72B-Instruct
364
- layer_range: [57, 58]
365
- - sources:
366
- - model: Qwen/Qwen2.5-72B-Instruct
367
- layer_range: [57, 58]
368
- - sources:
369
- - model: Qwen/Qwen2.5-72B-Instruct
370
- layer_range: [58, 59]
371
- - sources:
372
- - model: Qwen/Qwen2.5-72B-Instruct
373
- layer_range: [58, 59]
374
- - sources:
375
- - model: Qwen/Qwen2.5-72B-Instruct
376
- layer_range: [59, 60]
377
- - sources:
378
- - model: Qwen/Qwen2.5-72B-Instruct
379
- layer_range: [59, 60]
380
- - sources:
381
- - model: Qwen/Qwen2.5-72B-Instruct
382
- layer_range: [60, 61]
383
- - sources:
384
- - model: Qwen/Qwen2.5-72B-Instruct
385
- layer_range: [60, 61]
386
- - sources:
387
- - model: Qwen/Qwen2.5-72B-Instruct
388
- layer_range: [61, 62]
389
- - sources:
390
- - model: Qwen/Qwen2.5-72B-Instruct
391
- layer_range: [61, 62]
392
- - sources:
393
- - model: Qwen/Qwen2.5-72B-Instruct
394
- layer_range: [62, 63]
395
- - sources:
396
- - model: Qwen/Qwen2.5-72B-Instruct
397
- layer_range: [62, 63]
398
- - sources:
399
- - model: Qwen/Qwen2.5-72B-Instruct
400
- layer_range: [63, 64]
401
- - sources:
402
- - model: Qwen/Qwen2.5-72B-Instruct
403
- layer_range: [63, 64]
404
- - sources:
405
- - model: Qwen/Qwen2.5-72B-Instruct
406
- layer_range: [64, 65]
407
- - sources:
408
- - model: Qwen/Qwen2.5-72B-Instruct
409
- layer_range: [64, 65]
410
- - sources:
411
- - model: Qwen/Qwen2.5-72B-Instruct
412
- layer_range: [65, 66]
413
- - sources:
414
- - model: Qwen/Qwen2.5-72B-Instruct
415
- layer_range: [65, 66]
416
- - sources:
417
- - model: Qwen/Qwen2.5-72B-Instruct
418
- layer_range: [66, 67]
419
- - sources:
420
- - model: Qwen/Qwen2.5-72B-Instruct
421
- layer_range: [66, 67]
422
- - sources:
423
- - model: Qwen/Qwen2.5-72B-Instruct
424
- layer_range: [67, 68]
425
- - sources:
426
- - model: Qwen/Qwen2.5-72B-Instruct
427
- layer_range: [67, 68]
428
- - sources:
429
- - model: Qwen/Qwen2.5-72B-Instruct
430
- layer_range: [68, 69]
431
- - sources:
432
- - model: Qwen/Qwen2.5-72B-Instruct
433
- layer_range: [68, 69]
434
- - sources:
435
- - model: Qwen/Qwen2.5-72B-Instruct
436
- layer_range: [69, 70]
437
- - sources:
438
- - model: Qwen/Qwen2.5-72B-Instruct
439
- layer_range: [69, 70]
440
- - sources:
441
- - model: Qwen/Qwen2.5-72B-Instruct
442
- layer_range: [70, 71]
443
- - sources:
444
- - model: Qwen/Qwen2.5-72B-Instruct
445
- layer_range: [70, 71]
446
- - sources:
447
- - model: Qwen/Qwen2.5-72B-Instruct
448
- layer_range: [71, 72]
449
- - sources:
450
- - model: Qwen/Qwen2.5-72B-Instruct
451
- layer_range: [71, 72]
452
- - sources:
453
- - model: Qwen/Qwen2.5-72B-Instruct
454
- layer_range: [72, 73]
455
- - sources:
456
- - model: Qwen/Qwen2.5-72B-Instruct
457
- layer_range: [72, 73]
458
- - sources:
459
- - model: Qwen/Qwen2.5-72B-Instruct
460
- layer_range: [73, 74]
461
- - sources:
462
- - model: Qwen/Qwen2.5-72B-Instruct
463
- layer_range: [73, 74]
464
- - sources:
465
- - model: Qwen/Qwen2.5-72B-Instruct
466
- layer_range: [74, 75]
467
- - sources:
468
- - model: Qwen/Qwen2.5-72B-Instruct
469
- layer_range: [74, 75]
470
- - sources:
471
- - model: Qwen/Qwen2.5-72B-Instruct
472
- layer_range: [75, 76]
473
- - sources:
474
- - model: Qwen/Qwen2.5-72B-Instruct
475
- layer_range: [75, 76]
476
- - sources:
477
- - model: Qwen/Qwen2.5-72B-Instruct
478
- layer_range: [76, 77]
479
- - sources:
480
- - model: Qwen/Qwen2.5-72B-Instruct
481
- layer_range: [76, 77]
482
- - sources:
483
- - model: Qwen/Qwen2.5-72B-Instruct
484
- layer_range: [77, 80]
485
- merge_method: passthrough
486
- dtype: float16
487
  ```
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-72B-Instruct
4
+ library_name: transformers
5
+ tags:
6
+ - mergekit
7
+ - merge
8
+ license: other
9
+ language:
10
+ - zho
11
+ - eng
12
+ - fra
13
+ - spa
14
+ - por
15
+ - deu
16
+ - ita
17
+ - rus
18
+ - jpn
19
+ - kor
20
+ - vie
21
+ - tha
22
+ - ara
23
+ ---
24
+
25
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/654527ce2a13610acc25d921/LlM5gC_gmgUDCCO4MY8wx.png)
26
+
27
+ # merge
28
+
29
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
30
+
31
+ This model recieved no post merge retraining (yet) and minimal testing. Please contribute any feedback or evaluations of any kind via the community tab.
32
+
33
+ # License
34
+
35
+ Hippocratic License 3.0 + Ecocide module, + Extractive Industries module, + Copyleft
36
+ [![Hippocratic License HL3-CL-ECO-EXTR](https://img.shields.io/static/v1?label=Hippocratic%20License&message=HL3-CL-ECO-EXTR&labelColor=5e2751&color=bc8c3d)](https://firstdonoharm.dev/version/3/0/cl-eco-extr.html)
37
+ https://firstdonoharm.dev/version/3/0/cl-eco-extr.txt
38
+
39
+ ## Merge Details
40
+ ### Merge Method
41
+
42
+ This model was merged using the passthrough merge method. Every layer is doubled in order, from Qwen/Qwen2.5-72B-Instruct, with the MLP layers + 3 output layers only copied once, creating 132B parameters. No additional fine-tune has been done in this merged model.
43
+
44
+ ### Models Merged
45
+
46
+ The following models were included in the merge:
47
+ * [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
48
+
49
+ ### Configuration
50
+
51
+ The following YAML configuration was used to produce this model:
52
+
53
+ ```yaml
54
+ slices:
55
+ - sources:
56
+ - model: Qwen/Qwen2.5-72B-Instruct
57
+ layer_range: [0, 4]
58
+ - sources:
59
+ - model: Qwen/Qwen2.5-72B-Instruct
60
+ layer_range: [4, 5]
61
+ - sources:
62
+ - model: Qwen/Qwen2.5-72B-Instruct
63
+ layer_range: [4, 5]
64
+ - sources:
65
+ - model: Qwen/Qwen2.5-72B-Instruct
66
+ layer_range: [5, 6]
67
+ - sources:
68
+ - model: Qwen/Qwen2.5-72B-Instruct
69
+ layer_range: [5, 6]
70
+ - sources:
71
+ - model: Qwen/Qwen2.5-72B-Instruct
72
+ layer_range: [6, 7]
73
+ - sources:
74
+ - model: Qwen/Qwen2.5-72B-Instruct
75
+ layer_range: [6, 7]
76
+ - sources:
77
+ - model: Qwen/Qwen2.5-72B-Instruct
78
+ layer_range: [7, 8]
79
+ - sources:
80
+ - model: Qwen/Qwen2.5-72B-Instruct
81
+ layer_range: [7, 8]
82
+ - sources:
83
+ - model: Qwen/Qwen2.5-72B-Instruct
84
+ layer_range: [8, 9]
85
+ - sources:
86
+ - model: Qwen/Qwen2.5-72B-Instruct
87
+ layer_range: [8, 9]
88
+ - sources:
89
+ - model: Qwen/Qwen2.5-72B-Instruct
90
+ layer_range: [9, 10]
91
+ - sources:
92
+ - model: Qwen/Qwen2.5-72B-Instruct
93
+ layer_range: [9, 10]
94
+ - sources:
95
+ - model: Qwen/Qwen2.5-72B-Instruct
96
+ layer_range: [10, 11]
97
+ - sources:
98
+ - model: Qwen/Qwen2.5-72B-Instruct
99
+ layer_range: [10, 11]
100
+ - sources:
101
+ - model: Qwen/Qwen2.5-72B-Instruct
102
+ layer_range: [11, 12]
103
+ - sources:
104
+ - model: Qwen/Qwen2.5-72B-Instruct
105
+ layer_range: [11, 12]
106
+ - sources:
107
+ - model: Qwen/Qwen2.5-72B-Instruct
108
+ layer_range: [12, 13]
109
+ - sources:
110
+ - model: Qwen/Qwen2.5-72B-Instruct
111
+ layer_range: [12, 13]
112
+ - sources:
113
+ - model: Qwen/Qwen2.5-72B-Instruct
114
+ layer_range: [13, 14]
115
+ - sources:
116
+ - model: Qwen/Qwen2.5-72B-Instruct
117
+ layer_range: [13, 14]
118
+ - sources:
119
+ - model: Qwen/Qwen2.5-72B-Instruct
120
+ layer_range: [14, 15]
121
+ - sources:
122
+ - model: Qwen/Qwen2.5-72B-Instruct
123
+ layer_range: [14, 15]
124
+ - sources:
125
+ - model: Qwen/Qwen2.5-72B-Instruct
126
+ layer_range: [15, 16]
127
+ - sources:
128
+ - model: Qwen/Qwen2.5-72B-Instruct
129
+ layer_range: [15, 16]
130
+ - sources:
131
+ - model: Qwen/Qwen2.5-72B-Instruct
132
+ layer_range: [16, 17]
133
+ - sources:
134
+ - model: Qwen/Qwen2.5-72B-Instruct
135
+ layer_range: [16, 17]
136
+ - sources:
137
+ - model: Qwen/Qwen2.5-72B-Instruct
138
+ layer_range: [17, 18]
139
+ - sources:
140
+ - model: Qwen/Qwen2.5-72B-Instruct
141
+ layer_range: [17, 18]
142
+ - sources:
143
+ - model: Qwen/Qwen2.5-72B-Instruct
144
+ layer_range: [18, 19]
145
+ - sources:
146
+ - model: Qwen/Qwen2.5-72B-Instruct
147
+ layer_range: [18, 19]
148
+ - sources:
149
+ - model: Qwen/Qwen2.5-72B-Instruct
150
+ layer_range: [19, 20]
151
+ - sources:
152
+ - model: Qwen/Qwen2.5-72B-Instruct
153
+ layer_range: [19, 20]
154
+ - sources:
155
+ - model: Qwen/Qwen2.5-72B-Instruct
156
+ layer_range: [20, 21]
157
+ - sources:
158
+ - model: Qwen/Qwen2.5-72B-Instruct
159
+ layer_range: [20, 21]
160
+ - sources:
161
+ - model: Qwen/Qwen2.5-72B-Instruct
162
+ layer_range: [21, 22]
163
+ - sources:
164
+ - model: Qwen/Qwen2.5-72B-Instruct
165
+ layer_range: [21, 22]
166
+ - sources:
167
+ - model: Qwen/Qwen2.5-72B-Instruct
168
+ layer_range: [22, 23]
169
+ - sources:
170
+ - model: Qwen/Qwen2.5-72B-Instruct
171
+ layer_range: [22, 23]
172
+ - sources:
173
+ - model: Qwen/Qwen2.5-72B-Instruct
174
+ layer_range: [23, 24]
175
+ - sources:
176
+ - model: Qwen/Qwen2.5-72B-Instruct
177
+ layer_range: [23, 24]
178
+ - sources:
179
+ - model: Qwen/Qwen2.5-72B-Instruct
180
+ layer_range: [24, 25]
181
+ - sources:
182
+ - model: Qwen/Qwen2.5-72B-Instruct
183
+ layer_range: [24, 25]
184
+ - sources:
185
+ - model: Qwen/Qwen2.5-72B-Instruct
186
+ layer_range: [25, 26]
187
+ - sources:
188
+ - model: Qwen/Qwen2.5-72B-Instruct
189
+ layer_range: [25, 26]
190
+ - sources:
191
+ - model: Qwen/Qwen2.5-72B-Instruct
192
+ layer_range: [26, 27]
193
+ - sources:
194
+ - model: Qwen/Qwen2.5-72B-Instruct
195
+ layer_range: [26, 27]
196
+ - sources:
197
+ - model: Qwen/Qwen2.5-72B-Instruct
198
+ layer_range: [27, 28]
199
+ - sources:
200
+ - model: Qwen/Qwen2.5-72B-Instruct
201
+ layer_range: [27, 28]
202
+ - sources:
203
+ - model: Qwen/Qwen2.5-72B-Instruct
204
+ layer_range: [28, 29]
205
+ - sources:
206
+ - model: Qwen/Qwen2.5-72B-Instruct
207
+ layer_range: [28, 29]
208
+ - sources:
209
+ - model: Qwen/Qwen2.5-72B-Instruct
210
+ layer_range: [29, 30]
211
+ - sources:
212
+ - model: Qwen/Qwen2.5-72B-Instruct
213
+ layer_range: [29, 30]
214
+ - sources:
215
+ - model: Qwen/Qwen2.5-72B-Instruct
216
+ layer_range: [30, 31]
217
+ - sources:
218
+ - model: Qwen/Qwen2.5-72B-Instruct
219
+ layer_range: [30, 31]
220
+ - sources:
221
+ - model: Qwen/Qwen2.5-72B-Instruct
222
+ layer_range: [31, 32]
223
+ - sources:
224
+ - model: Qwen/Qwen2.5-72B-Instruct
225
+ layer_range: [31, 32]
226
+ - sources:
227
+ - model: Qwen/Qwen2.5-72B-Instruct
228
+ layer_range: [32, 33]
229
+ - sources:
230
+ - model: Qwen/Qwen2.5-72B-Instruct
231
+ layer_range: [32, 33]
232
+ - sources:
233
+ - model: Qwen/Qwen2.5-72B-Instruct
234
+ layer_range: [33, 34]
235
+ - sources:
236
+ - model: Qwen/Qwen2.5-72B-Instruct
237
+ layer_range: [33, 34]
238
+ - sources:
239
+ - model: Qwen/Qwen2.5-72B-Instruct
240
+ layer_range: [34, 35]
241
+ - sources:
242
+ - model: Qwen/Qwen2.5-72B-Instruct
243
+ layer_range: [34, 35]
244
+ - sources:
245
+ - model: Qwen/Qwen2.5-72B-Instruct
246
+ layer_range: [35, 36]
247
+ - sources:
248
+ - model: Qwen/Qwen2.5-72B-Instruct
249
+ layer_range: [35, 36]
250
+ - sources:
251
+ - model: Qwen/Qwen2.5-72B-Instruct
252
+ layer_range: [36, 37]
253
+ - sources:
254
+ - model: Qwen/Qwen2.5-72B-Instruct
255
+ layer_range: [36, 37]
256
+ - sources:
257
+ - model: Qwen/Qwen2.5-72B-Instruct
258
+ layer_range: [37, 38]
259
+ - sources:
260
+ - model: Qwen/Qwen2.5-72B-Instruct
261
+ layer_range: [37, 38]
262
+ - sources:
263
+ - model: Qwen/Qwen2.5-72B-Instruct
264
+ layer_range: [38, 39]
265
+ - sources:
266
+ - model: Qwen/Qwen2.5-72B-Instruct
267
+ layer_range: [38, 39]
268
+ - sources:
269
+ - model: Qwen/Qwen2.5-72B-Instruct
270
+ layer_range: [39, 40]
271
+ - sources:
272
+ - model: Qwen/Qwen2.5-72B-Instruct
273
+ layer_range: [39, 40]
274
+ - sources:
275
+ - model: Qwen/Qwen2.5-72B-Instruct
276
+ layer_range: [40, 41]
277
+ - sources:
278
+ - model: Qwen/Qwen2.5-72B-Instruct
279
+ layer_range: [40, 41]
280
+ - sources:
281
+ - model: Qwen/Qwen2.5-72B-Instruct
282
+ layer_range: [41, 42]
283
+ - sources:
284
+ - model: Qwen/Qwen2.5-72B-Instruct
285
+ layer_range: [41, 42]
286
+ - sources:
287
+ - model: Qwen/Qwen2.5-72B-Instruct
288
+ layer_range: [42, 43]
289
+ - sources:
290
+ - model: Qwen/Qwen2.5-72B-Instruct
291
+ layer_range: [42, 43]
292
+ - sources:
293
+ - model: Qwen/Qwen2.5-72B-Instruct
294
+ layer_range: [43, 44]
295
+ - sources:
296
+ - model: Qwen/Qwen2.5-72B-Instruct
297
+ layer_range: [43, 44]
298
+ - sources:
299
+ - model: Qwen/Qwen2.5-72B-Instruct
300
+ layer_range: [44, 45]
301
+ - sources:
302
+ - model: Qwen/Qwen2.5-72B-Instruct
303
+ layer_range: [44, 45]
304
+ - sources:
305
+ - model: Qwen/Qwen2.5-72B-Instruct
306
+ layer_range: [45, 46]
307
+ - sources:
308
+ - model: Qwen/Qwen2.5-72B-Instruct
309
+ layer_range: [45, 46]
310
+ - sources:
311
+ - model: Qwen/Qwen2.5-72B-Instruct
312
+ layer_range: [46, 47]
313
+ - sources:
314
+ - model: Qwen/Qwen2.5-72B-Instruct
315
+ layer_range: [46, 47]
316
+ - sources:
317
+ - model: Qwen/Qwen2.5-72B-Instruct
318
+ layer_range: [47, 48]
319
+ - sources:
320
+ - model: Qwen/Qwen2.5-72B-Instruct
321
+ layer_range: [47, 48]
322
+ - sources:
323
+ - model: Qwen/Qwen2.5-72B-Instruct
324
+ layer_range: [48, 49]
325
+ - sources:
326
+ - model: Qwen/Qwen2.5-72B-Instruct
327
+ layer_range: [48, 49]
328
+ - sources:
329
+ - model: Qwen/Qwen2.5-72B-Instruct
330
+ layer_range: [49, 50]
331
+ - sources:
332
+ - model: Qwen/Qwen2.5-72B-Instruct
333
+ layer_range: [49, 50]
334
+ - sources:
335
+ - model: Qwen/Qwen2.5-72B-Instruct
336
+ layer_range: [50, 51]
337
+ - sources:
338
+ - model: Qwen/Qwen2.5-72B-Instruct
339
+ layer_range: [50, 51]
340
+ - sources:
341
+ - model: Qwen/Qwen2.5-72B-Instruct
342
+ layer_range: [51, 52]
343
+ - sources:
344
+ - model: Qwen/Qwen2.5-72B-Instruct
345
+ layer_range: [51, 52]
346
+ - sources:
347
+ - model: Qwen/Qwen2.5-72B-Instruct
348
+ layer_range: [52, 53]
349
+ - sources:
350
+ - model: Qwen/Qwen2.5-72B-Instruct
351
+ layer_range: [52, 53]
352
+ - sources:
353
+ - model: Qwen/Qwen2.5-72B-Instruct
354
+ layer_range: [53, 54]
355
+ - sources:
356
+ - model: Qwen/Qwen2.5-72B-Instruct
357
+ layer_range: [53, 54]
358
+ - sources:
359
+ - model: Qwen/Qwen2.5-72B-Instruct
360
+ layer_range: [54, 55]
361
+ - sources:
362
+ - model: Qwen/Qwen2.5-72B-Instruct
363
+ layer_range: [54, 55]
364
+ - sources:
365
+ - model: Qwen/Qwen2.5-72B-Instruct
366
+ layer_range: [55, 56]
367
+ - sources:
368
+ - model: Qwen/Qwen2.5-72B-Instruct
369
+ layer_range: [55, 56]
370
+ - sources:
371
+ - model: Qwen/Qwen2.5-72B-Instruct
372
+ layer_range: [56, 57]
373
+ - sources:
374
+ - model: Qwen/Qwen2.5-72B-Instruct
375
+ layer_range: [56, 57]
376
+ - sources:
377
+ - model: Qwen/Qwen2.5-72B-Instruct
378
+ layer_range: [57, 58]
379
+ - sources:
380
+ - model: Qwen/Qwen2.5-72B-Instruct
381
+ layer_range: [57, 58]
382
+ - sources:
383
+ - model: Qwen/Qwen2.5-72B-Instruct
384
+ layer_range: [58, 59]
385
+ - sources:
386
+ - model: Qwen/Qwen2.5-72B-Instruct
387
+ layer_range: [58, 59]
388
+ - sources:
389
+ - model: Qwen/Qwen2.5-72B-Instruct
390
+ layer_range: [59, 60]
391
+ - sources:
392
+ - model: Qwen/Qwen2.5-72B-Instruct
393
+ layer_range: [59, 60]
394
+ - sources:
395
+ - model: Qwen/Qwen2.5-72B-Instruct
396
+ layer_range: [60, 61]
397
+ - sources:
398
+ - model: Qwen/Qwen2.5-72B-Instruct
399
+ layer_range: [60, 61]
400
+ - sources:
401
+ - model: Qwen/Qwen2.5-72B-Instruct
402
+ layer_range: [61, 62]
403
+ - sources:
404
+ - model: Qwen/Qwen2.5-72B-Instruct
405
+ layer_range: [61, 62]
406
+ - sources:
407
+ - model: Qwen/Qwen2.5-72B-Instruct
408
+ layer_range: [62, 63]
409
+ - sources:
410
+ - model: Qwen/Qwen2.5-72B-Instruct
411
+ layer_range: [62, 63]
412
+ - sources:
413
+ - model: Qwen/Qwen2.5-72B-Instruct
414
+ layer_range: [63, 64]
415
+ - sources:
416
+ - model: Qwen/Qwen2.5-72B-Instruct
417
+ layer_range: [63, 64]
418
+ - sources:
419
+ - model: Qwen/Qwen2.5-72B-Instruct
420
+ layer_range: [64, 65]
421
+ - sources:
422
+ - model: Qwen/Qwen2.5-72B-Instruct
423
+ layer_range: [64, 65]
424
+ - sources:
425
+ - model: Qwen/Qwen2.5-72B-Instruct
426
+ layer_range: [65, 66]
427
+ - sources:
428
+ - model: Qwen/Qwen2.5-72B-Instruct
429
+ layer_range: [65, 66]
430
+ - sources:
431
+ - model: Qwen/Qwen2.5-72B-Instruct
432
+ layer_range: [66, 67]
433
+ - sources:
434
+ - model: Qwen/Qwen2.5-72B-Instruct
435
+ layer_range: [66, 67]
436
+ - sources:
437
+ - model: Qwen/Qwen2.5-72B-Instruct
438
+ layer_range: [67, 68]
439
+ - sources:
440
+ - model: Qwen/Qwen2.5-72B-Instruct
441
+ layer_range: [67, 68]
442
+ - sources:
443
+ - model: Qwen/Qwen2.5-72B-Instruct
444
+ layer_range: [68, 69]
445
+ - sources:
446
+ - model: Qwen/Qwen2.5-72B-Instruct
447
+ layer_range: [68, 69]
448
+ - sources:
449
+ - model: Qwen/Qwen2.5-72B-Instruct
450
+ layer_range: [69, 70]
451
+ - sources:
452
+ - model: Qwen/Qwen2.5-72B-Instruct
453
+ layer_range: [69, 70]
454
+ - sources:
455
+ - model: Qwen/Qwen2.5-72B-Instruct
456
+ layer_range: [70, 71]
457
+ - sources:
458
+ - model: Qwen/Qwen2.5-72B-Instruct
459
+ layer_range: [70, 71]
460
+ - sources:
461
+ - model: Qwen/Qwen2.5-72B-Instruct
462
+ layer_range: [71, 72]
463
+ - sources:
464
+ - model: Qwen/Qwen2.5-72B-Instruct
465
+ layer_range: [71, 72]
466
+ - sources:
467
+ - model: Qwen/Qwen2.5-72B-Instruct
468
+ layer_range: [72, 73]
469
+ - sources:
470
+ - model: Qwen/Qwen2.5-72B-Instruct
471
+ layer_range: [72, 73]
472
+ - sources:
473
+ - model: Qwen/Qwen2.5-72B-Instruct
474
+ layer_range: [73, 74]
475
+ - sources:
476
+ - model: Qwen/Qwen2.5-72B-Instruct
477
+ layer_range: [73, 74]
478
+ - sources:
479
+ - model: Qwen/Qwen2.5-72B-Instruct
480
+ layer_range: [74, 75]
481
+ - sources:
482
+ - model: Qwen/Qwen2.5-72B-Instruct
483
+ layer_range: [74, 75]
484
+ - sources:
485
+ - model: Qwen/Qwen2.5-72B-Instruct
486
+ layer_range: [75, 76]
487
+ - sources:
488
+ - model: Qwen/Qwen2.5-72B-Instruct
489
+ layer_range: [75, 76]
490
+ - sources:
491
+ - model: Qwen/Qwen2.5-72B-Instruct
492
+ layer_range: [76, 77]
493
+ - sources:
494
+ - model: Qwen/Qwen2.5-72B-Instruct
495
+ layer_range: [76, 77]
496
+ - sources:
497
+ - model: Qwen/Qwen2.5-72B-Instruct
498
+ layer_range: [77, 80]
499
+ merge_method: passthrough
500
+ dtype: float16
501
  ```
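
The slice list in the configuration above follows a regular pattern: layers 0 to 3 appear once, each of layers 4 to 76 appears twice in a row, and layers 77 to 79 appear once. As a minimal sketch (not necessarily how the original config was produced), the same list could be regenerated in Python. The function names and parameters below are hypothetical, and the YAML indentation is an assumption, since the diff view does not preserve it.

```python
# Sketch only: regenerates the slice pattern visible in the config above.
# Helper names and the YAML indentation are assumptions, not from the README.
MODEL = "Qwen/Qwen2.5-72B-Instruct"  # base model named in the README


def build_slices(first_single=4, last_doubled=76, total_layers=80):
    """Return (start, end) layer ranges: layers [0, first_single) once,
    each layer in first_single..last_doubled twice, the rest once."""
    ranges = [(0, first_single)]
    for layer in range(first_single, last_doubled + 1):
        # Emit the same single-layer slice twice to duplicate the layer.
        ranges.extend([(layer, layer + 1)] * 2)
    ranges.append((last_doubled + 1, total_layers))
    return ranges


def to_yaml(ranges):
    """Render the ranges as a passthrough merge config in YAML text."""
    lines = ["slices:"]
    for start, end in ranges:
        lines.append("- sources:")
        lines.append(f"  - model: {MODEL}")
        lines.append(f"    layer_range: [{start}, {end}]")
    lines.append("merge_method: passthrough")
    lines.append("dtype: float16")
    return "\n".join(lines)


slices = build_slices()
config_text = to_yaml(slices)
```

Printing `config_text` gives a config with the same slice order as the one in the diff; summing the range lengths in `slices` gives the number of layers in the merged stack.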