File size: 4,921 Bytes
a943f6e
 
 
25961ce
7a4993e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a943f6e
34b8ac2
f98fa4b
a943f6e
61fd04b
1eb3310
4a444af
90ddfd5
1eb3310
 
aa29969
 
 
1eb3310
 
938ca38
 
 
1eb3310
 
 
 
a943f6e
1eb3310
 
 
938ca38
 
 
 
 
 
 
 
 
 
2267960
974d2d4
1eb3310
a943f6e
1eb3310
6d3ba8d
25a5708
1eb3310
 
4bbee9c
763ccf6
a943f6e
 
1eb3310
 
 
 
 
b9b82ec
7a4993e
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
---
library_name: transformers
tags:
- not-for-all-audiences
model-index:
- name: OpenCrystal-12B-L3
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: IFEval (0-Shot)
      type: HuggingFaceH4/ifeval
      args:
        num_few_shot: 0
    metrics:
    - type: inst_level_strict_acc and prompt_level_strict_acc
      value: 40.71
      name: strict accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: BBH (3-Shot)
      type: BBH
      args:
        num_few_shot: 3
    metrics:
    - type: acc_norm
      value: 31.84
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MATH Lvl 5 (4-Shot)
      type: hendrycks/competition_math
      args:
        num_few_shot: 4
    metrics:
    - type: exact_match
      value: 7.93
      name: exact match
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GPQA (0-shot)
      type: Idavidrein/gpqa
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 7.49
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MuSR (0-shot)
      type: TAUR-Lab/MuSR
      args:
        num_few_shot: 0
    metrics:
    - type: acc_norm
      value: 5.74
      name: acc_norm
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU-PRO (5-shot)
      type: TIGER-Lab/MMLU-Pro
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 29.34
      name: accuracy
    source:
      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Darkknight535/OpenCrystal-12B-L3
      name: Open LLM Leaderboard
---
### OpenCrystal-12B-L3
This is a finetuned language model. (I recommend using this one v2 and v2.1 are not good enough)

![Rohma](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/resolve/main/1s.jpg)

### 128K??
[L3.1 Variant here](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3.1-128K)

### Instruct Template
Default llama3 instruct and context preset, but here is the one i use.
[Instruct](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/blob/main/Llama%203%20%5BInstruct%5D.json)
[Context](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/blob/main/Llama%203%20%5BContext%5D.json)

### Samplers


## Creative
```
Temp : 1.23
Min P : 0.05
Repetition Penalty : 1.05

[And everything else neutral]
```

## Normal
```
Temp : 0.6 - 0.8
Min P : 0.1
Repetition Penalty : 1.1

[And everything else neutral]
```


### Pro Tip
- You can uncheck *Include Names* option in sillytavern, to force it to speak as others dynamically. **Not Recommended**
### Features

- Can speak as other npc automatically.
- Creative (Swipes are crazy.)
- Coherent (Sometime gets horny)
- Output feels like you're using Character.ai
- Follows prompt better
- Likes higher context length. (12K easily tested)
- can summarize and generate image prompts well [The Above image's prompt is generated in a roleplay by this model] (Possible : Due to llama-3-instruct as base)


### Instruct Prompt
```
You're {{char}}, follow {{char}} personality and plot of the story, Don't impersonate as {{user}}, Speak as others NPC except {{user}} when needed. Be Creative, Create various interesting events and situations during the story.
```

### FeedBack
[FeedBack here](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/discussions/1)
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Darkknight535__OpenCrystal-12B-L3)

|      Metric       |Value|
|-------------------|----:|
|Avg.               |20.51|
|IFEval (0-Shot)    |40.71|
|BBH (3-Shot)       |31.84|
|MATH Lvl 5 (4-Shot)| 7.93|
|GPQA (0-shot)      | 7.49|
|MuSR (0-shot)      | 5.74|
|MMLU-PRO (5-shot)  |29.34|