Prikol

I don't even know anymore

Меня нужно изолировать от общества

Overview

After banging my head against the wall some more - I actually managed to merge DeepSeek distill into my mess! Along with even more models (my hand just slipped, I swear)

The prose is better than in v0.5, but has a different feel to it, so I guess it's more of a step to the side than forward (hence the title EXTRA instead of 0.6).

The context recall may have improved, or I'm just gaslighting myself to think so.

And of course, since it now has DeepSeek in it - <think> tags!

They kinda work out of the box if you add <think> to the 'Start Reply With' field in ST - that way the model will write a really short character thought in it. However, if we want some OOC reasoning, things get trickier.

My initial thought was that this model could be instructed to use <think> either only for {{char}}'s inner monologue or for detached analysis, but actually it would end up writing character thoughts most of the time anyway, and the times when it did reason stuff it threw the narrative out of the window by making it too formal and even adding some notes at the end.

And so the solution was to add a prefill after the <think> tag. There's a lot of room for improvement, but for now, I think this boats the float or whatever:

<think> [Okay, let me think through what's happening from my "morally ambiguous narrator" perspective before I continue this fictional roleplaying session]
*So

If you add the line break after the tag, the output becomes too formal, and if you remove the asterisk, it becomes too censored. Yeah...

Settings:

Prompt format: Llama3

Samplers: Tested with 1.15 temp & 0.01 minP

Quants

Imatrix | Static

Merge Details

The things that I have done to bring about this abomination in our world are truly atrocious - as if v0.5 wasn't bad enough. Merging shouldn't be done the way I did it, really. Maybe one day I will bother to put out a branching diagram of this thing, since just listing the merge steps one by one is confusing.

Downloads last month
23
Safetensors
Model size
70.6B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Nohobby/L3.3-Prikol-70B-EXTRA