Feedback after some use

#1
by AlecFoster - opened

Hello, I'm quite torn on this model... I like its output a lot, but it definitely still has repetition issues. They're not as bad as in some models, and they mostly seem to become a problem when conversations get longer.

So far I've had the most luck against repetition using the LatitudeGames/Harbinger-24B model, but that model has some other issues of its own... So that's why I'm torn between these models: I like the output from Codex-24B-Small-3.2 a bit more, but the repetition does become a bit problematic, since most of my sessions tend to go fairly long, often ~16K tokens.

Another issue I've had with Codex-24B-Small-3.2 is user message misunderstandings: sometimes it misunderstands what I'm saying quite drastically, forcing me to add more "metadata" to guide it towards the intended meaning. On the other hand, I've found Codex-24B-Small-3.2 to be more flexible than Harbinger-24B if you suddenly introduce another character into the scenario. At least for me, Harbinger-24B seems to need a lot of extra guiding to get it to consistently add dialogue for multiple characters if they aren't specifically mentioned in the system prompt (this might be more an issue with how I define the character and world, so I won't blame it too much for it).

In summary, I've really enjoyed messing around with Codex-24B-Small-3.2, and I'll keep messing around with it to see if in the end I like it more than Harbinger-24B. These two models have made me quite excited for the possibility of larger versions eventually, since I've mostly found the ~20B-30B parameter count to be okay, but not quite enough for more complex spatial scenes. I really wish Mistral had released Mistral Medium openly; it would have been really fun to see what you could have done with a slightly larger Mistral model.

I really wish I could experiment quicker, but my limited compute makes it quite slow to test longer scenarios and a larger variety of scenarios. So far I've tested the following temperatures: 0.5, 0.55 and 0.75.

Thank you again for a fun model, I look forward to seeing what you make next :)
