Text Generation
GGUF
English
Prototype
8X3B MOE
mixture of experts
reasoning
thinking
thoughts
deepseek
Mixture of Experts
context 128k
Llama 3.2 MOE
creative
creative writing
general usage
problem solving
brainstorming
solve riddles
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
story
writing
fiction
roleplaying
llama 3.2
mergekit
Merge
Inference Endpoints
conversational
I need F16 quantization, please
1
#2 opened 6 days ago
by
nikitayev
Repetition for long token generation.
1
#1 opened 6 days ago
by
lazyDataScientist
