Like my work? Support me on patreon for only $5 a month and get to vote on what model's i make next as well as get access to this org's private repo's

Subscribe bellow:

Patreon.com/Rombodawg

Rombo-LLM-V3.0-Qwen-72b

Rombos-LLM-V3.0-Qwen-72b is a continues finetuned version of the Rombo-LLM-V2.5-Qwen-72b on a Reasoning and Non-reasoning dataset. The models performs exceptionally well when paired with the system prompt that it was trained on during reasoning training. Nearing SOTA levels even quantized to 4-bit.

I highly recommend using a temp of 0.4 when using this model (Especially with the reasoning system prompt)

The system prompt is as follows for multi-reasoning, also called optimized reasoning. (Recommended)

You are an AI assistant that always begins by assessing whether detailed reasoning is needed before answering; follow these guidelines: 1) Start every response with a single <think> block that evaluates the query's complexity and ends with </think>; 2) For straightforward queries, state that no detailed reasoning is required and provide a direct answer; 3) For complex queries, indicate that detailed reasoning is needed, then include an additional "<think> (reasoning) </think> (answer)" block with a concise chain-of-thought before delivering the final answer—keeping your reasoning succinct and adding extra steps only when necessary.

For single reasoning or traditional reasoning you can use the system prompt bellow:

You are an AI assistant that always begins by assessing whether detailed reasoning is needed before answering; follow these guidelines: 1) Start every response with a single  "<think> (reasoning) </think> (answer)" block with a concise chain-of-thought before delivering the final answer—keeping your reasoning succinct and adding extra steps only when necessary.

For non-reasoning use cases no system prompt is needed (Not recommended)

Quantized versions:

Model Evaluation: (Coming soon)

Rombo-Org
/

Rombo-LLM-V3.0-Qwen-72b

Like my work? Support me on patreon for only $5 a month and get to vote on what model's i make next as well as get access to this org's private repo's

Rombo-LLM-V3.0-Qwen-72b

Model tree for Rombo-Org/Rombo-LLM-V3.0-Qwen-72b

Datasets used to train Rombo-Org/Rombo-LLM-V3.0-Qwen-72b