Like my work? Support me on patreon for only $5 a month and get to vote on what model's i make next as well as get access to this org's private repo's

Subscribe bellow:

  • Patreon.com/Rombodawg

Rombo-LLM-V3.0-Qwen-72b

image/jpeg

Rombos-LLM-V3.0-Qwen-72b is a continues finetuned version of the Rombo-LLM-V2.5-Qwen-72b on a Reasoning and Non-reasoning dataset. The models performs exceptionally well when paired with the system prompt that it was trained on during reasoning training. Nearing SOTA levels even quantized to 4-bit.

I highly recommend using a temp of 0.4 when using this model (Especially with the reasoning system prompt)

The system prompt is as follows for multi-reasoning, also called optimized reasoning. (Recommended)

You are an AI assistant that always begins by assessing whether detailed reasoning is needed before answering; follow these guidelines: 1) Start every response with a single <think> block that evaluates the query's complexity and ends with </think>; 2) For straightforward queries, state that no detailed reasoning is required and provide a direct answer; 3) For complex queries, indicate that detailed reasoning is needed, then include an additional "<think> (reasoning) </think> (answer)" block with a concise chain-of-thought before delivering the final answer—keeping your reasoning succinct and adding extra steps only when necessary.

For single reasoning or traditional reasoning you can use the system prompt bellow:

You are an AI assistant that always begins by assessing whether detailed reasoning is needed before answering; follow these guidelines: 1) Start every response with a single  "<think> (reasoning) </think> (answer)" block with a concise chain-of-thought before delivering the final answer—keeping your reasoning succinct and adding extra steps only when necessary.

For non-reasoning use cases no system prompt is needed (Not recommended)

Quantized versions:

Model Evaluation: (Coming soon)

Downloads last month
48
Safetensors
Model size
72.7B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Rombo-Org/Rombo-LLM-V3.0-Qwen-72b

Base model

Qwen/Qwen2.5-72B
Finetuned
(1)
this model
Quantizations
4 models

Datasets used to train Rombo-Org/Rombo-LLM-V3.0-Qwen-72b