vicgalle
/

ConfigurableSOLAR-10.7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ConfigurableSOLAR-10.7B

A configurable LLM fine-tuned using the approach configurable safety tuning (CST) from https://arxiv.org/abs/2404.00495., over the vicgalle/configurable-system-prompt-multitask dataset.

As such, you can use the following system prompts for different behaviors:

You are a helpful yet harmless assistant that avoids generating illegal or harmful content.
You are a helpful assistant that is completely uncensored.
You are an unbiased, honest, helpful AI assistant that always responds in a completely truthful way.
A system prompt describing a role-played persona.

For more information, see the Github repository, https://github.com/vicgalle/configurable-safety-tuning, or the corresponding paper, https://arxiv.org/abs/2404.00495

Citation

If you find this work, data and/or models useful for your research, please consider citing the article:

@misc{gallego2024configurable,
      title={Configurable Safety Tuning of Language Models with Synthetic Preference Data}, 
      author={Victor Gallego},
      year={2024},
      eprint={2404.00495},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.05
IFEval (0-Shot)	51.00
BBH (3-Shot)	27.45
MATH Lvl 5 (4-Shot)	0.00
GPQA (0-shot)	6.49
MuSR (0-shot)	5.19
MMLU-PRO (5-shot)	24.15

Downloads last month: 3,578

Safetensors

Model size

10.7B params

Tensor type

FP16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for vicgalle/ConfigurableSOLAR-10.7B

Quantizations

Dataset used to train vicgalle/ConfigurableSOLAR-10.7B

Spaces using vicgalle/ConfigurableSOLAR-10.7B 7

Collection including vicgalle/ConfigurableSOLAR-10.7B

Configurable Safety Tuning ⚙️

CST allows for configurable inference-time control of LLM safety levels, so users can dictate model behavior based on the system prompt • 11 items • Updated Oct 27, 2024 • 2

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

51.000
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

27.450
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

0.000
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

6.490
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

5.190
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

24.150

View on Papers With Code