nbeerbower
/

Lyra4-Gutenberg2-12B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Lyra4-Gutenberg2-12B

Sao10K/MN-12B-Lyra-v4 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.

Features an increased sequence length from Lyra4-Gutenberg-12B.

Method

ORPO Finetuned using 2x RTX 3090 for 3 epochs.

Training data was formatted with ChatML.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.74
IFEval (0-Shot)	25.85
BBH (3-Shot)	33.73
MATH Lvl 5 (4-Shot)	10.50
GPQA (0-shot)	8.39
MuSR (0-shot)	11.49
MMLU-PRO (5-shot)	28.51

Downloads last month: 109

Safetensors

Model size

12.2B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for nbeerbower/Lyra4-Gutenberg2-12B

Base model

Sao10K/MN-12B-Lyra-v4

Finetuned

(2)

this model

Merges

Quantizations

Datasets used to train nbeerbower/Lyra4-Gutenberg2-12B

Spaces using nbeerbower/Lyra4-Gutenberg2-12B 3

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

25.850
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

33.730
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

10.500
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

8.390
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

11.490
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

28.510

View on Papers With Code