monkeypostulate/gpt-neo-1.3B


monkeypostulate committed on Oct 28, 2024

Commit 4543671 · verified · 1 Parent(s): 3be466d

Update README.md

Files changed (1)

README.md +19 -0


@@ -1,3 +1,22 @@
 ---
 datasets:
 - mlabonne/orpo-dpo-mix-40k

+This model is a fine-tuned version of meta-llama/Llama-3.2-1B, trained with the ORPO (Odds Ratio Preference Optimization) trainer. Fine-tuning used a subset of the mlabonne/orpo-dpo-mix-40k dataset; only 100 samples were selected to keep training fast.
+**Fine-tuning Method:** ORPO
+**Dataset:** mlabonne/orpo-dpo-mix-40k
+**Evaluation**
+The model was evaluated zero-shot on the HellaSwag benchmark, with the following results:
+|  Tasks  |Version|Filter|n-shot| Metric |   |Value |   |Stderr|
+|---------|------:|------|-----:|--------|---|-----:|---|-----:|
+|hellaswag|      1|none  |     0|acc     |↑  | 0.4772 |±  | 0.0050 |
+|         |       |none  |     0|acc_norm|↑  |0.6366 |±  | 0.0048 |
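The Stderr column in the table above can be sanity-checked with the usual binomial standard error of a proportion, `sqrt(p * (1 - p) / n)`. The sketch below is not part of the commit. It assumes the evaluation ran on the full HellaSwag validation split of 10,042 examples (the card does not state the sample count), and the helper name `binomial_stderr` is hypothetical.

```python
import math


def binomial_stderr(acc: float, n: int) -> float:
    """Standard error of an accuracy estimated from n independent examples."""
    return math.sqrt(acc * (1.0 - acc) / n)


# Assumption: full HellaSwag validation split; not stated in the model card.
N_EXAMPLES = 10_042

# (value, reported stderr) pairs copied from the table above.
reported = {"acc": (0.4772, 0.0050), "acc_norm": (0.6366, 0.0048)}

for name, (value, stderr) in reported.items():
    estimate = binomial_stderr(value, N_EXAMPLES)
    print(f"{name}: reported ±{stderr:.4f}, estimated ±{estimate:.4f}")
```

Under that assumption, both estimates agree with the reported values to four decimal places, which suggests the table reflects a run over the whole validation split rather than a subsample.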
 ---
 datasets:
 - mlabonne/orpo-dpo-mix-40k
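For readers unfamiliar with the fine-tuning method named in the card, ORPO adds an odds-ratio penalty to the ordinary supervised loss on the preferred response. The sketch below is not the author's training code: it computes the per-example objective in plain Python from two length-averaged log-probabilities. The function names and the weight `lam=0.1` are illustrative assumptions.

```python
import math


def log_odds(avg_logp: float) -> float:
    """log(p / (1 - p)) where p = exp(avg_logp) and avg_logp < 0."""
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(logp_chosen: float, logp_rejected: float, lam: float = 0.1) -> float:
    """Per-example ORPO objective: SFT loss plus a weighted odds-ratio term.

    Inputs are length-averaged token log-probabilities of the chosen and
    rejected responses under the model. lam is an illustrative weight.
    """
    sft = -logp_chosen  # negative log-likelihood of the chosen response
    ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    odds_term = math.log1p(math.exp(-ratio))  # equals -log(sigmoid(ratio))
    return sft + lam * odds_term


# The penalty shrinks as the chosen response becomes more likely than the rejected one.
print(orpo_loss(-0.5, -0.5))  # equal likelihoods: penalty is lam * log(2)
print(orpo_loss(-0.5, -2.0))  # chosen clearly preferred: smaller penalty
```

In practice the two log-probabilities come from a forward pass over each preference pair in the dataset, and the loss is averaged over the batch.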