hawei committed on
Commit 7e75399 · verified · 1 Parent(s): 628d4e0

Update model card with benchmark result plotting

Files changed (1)
  1. README.md +16 -1
README.md CHANGED
@@ -72,4 +72,19 @@ model-index:
  verified: false
  ---
  # Control-LLM-Llama3.1-8B-Math16
- This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
+ This is a fine-tuned model of Llama-3.1-8B-Instruct for mathematical tasks on OpenMath2 dataset.
+
+ ## Evaluation Results
+ Here is an overview of the evaluation results and findings:
+
+ ### Benchmark Result and Catastrophic Forgetting on OpenMath
+ The following plot illustrates benchmark result and catastrophic forgetting mitigation on the OpenMath2 dataset.
+
+ ![Catastrophic Forgetting](plots/catastrophic_forgetting_openmath.png)
+
+ ### Alignment Comparison
+ The plot below highlights the alignment comparison of the model trained with Control LLM and Full Parameter Tuning.
+
+ ![Alignment Comparison](plots/alignment_comparison.png)
+
+