grounded-ai/phi4-mini-judge

Generated from Trainer

hallucination-detection

toxicity-detection

relevance-evaluation


Jlonge4 committed on Jun 8

Commit 15fc5ad (verified) · 1 Parent(s): 9d6984b

Update README.md

Files changed (1)

README.md +1 -1

@@ -21,7 +21,7 @@ This repository contains our comprehensive AI safety evaluation PEFT adapter mod
 ## Model Performance
-Our Phi-4-Mini-Judge model achieves strong performance across all three evaluation dimensions on a balanced test set of 105 samples (35 per task):
+The GroundedAI Phi-4-Mini-Judge model achieves strong performance across all three evaluation dimensions on a balanced test set of 105 samples (35 per task):
 ### Overall Performance
 - **Total Accuracy: 81.90%** (86/105 correct predictions)