Update README.md
README.md
CHANGED
@@ -82,11 +82,9 @@ The Beta model has been developed to excel in several different medical tasks. F
 
 
 
-We also compared the performance of the model in the general domain, using the OpenLLM Leaderboard benchmark:
+We also compared the performance of the model in the general domain, using the OpenLLM Leaderboard benchmark. Aloe-Beta gets competitive results with the current SOTA general models in the most used general benchmarks and outperforms the medical models:
 
-
-
-
+
 
 ## Uses
 
@@ -276,7 +274,7 @@ The model is aligned using the Direct Preference Optimization (DPO) technique th
 2. Red-Teaming Alignment: This step further fine-tunes the model to resist a variety of potential attacks, enhancing its robustness and security. Dataset will be shared soon. In this stage, we set the learning rate to 1e-7.
 
 <!---
-^^^ LINKS TO DPO DATA ^^^
+^^^ LINKS TO DPO DATA (DPO added, missing the RT^^^
 -->
 
 
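For context on the second hunk: the README describes alignment with Direct Preference Optimization at a learning rate of 1e-7. The standard DPO objective that technique optimizes can be sketched in plain Python. This is a minimal illustration, not the repository's training code; the function name, the example log-probabilities, and the default β = 0.1 are assumptions.

```python
import math

def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """DPO loss for a single preference pair of summed token log-probs.

    Computes -log(sigmoid(beta * (chosen_ratio - rejected_ratio))), where
    each ratio is the policy log-prob minus the reference-model log-prob.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_ratio - rejected_ratio)
    # log1p(exp(-x)) is algebraically equal to -log(sigmoid(x))
    return math.log1p(math.exp(-margin))

# When the policy and reference agree on both responses, the margin is 0
# and the loss sits at log(2); it shrinks as the policy assigns relatively
# more probability to the chosen response than to the rejected one.
```

In a full training run this per-pair loss would be averaged over a batch and minimized with a very small step size, consistent with the 1e-7 learning rate the README reports for the red-teaming alignment stage.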