mkurman commited on
Commit
88ca35b
·
verified ·
1 Parent(s): 50fb940

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +169 -3
README.md CHANGED
@@ -1,3 +1,169 @@
1
- ---
2
- license: llama3.2
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: llama3.2
5
+ library_name: transformers
6
+ base_model:
7
+ - meta-llama/Llama-3.2-1B-Instruct
8
+ - Llama-3.2-SUN-2.5B-chat
9
+ datasets:
10
+ - argilla/OpenHermesPreferences
11
+ - argilla/magpie-ultra-v0.1
12
+ - argilla/Capybara-Preferences-Filtered
13
+ - mlabonne/open-perfectblend
14
+ - HuggingFaceTB/everyday-conversations-llama3.1-2k
15
+ - WizardLMTeam/WizardLM_evol_instruct_V2_196k
16
+ - ProlificAI/social-reasoning-rlhf
17
+ - allenai/tulu-3-sft-mixture
18
+ - allenai/llama-3.1-tulu-3-8b-preference-mixture
19
+ pipeline_tag: text-generation
20
+ model-index:
21
+ - name: Llama-3.2-SUN-1B-Instruct
22
+ results:
23
+ - task:
24
+ type: text-generation
25
+ name: Text Generation
26
+ dataset:
27
+ name: IFEval (0-Shot)
28
+ type: HuggingFaceH4/ifeval
29
+ args:
30
+ num_few_shot: 0
31
+ metrics:
32
+ - type: inst_level_strict_acc and prompt_level_strict_acc
33
+ value: 64.13
34
+ name: strict accuracy
35
+ source:
36
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
37
+ name: Open LLM Leaderboard
38
+ - task:
39
+ type: text-generation
40
+ name: Text Generation
41
+ dataset:
42
+ name: BBH (3-Shot)
43
+ type: BBH
44
+ args:
45
+ num_few_shot: 3
46
+ metrics:
47
+ - type: acc_norm
48
+ value: 9.18
49
+ name: normalized accuracy
50
+ source:
51
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
52
+ name: Open LLM Leaderboard
53
+ - task:
54
+ type: text-generation
55
+ name: Text Generation
56
+ dataset:
57
+ name: MATH Lvl 5 (4-Shot)
58
+ type: hendrycks/competition_math
59
+ args:
60
+ num_few_shot: 4
61
+ metrics:
62
+ - type: exact_match
63
+ value: 4.61
64
+ name: exact match
65
+ source:
66
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
67
+ name: Open LLM Leaderboard
68
+ - task:
69
+ type: text-generation
70
+ name: Text Generation
71
+ dataset:
72
+ name: GPQA (0-shot)
73
+ type: Idavidrein/gpqa
74
+ args:
75
+ num_few_shot: 0
76
+ metrics:
77
+ - type: acc_norm
78
+ value: 0.0
79
+ name: acc_norm
80
+ source:
81
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
82
+ name: Open LLM Leaderboard
83
+ - task:
84
+ type: text-generation
85
+ name: Text Generation
86
+ dataset:
87
+ name: MuSR (0-shot)
88
+ type: TAUR-Lab/MuSR
89
+ args:
90
+ num_few_shot: 0
91
+ metrics:
92
+ - type: acc_norm
93
+ value: 4.05
94
+ name: acc_norm
95
+ source:
96
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
97
+ name: Open LLM Leaderboard
98
+ - task:
99
+ type: text-generation
100
+ name: Text Generation
101
+ dataset:
102
+ name: MMLU-PRO (5-shot)
103
+ type: TIGER-Lab/MMLU-Pro
104
+ config: main
105
+ split: test
106
+ args:
107
+ num_few_shot: 5
108
+ metrics:
109
+ - type: acc
110
+ value: 8.68
111
+ name: accuracy
112
+ source:
113
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=meditsolutions/Llama-3.2-SUN-1B-Instruct
114
+ name: Open LLM Leaderboard
115
+ ---
116
+
117
+ # MedIT SUN HDIC 1B Instruct
118
+
119
+ <div align="center">
120
+ <img src="https://i.ibb.co/PF0TdMJ/imagine-image-9a56cee7-0f4f-4cc2-b265-a5b8d04f266b.png" alt="Llama-3.2-MedIT-SUN-2.5B" style="border-radius: 10px; box-shadow: 0 4px 8px 0 rgba(0, 0, 0, 0.2), 0 6px 20px 0 rgba(0, 0, 0, 0.19); max-width: 100%; height: auto;">
121
+ </div>
122
+
123
+ **Base Model**
124
+ - Llama 3.2 1B -> MedIT SUN 2.5B -> MedIT SUN 1B -> Knowledge Injection from Llama 3.1 8B Instruct
125
+
126
+ **Mesh Size**
127
+ - 1B to 2.5B parameters [MedIT SUN 2.5B](https://huggingface.co/meditsolutions/Llama-3.2-SUN-2.5B-chat) -> layers mesh using MedIT-mesh technique and downscaled to 1B
128
+
129
+ **Extension Method**
130
+ - Proprietary technique developed by MedIT Solutions
131
+
132
+ **Fine-tuning**
133
+ - Open (or open subsets allowing for commercial use) open datasets from HF
134
+ - Open (or open subsets allowing for commercial use) SFT datasets from HF
135
+
136
+ **Training Status**
137
+ - Current version: instruct-1.0.0
138
+
139
+ **Key Features**
140
+ - Built on Llama 3.2 architecture
141
+ - Upscaled from 1B to 2.47B parameters
142
+ - Optimized for open-ended conversations
143
+ - Incorporates supervised fine-tuning for improved performance
144
+ - Layers meshing using the MedIT-mesh technique
145
+ - Downscaled to 1B
146
+ - Knowledge injection from Llama 3.1 8B Instruct using new technique developed by MedIT Solutions
147
+ - HDIC
148
+
149
+ **Use Case**
150
+ - General conversation and task-oriented interactions
151
+
152
+ **Limitations**
153
+ As the model is still in training, performance and capabilities may vary. Users should be aware that the model is not in its final form and may exhibit inconsistencies or limitations typical of in-progress AI models.
154
+
155
+ **Disclaimer and Safety Considerations**
156
+ The Model is designed to be used as a smart assistant but not as a knowledge source within your applications, systems, or environments. It is not intended to provide 100% accurate answers, especially in scenarios where high precision and accuracy are
157
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
158
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_meditsolutions__Llama-3.2-SUN-1B-Instruct)
159
+
160
+ | Metric |Value|
161
+ |-------------------|----:|
162
+ |Avg. |15.11|
163
+ |IFEval (0-Shot) |64.13|
164
+ |BBH (3-Shot) | 9.18|
165
+ |MATH Lvl 5 (4-Shot)| 4.61|
166
+ |GPQA (0-shot) | 0.00|
167
+ |MuSR (0-shot) | 4.05|
168
+ |MMLU-PRO (5-shot) | 8.68|
169
+