anujga
's Collections
Viewer
•
Updated
•
18.3M
•
253
•
42
Norquinal/claude_multiround_chat_30k
Viewer
•
Updated
•
32.2k
•
61
•
68
rombodawg/Everything_Instruct
Viewer
•
Updated
•
4.05M
•
129
•
54
Viewer
•
Updated
•
1.84M
•
1.02k
•
150
prometheus-eval/Feedback-Collection
Viewer
•
Updated
•
100k
•
389
•
115
Viewer
•
Updated
•
178k
•
30
•
9
Viewer
•
Updated
•
2.94M
•
10.7k
•
1.45k
sentence-transformers/gooaq
Viewer
•
Updated
•
3.01M
•
485
•
28
lightblue/rag_datasets_selected_14Bscored
Viewer
•
Updated
•
2.66M
•
60
lightblue/rag_datasets_selected_32B4scored_probs
Viewer
•
Updated
•
2.55M
•
23
Viewer
•
Updated
•
183k
•
650
•
291
Viewer
•
Updated
•
2.2M
•
5.56k
•
359
Viewer
•
Updated
•
182k
•
344
•
115
Viewer
•
Updated
•
375k
•
17.9k
•
629
suriyagunasekar/stackoverflow-with-meta-data
Viewer
•
Updated
•
19.9M
•
212
•
12
suriyagunasekar/stackoverflow-python-with-meta-data
Viewer
•
Updated
•
1.75M
•
254
•
12
Viewer
•
Updated
•
21.5M
•
177
•
19
Viewer
•
Updated
•
41.8M
•
68.9k
•
26
Viewer
•
Updated
•
1.33M
•
1.34k
•
51
rubenroy/GammaCorpus-v2-1m
Viewer
•
Updated
•
1M
•
8
•
9
rubenroy/GammaCorpus-Fact-QA-450k
Viewer
•
Updated
•
450k
•
25
•
11
CodeKapital/CookingRecipes
Viewer
•
Updated
•
2.23M
•
72
•
7
Infi-MM/InfiMM-WebMath-40B
Viewer
•
Updated
•
22.8M
•
2.6k
•
68
HuggingFaceTB/smol-smoltalk
Viewer
•
Updated
•
485k
•
1.09k
•
60
Viewer
•
Updated
•
1.66M
•
2.43k
•
60
allenai/real-toxicity-prompts
Viewer
•
Updated
•
99.4k
•
4.06k
•
93
Viewer
•
Updated
•
135k
•
1.69k
•
271
Viewer
•
Updated
•
75.6k
•
2.59k
•
63
Viewer
•
Updated
•
786k
•
14.3k
•
53
HuggingFaceM4/the_cauldron
Viewer
•
Updated
•
1.88M
•
125k
•
492
Viewer
•
Updated
•
838k
•
3.63k
•
375
chargoddard/WebInstructSub-prometheus
Viewer
•
Updated
•
2.39M
•
478
•
24
OpenLeecher/lmsys_chat_1m_clean
Viewer
•
Updated
•
273k
•
256
•
77
Viewer
•
Updated
•
130k
•
6.76k
•
21
Viewer
•
Updated
•
2.96M
•
67
•
4
Viewer
•
Updated
•
814k
•
770
•
285
Viewer
•
Updated
•
9.89k
•
3.39k
•
78
Wanfq/Explore_Instruct_Rewriting_32k
Viewer
•
Updated
•
32k
•
29
•
6
Locutusque/UltraTextbooks-2.0
Viewer
•
Updated
•
3.22M
•
104
•
50
CausalLM/Retrieval-SFT-Chat
Viewer
•
Updated
•
100k
•
90
•
53
CausalLM/Refined-Anime-Text
Viewer
•
Updated
•
1.02M
•
291
•
264
theothernet/ttr-prompting
Viewer
•
Updated
•
2.61M
•
17
Viewer
•
Updated
•
6.33M
•
6
shijli/amazon-reviews-multi
Viewer
•
Updated
•
2.52M
•
29
Viewer
•
Updated
•
1.16M
•
226
agentlans/combined-roleplay
Viewer
•
Updated
•
1.42M
•
116
•
4
PrimeIntellect/SYNTHETIC-1
Viewer
•
Updated
•
1.99M
•
777
•
58
Viewer
•
Updated
•
1.75M
•
307
•
101
arcee-ai/LLama-405B-Logits
Viewer
•
Updated
•
10k
•
811
•
11
togethercomputer/Long-Data-Collections
Viewer
•
Updated
•
4.12M
•
3.95k
•
150
Viewer
•
Updated
•
9.59M
•
1.3k
•
17
microsoft/orca-agentinstruct-1M-v1
Viewer
•
Updated
•
1.05M
•
2.11k
•
450
Viewer
•
Updated
•
4M
•
10.2k
•
46
SocialGrep/the-reddit-irl-dataset
Viewer
•
Updated
•
15.4M
•
145
•
1
Locutusque/deeplm-training-data
Viewer
•
Updated
•
2.17M
•
72
•
3
Viewer
•
Updated
•
936k
•
83.5k
•
291
bethgelab/CuratedThoughts
Viewer
•
Updated
•
222k
•
210
•
44
Viewer
•
Updated
•
700k
•
14.5k
•
131
alespalla/chatbot_instruction_prompts
Viewer
•
Updated
•
323k
•
1.15k
•
58
Viewer
•
Updated
•
437k
•
23k
•
397
EricLu/System-Prompt-Instruction-Real-world-Implementation-Training-set
Viewer
•
Updated
•
21.6k
•
114
•
10
data-is-better-together/10k_prompts_ranked
Viewer
•
Updated
•
10.3k
•
577
•
163
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
•
Updated
•
3.91M
•
4.63k
•
566
Viewer
•
Updated
•
99k
•
2.75k
•
76
Viewer
•
Updated
•
1.06M
•
55
•
14
Viewer
•
Updated
•
19.6M
•
14
Viewer
•
Updated
•
100k
•
136k
•
337