MTEB Leaderboard 🥇 Select benchmarks and languages for text embeddings evaluation • Running on CPU Upgrade • 4.87k
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 1.4k
Article • Model2Vec: Distill a Small Fast Model from any Sentence Transformer • By Pringled and 1 other • Oct 14, 2024 • 72
hanspeterlyngsoeraaschoujensen/week41_train_en_input_output Viewer • Updated Sep 24, 2024 • 6.41k • 59
hanspeterlyngsoeraaschoujensen/deberta-v3-base-finetuned-nlp-course Question Answering • Updated Sep 23, 2024 • 100
hanspeterlyngsoeraaschoujensen/distilbert-base-uncased-finetuned-nlp-course Question Answering • Updated Sep 23, 2024 • 107
hanspeterlyngsoeraaschoujensen/mt5-base-finetuned-nlp-course Question Answering • Updated Sep 21, 2024 • 37
Llama 3.1 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.1 models, including the configurations, • 6 items • Updated Dec 6, 2024 • 15
FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training • Running • 774