Running 1.37k 1.37k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
BounharAbdelaziz/Morocco-Darija-Sentence-Embedding-v0.1 Feature Extraction • Updated 3 days ago • 203 • 2
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 187
atlasia/bert-base-multilingual-uncased-bs-2048-lr-0.2-ep-20-wp-0.05-gacc-1-gnm-1.0-v0.1 Sentence Similarity • Updated 6 days ago
Enhancing Semantic Similarity Understanding in Arabic NLP with Nested Embedding Learning Paper • 2407.21139 • Published Jul 30, 2024 • 4
view article Article Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems By Navid-AI and 1 other • 14 days ago • 11
view article Article Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others • 13 days ago • 10
view article Article Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others • 13 days ago • 10
view post Post 2257 IBM released ibm-granite/granite-vision-3.1-2b-preview, a small vision LM with impressive performance on different tasks 😮🔥 it comes with transformers and vLLM support from the get-go 💗 you can run it in Colab T4, so I built a notebook to put it to test, find it here: https://github.com/merveenoyan/smol-vision/blob/main/inference_gists/IBM_Granite_Vision.ipynb See translation 🚀 5 5 + Reply
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 19 days ago • 190