SuryaBench Collection Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction • 8 items • Updated 2 days ago • 3
view article Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 13 days ago • 50
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 3 days ago • 32
NVIDIA Nemotron Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 3 items • Updated 2 days ago • 39
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 68
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 5 days ago • 195
Technical Report: Full-Stack Fine-Tuning for the Q Programming Language Paper • 2508.06813 • Published 12 days ago • 5
qqWen-Series Collection Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 6 items • Updated 14 days ago • 8
Aryabhata: An exam-focused language model for JEE Math Paper • 2508.08665 • Published 9 days ago • 16
👁️ LFM2-VL Collection LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 6 items • Updated 1 day ago • 31
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 16 days ago • 467
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 125
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks By nvidia and 4 others • 9 days ago • 60
LLMDet Collection See: https://github.com/huggingface/transformers/pull/37925 • 3 items • Updated Jun 26 • 3
MM Grounding DINO Collection See: https://github.com/huggingface/transformers/pull/37925 • 8 items • Updated Jun 26 • 4
view article Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 14 days ago • 69
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 14 days ago • 311