neucodec Collection We introduce NeuCodec, a 0.8kbps audio codec that outputs audio at 24kHz. • 4 items • Updated 8 days ago • 1
NVIDIA Nemotron Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 3 items • Updated 3 days ago • 39
LLMDet Collection See: https://github.com/huggingface/transformers/pull/37925 • 3 items • Updated Jun 26 • 3
Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware By RakshitAralimatti • 13 days ago • 14
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24 • 32
Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel By smangrul and 1 other • May 2, 2022 • 8
Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 29
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 14 days ago • 311
SmolDocling datasets Collection Datasets used to train SmolDocling • 6 items • Updated 21 days ago • 28
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 23 days ago • 156
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes Paper • 2507.11407 • Published Jul 15 • 54
KMMLU Redux & Pro Collection A Professional Korean Benchmark Suite for LLM Evaluation • 2 items • Updated Jul 15 • 6
OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 7 days ago • 41