Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published 5 days ago • 21
Running 1.06k 1.06k FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 26 days ago • 227
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published 16 days ago • 44
GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Paper • 2508.02831 • Published Aug 4 • 11
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published Jul 24 • 39