view article Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 7 days ago • 40
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning Paper • 2508.18966 • Published 13 days ago • 56
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 49
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 22 days ago • 54
qqWen-Series Collection Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 11 items • Updated 11 days ago • 10
view article Article Topic 23: What is LLM Inference, it's challenges and solutions for it By Kseniase • Jan 17 • 15
view article Article 🪆 Introduction to Matryoshka Embedding Models By tomaarsen and 2 others • Feb 23, 2024 • 161