From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 3 days ago • 32
Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 9 days ago • 11
LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025 By kikikita and 1 other • 24 days ago • 17
How to Train Your LLM Web Agent: A Statistical Diagnosis By ppEmiliano • Jul 8 • 14
Prefill and Decode for Concurrent Requests - Optimizing LLM Performance By tngtech • Apr 16 • 34
How Long Prompts Block Other Requests - Optimizing LLM Performance By tngtech • Jun 12 • 5
What's Software 3.0? (Spoiler: You're Already Using It) By fdaudens • Jun 19 • 2