Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Paper • 2508.02558 • Published Aug 4 • 10
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs Paper • 2506.14429 • Published Jun 17 • 45
Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache Paper • 2506.11886 • Published Jun 13 • 21
LongWanjuan: Towards Systematic Measurement for Long Text Quality Paper • 2402.13583 • Published Feb 21, 2024 • 1
DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels Paper • 2409.02465 • Published Sep 4, 2024 • 1
VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper • 2502.05173 • Published Feb 7 • 66
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way Paper • 2312.00407 • Published Dec 1, 2023 • 3
Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope Paper • 2407.15176 • Published Jul 21, 2024 • 3