Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding Paper • 2502.05609 • Published Feb 8 • 18
Benchmarking LLMs for Political Science: A United Nations Perspective Paper • 2502.14122 • Published Feb 19 • 2
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10 • 73