NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11 • 79
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Paper • 2506.03295 • Published Jun 3 • 17
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper • 2505.10610 • Published May 15 • 54
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper • 2505.10610 • Published May 15 • 54
view article Article 🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI? By Kseniase • Dec 25, 2024 • 16
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published Mar 4 • 10
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published Mar 4 • 10
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7
Pre-Trianing Data Packing Collection [ACL'24] Analysing the Impact of Sequence Composition on Language Model Pre-Training. https://github.com/yuzhaouoe/pretraining-data-packing • 10 items • Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering • 5 items • Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering • 5 items • Updated Mar 3