Generating Skyline Datasets for Data Science Models Paper • 2502.11262 • Published 7 days ago • 5 • 2
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective Paper • 2502.14296 • Published 3 days ago • 41 • 2
Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above Paper • 2502.14127 • Published 4 days ago • 2 • 2
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Paper • 2502.14866 • Published 3 days ago • 7 • 2
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published 3 days ago • 11 • 2
Unstructured Evidence Attribution for Long Context Query Focused Summarization Paper • 2502.14409 • Published 3 days ago • 3 • 2
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 3 days ago • 145 • 2
Generating $π$-Functional Molecules Using STGG+ with Active Learning Paper • 2502.14842 • Published 3 days ago • 3 • 2
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Paper • 2502.14802 • Published 3 days ago • 9 • 2
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 3 days ago • 62 • 8
Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information Paper • 2502.14258 • Published 3 days ago • 19 • 2
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data Paper • 2502.14044 • Published 4 days ago • 6 • 3
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC Paper • 2502.14282 • Published 3 days ago • 14 • 3
RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers Paper • 2502.14377 • Published 3 days ago • 10 • 2
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published 3 days ago • 13 • 2
Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework Paper • 2502.13759 • Published 4 days ago • 2 • 2