Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published Oct 24, 2024 • 18
Open-Vocabulary Argument Role Prediction for Event Extraction Paper • 2211.01577 • Published Nov 3, 2022
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses Paper • 2408.08978 • Published Aug 16, 2024
The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang Paper • 2509.00425 • Published 8 days ago • 10
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation Paper • 2307.04018 • Published Jul 8, 2023
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3, 2025 • 42
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective Paper • 2310.11451 • Published Oct 17, 2023
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published Sep 30, 2024 • 55
QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization Paper • 2104.05938 • Published Apr 13, 2021 • 1
Towards a Unified Multi-Dimensional Evaluator for Text Generation Paper • 2210.07197 • Published Oct 13, 2022
DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization Paper • 2109.02492 • Published Sep 6, 2021