TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning Paper • 2508.20374 • Published 10 days ago • 21
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published 17 days ago • 44
WPO: Enhancing RLHF with Weighted Preference Optimization Paper • 2406.11827 • Published Jun 17, 2024 • 15
InFoBench: Evaluating Instruction Following Ability in Large Language Models Paper • 2401.03601 • Published Jan 7, 2024 • 7
OASum: Large-Scale Open Domain Aspect-based Summarization Paper • 2212.09233 • Published Dec 19, 2022 • 2
Scoring Sentence Singletons and Pairs for Abstractive Summarization Paper • 1906.00077 • Published May 31, 2019 • 2
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 Paper • 2305.14702 • Published May 24, 2023 • 1
Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention Paper • 2312.08618 • Published Dec 14, 2023 • 15
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models Paper • 2308.00304 • Published Aug 1, 2023 • 23