Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes Paper • 2507.12261 • Published Jul 16 • 1
Table Understanding and (Multimodal) LLMs: A Cross-Domain Case Study on Scientific vs. Non-Scientific Data Paper • 2507.00152 • Published Jun 30 • 1
Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework Paper • 2506.15538 • Published Jun 18 • 1
ELI-Why Collection 🧠 ELI-Why: Evaluating the Pedagogical Utility of Language Model Explanations ACL Findings 2025 • 4 items • Updated Jun 11 • 3
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability Paper • 2505.13963 • Published May 20 • 1
Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals Paper • 2505.13972 • Published May 20 • 1
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods Paper • 2505.01198 • Published May 2 • 2
Do Large Language Models Latently Perform Multi-Hop Reasoning? Paper • 2402.16837 • Published Feb 26, 2024 • 30
Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published Jan 14 • 11
view article Article What We Learned About LLM/VLMs in Healthcare AI Evaluation: By shanchen • Nov 8, 2024 • 13
A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 10
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 126 items • Updated 7 days ago • 111