Fine-grained Hallucination Detection and Editing for Language Models Paper • 2401.06855 • Published Jan 12, 2024 • 4
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated 17 days ago • 116
LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with Large Language Models Paper • 2305.13711 • Published May 23, 2023 • 2
Estimating Model Performance Under Covariate Shift Without Labels Paper • 2401.08348 • Published Jan 16, 2024 • 1