Direct Language Model Alignment from Online AI Feedback Paper • 2402.04792 • Published Feb 7, 2024 • 34
Binary Classifier Optimization for Large Language Model Alignment Paper • 2404.04656 • Published Apr 6, 2024 • 2