REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback
Paper
•
2505.06548
•
Published
•
30
Totally Free + Zero Barriers + No Login Required