DPO Collection Various useful datasets for preference optimization • 16 items • Updated about 1 month ago • 4
MetaSC: Test-Time Safety Specification Optimization for Language Models Paper • 2502.07985 • Published 12 days ago • 3 • 2
Toxic Commons Collection Tools for de-toxifying public domain data, especially multilingual and historical text data and data with OCR errors. • 3 items • Updated Oct 31, 2024 • 6