PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System Paper • 2410.02828 • Published Oct 1, 2024 • 1
Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle Paper • 2407.13833 • Published Jul 18, 2024 • 12
Fairlearn: Assessing and Improving Fairness of AI Systems Paper • 2303.16626 • Published Mar 29, 2023
A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications Paper • 2310.17750 • Published Oct 26, 2023 • 9