Bone: Block Affine Transformation as Parameter Efficient Fine-tuning Methods for Large Language Models Paper • 2409.15371 • Published Sep 19, 2024
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023