Post Training Aligning Instruction Tuning with Pre-training Paper • 2501.09368 • Published Jan 16, 2025 Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21, 2024 Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21, 2024
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
LLM and Reasoning Papers Papers dump of LLM Reasoning domain Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46 Large Language Models are Zero-Shot Reasoners Paper • 2205.11916 • Published May 24, 2022 • 3 Let's Verify Step by Step Paper • 2305.20050 • Published May 31, 2023 • 11 Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Paper • 2201.11903 • Published Jan 28, 2022 • 15
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Paper • 2201.11903 • Published Jan 28, 2022 • 15
Post Training Aligning Instruction Tuning with Pre-training Paper • 2501.09368 • Published Jan 16, 2025 Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21, 2024 Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey Paper • 2403.14608 • Published Mar 21, 2024
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
LLM and Reasoning Papers Papers dump of LLM Reasoning domain Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46 Large Language Models are Zero-Shot Reasoners Paper • 2205.11916 • Published May 24, 2022 • 3 Let's Verify Step by Step Paper • 2305.20050 • Published May 31, 2023 • 11 Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Paper • 2201.11903 • Published Jan 28, 2022 • 15
Internal Consistency and Self-Feedback in Large Language Models: A Survey Paper • 2407.14507 • Published Jul 19, 2024 • 46
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Paper • 2201.11903 • Published Jan 28, 2022 • 15