Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
D-YZ 's Collections
waiting
Reasoning
Paper
Multimodal
RL
Model Architecture

Reasoning

updated Jul 15, 2024
Upvote
-

  • Chain-of-Thought Reasoning Without Prompting

    Paper • 2402.10200 • Published Feb 15, 2024 • 110

  • Teaching Large Language Models to Reason with Reinforcement Learning

    Paper • 2403.04642 • Published Mar 7, 2024 • 51

  • PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Paper • 2403.10704 • Published Mar 15, 2024 • 60

  • MathScale: Scaling Instruction Tuning for Mathematical Reasoning

    Paper • 2403.02884 • Published Mar 5, 2024 • 17

  • Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

    Paper • 2404.02575 • Published Apr 3, 2024 • 51

  • Advancing LLM Reasoning Generalists with Preference Trees

    Paper • 2404.02078 • Published Apr 2, 2024 • 47

  • Iterative Reasoning Preference Optimization

    Paper • 2404.19733 • Published Apr 30, 2024 • 50

  • ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

    Paper • 2405.09220 • Published May 15, 2024 • 29

  • LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

    Paper • 2405.18377 • Published May 28, 2024 • 21

  • Towards Building Specialized Generalist AI with System 1 and System 2 Fusion

    Paper • 2407.08642 • Published Jul 11, 2024 • 11
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略