D-YZ's Collections

Paper

updated Jun 5, 2024

  • OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

    Paper • 2402.14658 • Published Feb 22, 2024 • 84

  • KAN: Kolmogorov-Arnold Networks

    Paper • 2404.19756 • Published Apr 30, 2024 • 114

  • Understanding the performance gap between online and offline alignment algorithms

    Paper • 2405.08448 • Published May 14, 2024 • 20

  • NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

    Paper • 2405.17428 • Published May 27, 2024 • 20

  • 2BP: 2-Stage Backpropagation

    Paper • 2405.18047 • Published May 28, 2024 • 27

  • VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections

    Paper • 2405.17991 • Published May 28, 2024 • 14

  • Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

    Paper • 2406.00888 • Published Jun 2, 2024 • 34

  • Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning

    Paper • 2406.00392 • Published Jun 1, 2024 • 14