Collections

Collections including paper arxiv:2508.18255

Hermes 4 Collection

Hermes 4 Technical Report

Paper • 2508.18255 • Published 12 days ago • 34
NousResearch/Hermes-4-14B

Text Generation • 0.0B • Updated 3 days ago • 2.24k • 67
NousResearch/Hermes-4-14B-FP8

Text Generation • 15B • Updated 3 days ago • 1.13k • 5
NousResearch/Hermes-4-405B-FP8

Text Generation • 406B • Updated 4 days ago • 2.23k • 16

Hermes 4 Technical Report

Paper • 2508.18255 • Published 12 days ago • 34

RL+reason model

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28
Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 31
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Paper • 2412.12098 • Published Dec 16, 2024 • 5

Collection of LLMs

microsoft/phi-1_5

Text Generation • 1B • Updated Apr 29, 2024 • 73k • 1.34k
mistralai/Mistral-7B-v0.1

Text Generation • 7B • Updated Jul 24 • 366k • 3.95k
meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 509 • 4.39k
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 175k • 1.76k

Snowflake/Arctic-Text2SQL-R1-7B

8B • Updated May 29 • 5.87k • 42
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 271
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 260
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 126

Hermes 4 Technical Report

Paper • 2508.18255 • Published 12 days ago • 34

Large Language Model (LLM) and NLP related papers.

LoRA+: Efficient Low Rank Adaptation of Large Models

Paper • 2402.12354 • Published Feb 19, 2024 • 6
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 23
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Paper • 2402.13249 • Published Feb 20, 2024 • 13
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 70
