- NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF
  Updated • 18.3k • 62
- OpenRLHF/Llama-3-8b-rlhf-100k
  Text Generation • Updated • 448 • 3
- robust-rlhf/Meta-Llama-3.1-8B-Instruct-bnb-4bit_ftjob-80cbcd764546
  Updated
- mobiuslabsgmbh/DeepSeek-R1-ReDistill-Llama3-8B-v1.1
  Text Generation • Updated • 350 • 9
ratthachat chatpatanasiri
Jung
AI & ML interests
Knowledge-intensive model, Common-sense learning, Life-long machine learning
Recent Activity
new activity 8 days ago: mobiuslabsgmbh/DeepSeek-R1-ReDistill-Llama3-8B-v1.1:Notebook for redistil
updated a collection 8 days ago: Reasoning LLMs
updated a collection 9 days ago: Reasoning LLMs
Collections (2)
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 105
- PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking
  Paper • 2410.12375 • Published • 3
- NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF
  Updated • 18.3k • 62