
Hyunwoo Ko

Cartinoe5930

https://cartinoe5930.tistory.com/

AI & ML interests

NLP (LLM)

Recent Activity

updated a dataset about 15 hours ago

Cartinoe5930/sample_verifiable_math

published a dataset about 15 hours ago

Cartinoe5930/sample_verifiable_math

liked a dataset about 16 hours ago

OLAIR/Open-R1-Ko-SFT-v2.0



Cartinoe5930's activity

upvoted a paper 3 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 3 days ago • 32

upvoted an article 13 days ago

Open R1: Update #2

Article • and 6 others • 13 days ago • 184

upvoted 2 papers 20 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 23 days ago • 105

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 20 days ago • 54

upvoted a collection about 1 month ago

DeepSeek-R1

Collection • 8 items • Updated Jan 21 • 528

upvoted 2 papers about 1 month ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 91

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 257

upvoted an article 3 months ago

Releasing QwQ-LongCoT-130K

Article • Dec 5, 2024 • 9

upvoted 2 articles 4 months ago

Navigating Korean LLM Research #2: Evaluation Tools

Article • Oct 23, 2024 • 7

Navigating Korean LLM Research #1: Models

Article • Oct 22, 2024 • 24

upvoted a paper 5 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 137

upvoted an article 6 months ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Article • Aug 19, 2024 • 77

upvoted a paper 6 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 118

upvoted 2 papers 7 months ago

EXAONE 3.0 7.8B Instruction Tuned Language Model

Paper • 2408.03541 • Published Aug 7, 2024 • 35

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23, 2024 • 22

upvoted an article 7 months ago

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Article • Jul 23, 2024 • 227

upvoted a paper 7 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 50

upvoted an article 7 months ago

The Rise of Agentic Data Generation

Article • Jul 15, 2024 • 81

upvoted an article 8 months ago

🪆 Introduction to Matryoshka Embedding Models

Article • Feb 23, 2024 • 77

upvoted a collection 8 months ago

🔍 Interpretability & Analysis of LMs

Collection • Outstanding research in LM interpretability and evaluation, summarized • 102 items • Updated 7 days ago • 97