7 9 8

Dmitrii Stoianov

heylimon

DimaStoyanov

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

System Message Generation for User Preferences using Open-Source Models

upvoted a paper 16 days ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

upvoted a paper 19 days ago

How to Synthesize Text Data without Model Collapse?

View all activity

Organizations

heylimon's activity

upvoted a paper 3 days ago

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published 7 days ago • 15

upvoted a paper 16 days ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published 18 days ago • 55

upvoted a paper 19 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 50

upvoted a collection 19 days ago

RL/Alignment

Collection

197 items • Updated Jun 18, 2024 • 25

upvoted a paper 19 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 20 days ago • 111

New activity in t-tech/T-lite-it-1.0 2 months ago

Как запустить квантованную модель?

#6 opened 2 months ago by

Without69

liked 2 datasets 2 months ago

google/trueteacher

Viewer • Updated Dec 26, 2023 • 1.38M • 103 • 20

lytang/LLM-AggreFact

Viewer • Updated Dec 20, 2024 • 59.7k • 1.24k • 21

upvoted a paper 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 107

commented a paper 2 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 107 •

New activity in t-tech/T-lite-it-1.0 2 months ago

Special Tokens

#4 opened 2 months ago by

strangelex42

New activity in t-tech/T-pro-it-1.0 2 months ago

Context size

#5 opened 2 months ago by

deksden

New activity in AnatoliiPotapov/T-lite-instruct-0.1 2 months ago

Fix ignored 'add_generation_prompt' in the chat template

#12 opened 6 months ago by

heylimon

liked a dataset 2 months ago

O1-OPEN/OpenO1-SFT

Viewer • Updated Dec 17, 2024 • 77.7k • 1.37k • 356

liked 2 datasets 4 months ago

nyuuzyou/EMERCOM-questions

Viewer • Updated Feb 23, 2024 • 25.7k • 57 • 1

nyuuzyou/9111-questions

Preview • Updated Feb 19, 2024 • 36 • 6

upvoted a paper 7 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 89

commented a paper 7 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 89 •

New activity in AnatoliiPotapov/T-lite-0.1 7 months ago

Дальнейшее дообучение

#1 opened 7 months ago by

alamacra

liked a Space 9 months ago

772

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training