alkinun's picture

alkinun

AtAndDev

·

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

updated a dataset 20 days ago

SubliminalMisalignment/abliterated-distill-30k

published a dataset 20 days ago

SubliminalMisalignment/abliterated-distill-30k

updated a dataset 20 days ago

SubliminalMisalignment/safe-distill-30k

View all activity

Organizations

upvoted a collection 3 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 18 days ago • 242

upvoted an article 4 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11, 2025

•

35

upvoted a collection 4 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30, 2025 • 78

upvoted a changelog 4 months ago

Changelog

Emoji Autocomplete in Discussions and Posts

Sep 11, 2025

• 67

upvoted 2 papers 4 months ago

Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM

Paper • 2503.17793 • Published Mar 22, 2025 • 23

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11, 2025 • 43

upvoted a collection 4 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 320

upvoted an article 5 months ago

Article

Curation is All You Need

Aug 1, 2025

•

2

upvoted 2 collections 5 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 18 days ago • 101

👁️ LFM2-VL

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated 6 days ago • 63

upvoted 3 articles 5 months ago

Article

Fine Tuning Gemma 3 For Human Alignment

May 17, 2025

•

4

Article

AHA Leaderboard

Mar 30, 2025

•

4

Article

Introducing : 🤏🏻🏭SmolFactory

Aug 10, 2025

•

8

upvoted a paper 5 months ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 51

upvoted an article 5 months ago

Article

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

Aug 5, 2025

•

8

upvoted a collection 5 months ago

cool datasets

205 items • Updated 6 days ago • 19

upvoted 2 articles 5 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

Jul 29, 2025

•

206

Article

AutoBench Run 2 Results are Out! Surprise: Gemini 2.5 Pro is not the Best Affordable Thinking Model

Apr 29, 2025

•

6

upvoted 2 collections 6 months ago

JSON Mode Reasoning

A collection of structured outputs reasoning dataset • 3 items • Updated Jul 23, 2025 • 3

Tool Use Reasoning

A collection of tool use reasoning dataset in Hermes format • 5 items • Updated Jul 23, 2025 • 9