126 79 1977

Nicky

NickyNicky

AI & ML interests

None yet

Recent Activity

liked a dataset 2 days ago

mlabonne/natural_reasoning-formatted

liked a model 2 days ago

google/siglip2-base-patch16-512

liked a Space 3 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

NickyNicky's activity

upvoted a paper 4 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 5 days ago • 42

upvoted an article 13 days ago

Article

Open R1: Update #2

and 6 others •

13 days ago

• 184

upvoted an article 17 days ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

•

Sep 27, 2024

• 40

upvoted a paper 17 days ago

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Paper • 2305.17679 • Published May 28, 2023 • 2

upvoted an article 17 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted an article 18 days ago

Article

Welcome to Inference Providers on the Hub 🔥

27 days ago

• 384

upvoted an article 21 days ago

Article

The AI tools for Art Newsletter - Issue 1

24 days ago

• 64

upvoted an article 26 days ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

• 39

upvoted 3 articles 27 days ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

•

30 days ago

• 12

Article

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

•

Jan 23

• 2

Article

We now support VLMs in smolagents!

about 1 month ago

• 84

upvoted a collection about 1 month ago

ProLIP

Collection

Official ProLIP weights • 4 items • Updated Dec 9, 2024 • 6

upvoted 6 articles about 1 month ago

Article

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

•

Dec 13, 2024

• 4

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

•

Jan 19

• 13

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

Jan 20

• 36

Article

Yay! Organizations can now publish blog Articles

and 3 others •

Jan 20

• 34

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

Jan 23

• 63

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

Jan 20

• 61

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 329

upvoted an article about 1 month ago

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Dec 23, 2024

• 39