Shikhar Singh

AxAI

axe--

AI & ML interests

Commonsense & Language Grounding

Recent Activity

liked a model 1 day ago

AIDC-AI/Ovis2-34B

liked a model 1 day ago

czczup/textnet-base

upvoted a paper 1 day ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

View all activity

Organizations

None yet

AxAI's activity

upvoted 4 papers 1 day ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 3 days ago • 65

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 3 days ago • 148

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 99

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 3 days ago • 87

upvoted a paper 3 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 4 days ago • 136

upvoted an article 4 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

5 days ago

• 53

upvoted 2 articles 10 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted an article 12 days ago

Article

Open R1: Update #2

and 6 others •

13 days ago

• 184

upvoted 2 papers about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 330

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 257

upvoted an article about 2 months ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

•

Aug 26, 2024

• 43

upvoted 8 papers 2 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

YOLO-World: Real-Time Open-Vocabulary Object Detection

Paper • 2401.17270 • Published Jan 30, 2024 • 36

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19, 2024 • 60