Somshubra Majumdar

smajumdar94

smajumdar94's activity

upvoted 2 papers 4 days ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 6 days ago • 25

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 67

upvoted a paper 10 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 10 days ago • 31

upvoted an article 16 days ago

Open-source DeepResearch – Freeing our search agents

Article • 20 days ago • 1.08k

upvoted a paper 20 days ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 43

upvoted 2 papers about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 257

upvoted a paper 2 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 50

upvoted 3 papers 3 months ago

upvoted a collection 4 months ago

steiner-preview

Collection • Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 28

upvoted an article 4 months ago

Fixing Gradient Accumulation

Article • Oct 16, 2024 • 50

upvoted a paper 5 months ago

CursorCore: Assist Programming through Aligning Anything

Paper • 2410.07002 • Published Oct 9, 2024 • 13

upvoted 2 articles 5 months ago

Welcome, Gradio 5

Article • Oct 9, 2024 • 127

Accelerate 1.0.0

Article • Sep 13, 2024 • 52

upvoted a collection 5 months ago

Qwen2.5

Collection • Qwen2.5 language models, with pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 525

upvoted an article 5 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 223

upvoted a paper 6 months ago

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 34