SAMBIT CHAKRABORTY
sambitchakhf03
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth
Approach
upvoted
a
paper
14 days ago
Demystifying Long Chain-of-Thought Reasoning in LLMs
Organizations
Collections
5
-
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU
Paper • 2403.06504 • Published • 53 -
Stealing Part of a Production Language Model
Paper • 2403.06634 • Published • 91 -
MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
Paper • 2406.14909 • Published • 15
models
2
datasets
None public yet