Shubham Toshniwal

stoshniwal

AI & ML interests

NLP, LLM

Recent Activity

Organizations

NVIDIA's profile picture

stoshniwal's activity

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B about 1 month ago

Tokenizer config is wrong

8
#10 opened about 1 month ago by
stoshniwal
upvoted an article 4 months ago
view article
Article

Fixing Gradient Accumulation

50
New activity in nvidia/OpenMathInstruct-2 4 months ago

Upload scaling_plot.jpg

#4 opened 4 months ago by
shtoshni