Arunkumar Venkataramanan's picture

76 170

Arunkumar Venkataramanan

ArunkumarVR

·

https://arunkumarramanan.github.io

AI & ML interests

AGI Research: Reasoning, Safety & Alignment (Superalignment), Generative AI (GenAI), Multi-Modal Foundation Models (FMs), Large Language Models (LLMs), Transformers & Diffusion Models, Open LLM Training, Optimization & Finetuning, Serving & Inference

Recent Activity

liked a model 3 days ago

perplexity-ai/r1-1776

updated a model 3 days ago

DeepBrainz/DeepBrainz-0.5B-v0.01

liked a model 3 days ago

Qwen/Qwen2.5-0.5B-Instruct-GGUF

View all activity

Organizations

ArunkumarVR's activity

upvoted a paper 8 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 16 days ago • 114

upvoted 3 collections 12 days ago

RLHFlow MATH Process Reward Model

This is a collection of datasets and models of process reward modeling. • 15 items • Updated Nov 9, 2024 • 10

Skywork-o1-Open

Skywork o1 open model collections • 3 items • Updated Nov 27, 2024 • 21

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 75

upvoted an article 12 days ago

Article

Open R1: Update #2

By

and 6 others •

13 days ago

• 184

upvoted 2 collections 21 days ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 117

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 21 days ago • 55

upvoted an article 21 days ago

Article

Open-R1: Update #1

By

and 7 others •

22 days ago

• 286

upvoted an article 22 days ago

Article

How to deploy and fine-tune DeepSeek models on AWS

25 days ago

• 46

upvoted a paper 23 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 24 days ago • 27

upvoted 2 collections 23 days ago

IndicBERT v2

IndicBERT v2 is a multilingual BERT model pretrained on IndicCorp v2, an Indic monolingual corpus of 20.9 billion tokens, covering 24 consitutionally • 4 items • Updated Oct 15, 2024 • 3

IndicLLMSuite

Largest Collections of Pretraining and Instruction Finetuning datasets for 22 Indic languages. • 4 items • Updated Nov 5, 2024 • 15

upvoted a collection 26 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 12 items • Updated 4 days ago • 79

upvoted an article 26 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 770

upvoted 6 collections 27 days ago

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 40

DeepSeek-Math

DeepSeek Math series • 4 items • Updated Aug 16, 2024 • 20

DeepSeekCoder-V2

6 items • Updated Sep 5, 2024 • 93

DeepSeek-V2

8 items • Updated Jan 3 • 27

DeepSeek-Prover

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16, 2024 • 26

DeepSeek-VL2

5 items • Updated 14 days ago • 69