In a Training Loop 🔄

75 124 262

Asankhaya Sharma

codelion

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and Ellora. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

updated a model 5 days ago

codelion/dhara-70m

new activity 5 days ago

codelion/dhara-70m:1024 in max_position_embeddings

commented on their article 5 days ago

The Optimal Architecture for Small Language Models

View all activity

Organizations

upvoted an article 11 days ago

Article

The Optimal Architecture for Small Language Models

9 days ago

•

upvoted a paper 17 days ago

Universal Reasoning Model

Paper • 2512.14693 • Published 19 days ago • 40

upvoted an article about 1 month ago

Article

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

Dec 3, 2025

•

upvoted a paper about 1 month ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21, 2025 • 29

upvoted an article 2 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

upvoted an article 3 months ago

Article

Python Is All You Need? Introducing Dria-Agent-α

Jan 10, 2025

•

upvoted a collection 3 months ago

Dhara Foundational Models

Collection

Diffusion Language Models combining deep narrow networks, Canon layers (depthwise causal convolutions), and WSD (Warmup-Stable-Decay) training. • 1 item • Updated 9 days ago • 2

upvoted a paper 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

upvoted an article 3 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9, 2025

•

upvoted a collection 4 months ago

Mem-Agent

Collection

Small sized agents from Dria trained on interacting with an obsidian-like memory system using python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5, 2025 • 4

upvoted a paper 4 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

upvoted an article 4 months ago

Article

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Sep 11, 2025

•

upvoted a collection 5 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 12 days ago • 87

upvoted an article 5 months ago

Article

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

Aug 9, 2025

•

upvoted 2 papers 5 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 129

upvoted 2 articles 5 months ago

Article

Towards Open Evolutionary Agents

Aug 4, 2025

•

Article

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

Aug 3, 2025

•

upvoted a collection 5 months ago

GLM-4.5

Collection

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11, 2025 • 252

upvoted a paper 5 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320

Asankhaya Sharma

AI & ML interests

Recent Activity

Organizations

codelion's activity

The Optimal Architecture for Small Language Models

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Python Is All You Need? Introducing Dria-Agent-α

mem-agent: Equipping LLM Agents with Memory Using RL

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

Towards Open Evolutionary Agents

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

🎉 Free Image Generator Now Available!