Lokendra Bairwa

lokendra-drizz

AI & ML interests

None yet

Recent Activity

updated a collection 6 days ago

upvoted a paper 6 days ago

Essential-Web v1.0: 24T tokens of organized web data

upvoted a paper 6 days ago

Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

Organizations

None yet

updated a collection 6 days ago

General

2 items • Updated 6 days ago

upvoted 5 papers 6 days ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 45

Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

Paper • 2508.15202 • Published 18 days ago • 4

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published 17 days ago • 132

CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning

Paper • 2508.15868 • Published 18 days ago • 3

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Paper • 2508.19229 • Published 12 days ago • 19

upvoted a paper about 1 month ago

Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning

Paper • 2507.22565 • Published Jul 30 • 9

updated a collection about 1 month ago

Prompt Optimization

1 item • Updated Jul 29

upvoted a paper about 1 month ago

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published Jul 25 • 26

upvoted a paper 2 months ago

Orthogonal Finetuning Made Scalable

Paper • 2506.19847 • Published Jun 24 • 7