Salman Rahman

salmannyu

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

salmannyu's activity

upvoted a paper 12 days ago

CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging

Paper • 2502.05664 • Published 15 days ago • 22

upvoted a paper 17 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

upvoted a paper 4 months ago

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation

Paper • 2411.00412 • Published Nov 1, 2024 • 10

upvoted 3 papers 10 months ago

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

Paper • 2404.12753 • Published Apr 19, 2024 • 43

Scaling Instructable Agents Across Many Simulated Worlds

Paper • 2404.10179 • Published Mar 13, 2024 • 28

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55

authored 2 papers 11 months ago

Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model

Paper • 2402.10965 • Published Feb 14, 2024

Understanding Disparities in Post Hoc Machine Learning Explanation

Paper • 2401.14539 • Published Jan 25, 2024

upvoted a paper 11 months ago

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 52