7 15 7

Fang Wu

fangwu97

https://smiles724.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

upvoted a paper 2 months ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

View all activity

Organizations

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108

liked a Space 2 months ago

The Smol Training Playbook

📚

2.77k

The secrets to building world-class LLMs

upvoted a paper 2 months ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

Paper • 2510.20976 • Published Oct 23, 2025 • 2

commented a paper 2 months ago

L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks

Paper • 2510.20976 • Published Oct 23, 2025 • 2 •

liked a model 2 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 3.47M • 3.03k

updated a model 2 months ago

fangwu97/DeepSearch-1.5B

Text Generation • 2B • Updated Oct 20, 2025 • 22 • 8

New activity in fangwu97/DeepSearch-1.5B 2 months ago

Could you share the training code?

#2 opened 3 months ago by

sunshin5

liked a dataset 2 months ago

ethan1115/protein_combined

Preview • Updated Oct 21, 2025 • 1 • 1

upvoted 2 papers 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 106

updated a dataset 3 months ago

fangwu97/ProteinData

Viewer • Updated Oct 8, 2025 • 219k • 13

published a dataset 3 months ago

fangwu97/ProteinData

Viewer • Updated Oct 8, 2025 • 219k • 13

New activity in fangwu97/DeepSearch-1.5B 3 months ago

Add pipeline tag and hyperlink paper in model card

#1 opened 3 months ago by

nielsr

upvoted a paper 3 months ago

Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning

Paper • 2509.11420 • Published Sep 14, 2025 • 2

liked a dataset 3 months ago

chao1224/ProteinDT

Updated Jul 7, 2024 • 164 • 3

authored 5 papers 3 months ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6, 2025 • 161

Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts

Paper • 2509.23188 • Published Sep 27, 2025 • 3

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27, 2025 • 62

Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning

Paper • 2509.11420 • Published Sep 14, 2025 • 2

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

Fang Wu

AI & ML interests

Recent Activity

Organizations

fangwu97's activity

The Smol Training Playbook

Could you share the training code?

Add pipeline tag and hyperlink paper in model card

🎉 Free Image Generator Now Available!