Ruida WANG

RickyDeSkywalker

[email protected]

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published 14 days ago • 8

upvoted a paper 13 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published 14 days ago • 8

upvoted a paper about 2 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 120

upvoted 2 papers 4 months ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 26

commented on a paper 4 months ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 26

upvoted a paper 5 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23

updated 2 models 6 months ago

RickyDeSkywalker/TheoremLlama

Text Generation • 8B • Updated Aug 4, 2025 • 7

RickyDeSkywalker/LoT-Solver

7B • Updated Aug 4, 2025 • 2

upvoted 3 papers 6 months ago

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

Paper • 2506.07001 • Published Jun 8, 2025 • 4

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Paper • 2503.03205 • Published Mar 5, 2025 • 4

Diversity-Enhanced Reasoning for Subjective Questions

Paper • 2507.20187 • Published Jul 27, 2025 • 26

updated a model 8 months ago

RickyDeSkywalker/LoT-Solver-Godel

7B • Updated May 27, 2025 • 2 • 1

updated a dataset 8 months ago

RickyDeSkywalker/LoT-CorrectionData

Preview • Updated May 27, 2025 • 9

published a dataset 8 months ago

RickyDeSkywalker/LoT-CorrectionData

Preview • Updated May 27, 2025 • 9

liked a model 11 months ago

RickyDeSkywalker/LoT-Solver-Godel

7B • Updated May 27, 2025 • 2 • 1

published a model 11 months ago

RickyDeSkywalker/LoT-Solver-Godel

7B • Updated May 27, 2025 • 2 • 1

liked a model 11 months ago

kfdong/STP_model_Lean

Text Generation • 7B • Updated Mar 24, 2025 • 4 • 3

published a model 11 months ago

RickyDeSkywalker/LoT-Solver

7B • Updated Aug 4, 2025 • 2

liked a model 12 months ago

RickyDeSkywalker/TheoremLlama

Text Generation • 8B • Updated Aug 4, 2025 • 7
