3 31 12

Tianyu Pang

P2333

https://p2333.github.io/

P2333

AI & ML interests

Machine Learning

Recent Activity

upvoted a paper 4 days ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

upvoted a paper 4 days ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a collection 16 days ago

Perception Encoder

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published 5 days ago • 76

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 6 days ago • 60

upvoted a collection 16 days ago

Perception Encoder

Collection

17 items • Updated Jul 11 • 67

upvoted a collection 17 days ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 16 days ago • 276

liked a dataset 26 days ago

TIGER-Lab/AceCode-V2-122K

Viewer • Updated 24 days ago • 123k • 161 • 4

liked a model 2 months ago

nvidia/AceReason-Nemotron-1.1-7B

Text Generation • 8B • Updated Jul 11 • 7.42k • • 56

upvoted a paper 3 months ago

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28 • 29

commented a paper 3 months ago

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28 • 29 •

upvoted 2 papers 3 months ago

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27 • 26

Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment

Paper • 2505.21494 • Published May 27 • 8

authored a paper 3 months ago

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 24

upvoted a paper 3 months ago

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 24

commented a paper 3 months ago

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 24 •

authored 2 papers 3 months ago

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

Paper • 2505.15141 • Published May 21 • 4

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 42

upvoted 2 papers 4 months ago

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

Paper • 2505.15141 • Published May 21 • 4

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 42

authored a paper 4 months ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

upvoted a paper 4 months ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

authored a paper 5 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21 • 47

Tianyu Pang

AI & ML interests

Recent Activity

Organizations

P2333's activity