Xiaotian Han

xiaotianhan

https://hanxiaotian.github.io/

hanxiaotian

AI & ML interests

Multimodal LLM

Recent Activity

authored a paper 10 days ago

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

Paper • 2508.05731 • Published 13 days ago • 25

upvoted a paper 10 days ago

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

Paper • 2508.05731 • Published 13 days ago • 25

New activity in Infi-MM/InfiMM-WebMath-40B 26 days ago

Update metadata: Add `library_name`

#5 opened 26 days ago by

nielsr

New activity in Infi-MM/InfiMM-WebMath-40B about 1 month ago

Update task category, add tags, and include survey paper link for InfiMM-WebMath-40B

#4 opened about 1 month ago by

nielsr

authored a paper 4 months ago

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Paper • 2504.14239 • Published Apr 19 • 14

upvoted a paper 4 months ago

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Paper • 2504.14239 • Published Apr 19 • 14

authored 2 papers 6 months ago

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Paper • 2502.11573 • Published Feb 17 • 8

Thinking Preference Optimization

Paper • 2502.13173 • Published Feb 17 • 17

authored 2 papers 7 months ago

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model

Paper • 2405.17815 • Published May 28, 2024

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Paper • 2501.04575 • Published Jan 8 • 24

upvoted a paper 7 months ago

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Paper • 2501.04575 • Published Jan 8 • 24

upvoted a paper 9 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 65

authored a paper 10 months ago

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Paper • 2410.18666 • Published Oct 24, 2024 • 19

liked a model 10 months ago

shallowdream204/DreamClear

Updated Oct 26, 2024 • 7 • 22

upvoted a paper 10 months ago

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Paper • 2410.18666 • Published Oct 24, 2024 • 19

commented a paper 10 months ago

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Paper • 2410.18666 • Published Oct 24, 2024 • 19

updated a dataset 11 months ago

Infi-MM/InfiMM-WebMath-40B

Viewer • Updated 26 days ago • 22.8M • 1.34k • 68

liked a Space 11 months ago

Dailypapershackernews

📈

authored a paper 11 months ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 51

New activity in Infi-MM/InfiMM-WebMath-40B 11 months ago

[bot] Conversion to Parquet

#1 opened 11 months ago by

parquet-converter
