XING SUN

tedsun

https://www.sunxing.org/

AI & ML interests

LLM MLLM Agent

Recent Activity

authored a paper 29 days ago

ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation

authored a paper 29 days ago

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

authored a paper 29 days ago

Adaptive Dual Reasoner: Large Reasoning Models Can Think Efficiently by Hybrid Reasoning

View all activity

Organizations

None yet

authored 5 papers 29 days ago

ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation

Paper • 2509.12618 • Published Sep 16 • 1

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Paper • 2511.02347 • Published Nov 4 • 8

Adaptive Dual Reasoner: Large Reasoning Models Can Think Efficiently by Hybrid Reasoning

Paper • 2510.10207 • Published Oct 11

RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning

Paper • 2509.25958 • Published Sep 30

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25 • 26

upvoted a paper 30 days ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25 • 26

upvoted a paper about 2 months ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference

Paper • 2508.15881 • Published Aug 21 • 9

authored 2 papers about 2 months ago

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

Paper • 2303.17561 • Published Mar 30, 2023

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21 • 41

upvoted a paper about 2 months ago

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21 • 41

authored a paper 2 months ago

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Paper • 2510.09607 • Published Oct 10 • 2

upvoted 5 papers 2 months ago

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Paper • 2510.09607 • Published Oct 10 • 2

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2 • 16

Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

Paper • 2502.05177 • Published Feb 7 • 2

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 50

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Paper • 2505.03739 • Published May 6 • 9

upvoted a paper 3 months ago

CoDiEmb: A Collaborative yet Distinct Framework for Unified Representation Learning in Information Retrieval and Semantic Textual Similarity

Paper • 2508.11442 • Published Aug 15 • 3

authored 3 papers 3 months ago

XING SUN

AI & ML interests

Recent Activity

Organizations

tedsun's activity

🎉 Free Image Generator Now Available!