wongyukim's picture

wongyukim

wongyukim

·

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

upvoted a paper about 13 hours ago

Self-Improving World Modelling with Latent Actions

upvoted a paper about 13 hours ago

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

View all activity

Organizations

None yet

upvoted 11 papers about 13 hours ago

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Paper • 2602.03442 • Published 7 days ago • 19

Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published 4 days ago • 19

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

Paper • 2602.05940 • Published 4 days ago • 16

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Paper • 2602.06034 • Published 4 days ago • 8

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Paper • 2602.05547 • Published 5 days ago • 11

SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Paper • 2602.05975 • Published 4 days ago • 12

LatentMem: Customizing Latent Memory for Multi-Agent Systems

Paper • 2602.03036 • Published 7 days ago • 14

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published 4 days ago • 20

Reinforced Attention Learning

Paper • 2602.04884 • Published 5 days ago • 20

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published 5 days ago • 47

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Paper • 2602.02474 • Published 7 days ago • 51

upvoted 7 papers 4 days ago

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering

Paper • 2601.22859 • Published 11 days ago • 17

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published 7 days ago • 26

VLS: Steering Pretrained Robot Policies via Vision-Language Models

Paper • 2602.03973 • Published 6 days ago • 21

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published 6 days ago • 74

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 5 days ago • 88

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published 7 days ago • 114

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 5 days ago • 243

upvoted 2 papers 5 days ago

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Paper • 2602.03786 • Published 6 days ago • 83

ObjEmbed: Towards Universal Multimodal Object Embeddings

Paper • 2602.01753 • Published 8 days ago • 5