VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement Paper • 2312.04885 • Published Dec 8, 2023
UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations Paper • 2505.08787 • Published May 13