Submitted by akhaliq 18 MotionGPT: Finetuned LLMs are General-Purpose Motion Generators · 10 authors 1
Submitted by akhaliq 8 Point-Cloud Completion with Pretrained Text-to-image Diffusion Models · 3 authors
Submitted by akhaliq 7 RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation · 39 authors 1
Submitted by akhaliq 7 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision · 8 authors 1
Submitted by akhaliq 7 Guiding Language Models of Code with Global Context using Monitors · 5 authors 2
Submitted by akhaliq 7 BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models · 11 authors
Submitted by akhaliq 6 Meta-Personalizing Vision-Language Models to Find Named Instances in Video · 5 authors