GROOT: Learning to Follow Instructions by Watching Gameplay Videos Paper • 2310.08235 • Published Oct 12, 2023 • 1
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models Paper • 2311.05997 • Published Nov 10, 2023 • 37
Image Inpainting via Tractable Steering of Diffusion Models Paper • 2401.03349 • Published Nov 28, 2023
ProAgent: Building Proactive Cooperative AI with Large Language Models Paper • 2308.11339 • Published Aug 22, 2023
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation Paper • 2403.05313 • Published Mar 8, 2024 • 9
Scaling Up Probabilistic Circuits by Latent Variable Distillation Paper • 2210.04398 • Published Oct 10, 2022 • 2
Sparse Probabilistic Circuits via Pruning and Growing Paper • 2211.12551 • Published Nov 22, 2022 • 2
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents Paper • 2302.01560 • Published Feb 3, 2023 • 1
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction Paper • 2301.10034 • Published Jan 21, 2023
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Paper • 2407.00114 • Published Jun 27, 2024 • 13
Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits Paper • 2302.08086 • Published Feb 16, 2023
Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households Paper • 2404.09001 • Published Apr 13, 2024
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published Oct 23, 2024 • 52