SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence Paper • 2505.12703 • Published May 19 • 1
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published Nov 5, 2024 • 71
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata Paper • 2406.13213 • Published Jun 19, 2024 • 1
GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning Paper • 2506.00785 • Published Jun 1 • 2
RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings Paper • 2502.19781 • Published Feb 27 • 1
3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework Paper • 2512.17459 • Published 9 days ago • 11
Running on CPU Upgrade 267 Omni Image Editor 🖼 267 Image edit, text to image, face swap, image upscale
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 19 days ago • 111
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 18 days ago • 46
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 21
ScribbleLight: Single Image Indoor Relighting with Scribbles Paper • 2411.17696 • Published Nov 26, 2024 • 1
Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View Paper • 2507.21371 • Published Jul 28 • 2
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published Feb 24 • 13
A multi-view contrastive learning framework for spatial embeddings in risk modelling Paper • 2511.17954 • Published Nov 22 • 1
LoFi: Vision-Aided Label Generator for Wi-Fi Localization and Tracking Paper • 2412.05074 • Published Dec 6, 2024 • 1