sthio90's Collections

Multimodal-AI

updated 18 days ago

  • SpatialLM: Training Large Language Models for Structured Indoor Modeling

    Paper • 2506.07491 • Published Jun 9 • 49

  • Story2Board: A Training-Free Approach for Expressive Storyboard Generation

    Paper • 2508.09983 • Published 24 days ago • 67

  • Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

    Paper • 2503.01710 • Published Mar 3 • 6

  • HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

    Paper • 2507.21809 • Published Jul 29 • 124

  • Matrix-Game: Interactive World Foundation Model

    Paper • 2506.18701 • Published Jun 23 • 72

  • Qwen/Qwen-Image-Edit

    Image-to-Image • Updated 13 days ago • 120k • 1.68k