dnk

dnkdnk

AI & ML interests

None yet

Organizations

None yet

upvoted a paper about 2 months ago

Subject-Consistent and Pose-Diverse Text-to-Image Generation

Paper • 2507.08396 • Published Jul 11 • 15

liked a dataset 3 months ago

ontocord/MixtureVitae-VALID

Updated Apr 26 • 2.72k • 16

upvoted a paper 3 months ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

Paper • 2506.01674 • Published Jun 2 • 28

updated a dataset 5 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 80 • 5

liked a dataset 5 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 80 • 5

upvoted a paper 5 months ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30 • 95

published a dataset 5 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 80 • 5

New activity in openai/clip-vit-large-patch14 7 months ago

OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it...

👀 2

#31 opened about 1 year ago by dnkdnk

upvoted a paper 8 months ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6 • 56

upvoted 3 papers 9 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 98

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Paper • 2412.09283 • Published Dec 12, 2024 • 19

liked a dataset 9 months ago

AnonMegumi/InstanceVid

Preview • Updated Dec 16, 2024 • 72 • 4

liked a Space 10 months ago

RAG Demo

👀

Generate detailed images from prompts and layouts

liked a model 10 months ago

black-forest-labs/FLUX.1-Canny-dev

Text-to-Image • Updated Jun 27 • 7.95k • 222

upvoted a paper 10 months ago

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 37

New activity in openai/clip-vit-large-patch14 about 1 year ago

OSError: It looks like the config file is not a valid JSON file.

👍 2

#2 opened almost 3 years ago by xvjiarui

liked 2 Spaces about 1 year ago

440

Open Sora

⚡

333

MLLM-guided Image Editing (MGIE)

👩

Transform images based on textual instructions

liked a model about 1 year ago

tsujuifu/ml-mgie

Updated Feb 9, 2024 • 22
