1 7 4

Weixi Feng

weixifeng

https://weixi-feng.github.io

AI & ML interests

Vision and Language, Multimodality, Diffusion Models

Recent Activity

upvoted a paper 25 days ago

Complex Logical Instruction Generation

upvoted a paper 5 months ago

Describe Anything: Detailed Localized Image and Video Captioning

upvoted a paper 5 months ago

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

View all activity

Organizations

None yet

upvoted a paper 25 days ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 26 days ago • 39

upvoted 2 papers 5 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 63

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 25

liked a Space 11 months ago

187

T2V Turbo V2

🔥

Efficient T2V generation

upvoted a paper about 1 year ago

Make It Count: Text-to-Image Generation with an Accurate Number of Objects

Paper • 2406.10210 • Published Jun 14, 2024 • 79

authored a paper about 1 year ago

TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation

Paper • 2406.08656 • Published Jun 12, 2024 • 8

upvoted a paper about 1 year ago

TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation

Paper • 2406.08656 • Published Jun 12, 2024 • 8

commented a paper about 1 year ago

TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation

Paper • 2406.08656 • Published Jun 12, 2024 • 8 •

upvoted a paper about 1 year ago

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published May 29, 2024 • 22

authored a paper about 1 year ago

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 29

upvoted a paper about 1 year ago

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 29

liked a model about 1 year ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 12.6k • • 4.83k

authored 6 papers about 1 year ago

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Paper • 2305.15393 • Published May 24, 2023 • 1

VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View

Paper • 2307.06082 • Published Jul 12, 2023

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

Paper • 2212.05032 • Published Dec 9, 2022 • 1

Reward Guided Latent Consistency Distillation

Paper • 2403.11027 • Published Mar 16, 2024

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published May 29, 2024 • 22

Neuro-Symbolic Procedural Planning with Commonsense Prompting

Paper • 2206.02928 • Published Jun 6, 2022

liked a model about 1 year ago

jiachenli-ucsb/T2V-Turbo-VC2

Text-to-Video • Updated Jun 1, 2024 • 23

liked a Space over 1 year ago

11.4k

Stable Diffusion 2-1

🔥

Generate images from text prompts

Weixi Feng

AI & ML interests

Recent Activity

Organizations

weixifeng's activity

T2V Turbo V2

Stable Diffusion 2-1