6 38 7

Harold Chen

Harold328

https://haroldchen19.github.io/

HaroldChen19

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 1 day ago

Plenoptic Video Generation

upvoted a paper 1 day ago

Choreographing a World of Dynamic Objects

upvoted a paper 3 days ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

View all activity

Organizations

None yet

authored a paper 24 days ago

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Paper • 2512.14442 • Published 25 days ago • 10

authored a paper about 1 month ago

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published Nov 28, 2025 • 43

authored a paper about 2 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 42

authored 2 papers 3 months ago

Go with Your Gut: Scaling Confidence for Autoregressive Image Generation

Paper • 2509.26376 • Published Sep 30, 2025 • 9

FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning

Paper • 2509.11796 • Published Sep 15, 2025

authored a paper 5 months ago

Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation

Paper • 2508.10858 • Published Aug 14, 2025

authored a paper 8 months ago

FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance

Paper • 2505.13437 • Published May 19, 2025 • 6

authored a paper 9 months ago

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published Apr 17, 2025 • 20

authored 2 papers 10 months ago

Temporal Regularization Makes Your Video Generator Stronger

Paper • 2503.15417 • Published Mar 19, 2025 • 22

LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

Paper • 2503.08619 • Published Mar 11, 2025 • 20

authored 7 papers about 1 year ago

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization

Paper • 2501.01245 • Published Jan 2, 2025 • 5

Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding

Paper • 2408.16272 • Published Aug 29, 2024

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Paper • 2310.18340 • Published Oct 22, 2023

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

Paper • 2404.09640 • Published Apr 15, 2024

OmniCreator: Self-Supervised Unified Generation with Universal Editing

Paper • 2412.02114 • Published Dec 3, 2024 • 14

GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting

Paper • 2405.07472 • Published May 13, 2024

FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs

Paper • 2407.02157 • Published Jul 2, 2024

Harold Chen

AI & ML interests

Recent Activity

Organizations

Harold328's activity

🎉 Free Image Generator Now Available!