Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
Mu Cai's picture
7 11 3

Mu Cai

mucai
dark-pen's profile picture tolgacangoz's profile picture variante's profile picture
·
https://pages.cs.wisc.edu/~mucai/
  • MuCai7
  • mu-cai

AI & ML interests

Computer Vision, Deep Learning, 3D Vision, Vision and Language,

Recent Activity

upvoted a paper about 1 month ago
Relational Visual Similarity
upvoted a paper 2 months ago
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation
commented on a paper 2 months ago
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation
View all activity

Organizations

vgbench's profile picture CounterCurate's profile picture

authored a paper about 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16
authored 3 papers over 1 year ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required