Pritam Sarkar

pritamqu

AI & ML interests

multimodal learning with vision, language, and audio; generative modeling; large multimodal models (LMMs); multimodal LLMs (MLLMs); AI agents; alignment; representation learning; self-supervised and unsupervised learning; vision-language models; audio-visual models; foundation models; computer vision

Recent Activity

liked a dataset about 1 month ago
WHB139426/Grounded-VideoLLM
updated a dataset 11 months ago
pritamqu/VCRBench

Organizations

None yet