MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens Paper • 2404.03413 • Published Apr 4, 2024 • 26
openai/clip-vit-large-patch14-336 Zero-Shot Image Classification • Updated Oct 4, 2022 • 4.55M • 227