VL-Thinking-Data PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 17 • 1
PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 17 • 1
MLLMDataV1 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 28 • 3 pengshuai-rin/multimath-300k Viewer • Updated Aug 20, 2024 • 1.19M • 155 • 11 OpenFace-CQUPT/HumanCaption-HQ-311K Viewer • Updated Jun 9, 2025 • 313k • 76 • 17 remyxai/vqasynth_spacellava Viewer • Updated Oct 24, 2024 • 28k • 58 • 14
MLLM-08 Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 28 • 3
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133
MLLMDataV2 DAMO-NLP-SG/multimodal_textbook Updated Mar 17, 2025 • 1.36k • 159 zwq2018/Multi-modal-Self-instruct Viewer • Updated Jan 27, 2025 • 76k • 390 • 31 taesiri/GameplayCaptions-Gemini-pro-vision Viewer • Updated Apr 7, 2024 • 70.7k • 140 • 6 5CD-AI/LLaVA-CoT-o1-Instruct Viewer • Updated Nov 27, 2024 • 58.5k • 114 • 109
EfficientLLM Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 57
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 57
VL-Thinking-Data PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 17 • 1
PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 17 • 1
MLLMDataV2 DAMO-NLP-SG/multimodal_textbook Updated Mar 17, 2025 • 1.36k • 159 zwq2018/Multi-modal-Self-instruct Viewer • Updated Jan 27, 2025 • 76k • 390 • 31 taesiri/GameplayCaptions-Gemini-pro-vision Viewer • Updated Apr 7, 2024 • 70.7k • 140 • 6 5CD-AI/LLaVA-CoT-o1-Instruct Viewer • Updated Nov 27, 2024 • 58.5k • 114 • 109
MLLMDataV1 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 28 • 3 pengshuai-rin/multimath-300k Viewer • Updated Aug 20, 2024 • 1.19M • 155 • 11 OpenFace-CQUPT/HumanCaption-HQ-311K Viewer • Updated Jun 9, 2025 • 313k • 76 • 17 remyxai/vqasynth_spacellava Viewer • Updated Oct 24, 2024 • 28k • 58 • 14
EfficientLLM Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 57
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 57
MLLM-08 Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 28 • 3
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133