Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Paper • 2601.04720 • Published Jan 8 • 57
view article Article Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks Nov 15, 2025 • 7
Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning Paper • 2511.02818 • Published Nov 4, 2025 • 15
SelectMix: Enhancing Label Noise Robustness through Targeted Sample Mixing Paper • 2509.11265 • Published Sep 14, 2025 • 1
Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning Paper • 2509.17971 • Published Sep 22, 2025 • 1
Token Activation Map to Visually Explain Multimodal LLMs Paper • 2506.23270 • Published Jun 29, 2025 • 5
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models Paper • 2504.14032 • Published Apr 18, 2025 • 7
E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker Paper • 2510.22733 • Published Oct 26, 2025 • 32
Heavy Labels Out! Dataset Distillation with Label Space Lightening Paper • 2408.08201 • Published Aug 15, 2024 • 21
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published Oct 22, 2025 • 62
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published Oct 7, 2025 • 55
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Paper • 2510.06590 • Published Oct 8, 2025 • 77
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published Oct 3, 2025 • 99
Granite 4.0 Collection IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth. • 38 items • Updated 8 days ago • 23