In a Training Loop 🔄

Dmitry Ryumin

DmitryRyumin

https://dmitryryumin.github.io

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Recent Activity

liked a Space 3 months ago

huggingface/ai-deadlines

upvoted a paper 14 days ago

Phi-4-reasoning-vision-15B Technical Report

Paper • 2603.03975 • Published 16 days ago • 19

upvoted a paper 2 months ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published Jan 8 • 57

upvoted an article 4 months ago

Article

Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks

Nov 15, 2025

upvoted a paper 4 months ago

Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning

Paper • 2511.02818 • Published Nov 4, 2025 • 15

upvoted 12 papers 5 months ago

SelectMix: Enhancing Label Noise Robustness through Targeted Sample Mixing

Paper • 2509.11265 • Published Sep 14, 2025 • 1

Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning

Paper • 2509.17971 • Published Sep 22, 2025 • 1

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 103

Token Activation Map to Visually Explain Multimodal LLMs

Paper • 2506.23270 • Published Jun 29, 2025 • 5

LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models

Paper • 2504.14032 • Published Apr 18, 2025 • 7

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30

E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Paper • 2510.22733 • Published Oct 26, 2025 • 32

Heavy Labels Out! Dataset Distillation with Label Space Lightening

Paper • 2408.08201 • Published Aug 15, 2024 • 21

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 62

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 55

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Paper • 2510.06590 • Published Oct 8, 2025 • 77

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

upvoted 3 collections 6 months ago

upvoted a paper 6 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100
