1 39 75

gerald hewes

gerald29

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

LLM-based User Profile Management for Recommender System

upvoted a paper 2 days ago

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

upvoted a paper 2 days ago

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

View all activity

Organizations

None yet

gerald29's activity

upvoted 10 papers 2 days ago

LLM-based User Profile Management for Recommender System

Paper • 2502.14541 • Published 3 days ago • 4

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Paper • 2502.14802 • Published 3 days ago • 9

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Paper • 2502.14044 • Published 4 days ago • 6

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published 3 days ago • 10

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published 3 days ago • 13

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published 3 days ago • 10

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published 5 days ago • 22

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 3 days ago • 22

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 3 days ago • 50

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 99

upvoted a paper 3 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 4 days ago • 136

upvoted a paper 14 days ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 17 days ago • 33

upvoted an article 18 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted a paper 27 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published about 1 month ago • 51

upvoted 4 papers about 1 month ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 24

upvoted a collection about 2 months ago

Sa2VA Model Zoo

Collection

Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated 14 days ago • 29

upvoted a paper 3 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 128