Merve Noyan

mervenoyan

AI & ML interests

My actual account is hf.co/merve this account is opened before I started working at HF, sometimes I accidentally open PRs here lol

Recent Activity

new activity 6 days ago

google/gemma-3-270m:Fix model tag

upvoted an article 13 days ago

Vision Language Model Alignment in TRL ⚡️

new activity about 1 month ago

merve/smol-vision:Update README.md

View all activity

Organizations

New activity in google/gemma-3-270m 6 days ago

Fix model tag

#3 opened 6 days ago by

mervenoyan

upvoted an article 13 days ago

Article

Vision Language Model Alignment in TRL ⚡️

and 4 others •

14 days ago

• 69

New activity in merve/smol-vision about 1 month ago

Update README.md

#1 opened about 1 month ago by

mervenoyan

liked a Space about 2 months ago

GLM-4.1V-9B-Thinking-Demo

🐢

THUDM/GLM-4.1V-9B-Thinking Demo

New activity in merve/vlm_test_images about 2 months ago

Upload trimmed.mp4

#2 opened about 2 months ago by

mervenoyan

liked 4 models 2 months ago

Update tokenizer_config.json

#2 opened 2 months ago by

mervenoyan

New activity in kernels-community/README 2 months ago

Update README.md

#2 opened 2 months ago by

mervenoyan

upvoted a paper 3 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 128

upvoted an article 3 months ago

Article

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

and 2 others •

Jun 2

• 13

New activity in transformers-community/support 3 months ago

Latest vision & multimodal releases in transformers

🔥 2

#8 opened 3 months ago by

mervenoyan

upvoted an article 3 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

•

Mar 1, 2020

• 237

New activity in nvidia/DAM-3B-Video 4 months ago

Add license

#1 opened 4 months ago by

mervenoyan

New activity in nvidia/DAM-3B 4 months ago

Add license

#2 opened 4 months ago by

mervenoyan

Code snippet to use the model

❤️ 8

#1 opened 4 months ago by

mervenoyan

New activity in nvidia/DAM-3B-Self-Contained 4 months ago

License

❤️ 1

#1 opened 4 months ago by

merve

posted an update 4 months ago

Post

640

Why do people sleep on DSE multimodal retrieval models? 👀

They're just like ColPali, but highly scalable, fast and you can even make them more efficient with binarization or matryoshka with little degradation 🪆

I made a small collection of them so you can get started merve/multimodal-dse-retrievers-67fe71a9c8f1ad26a48859c3

Image taken from MCDSE blog https://huggingface.co/blog/marco/announcing-mcdse-2b-v1

Merve Noyan

AI & ML interests

Recent Activity

Organizations

mervenoyan's activity

Fix model tag

Vision Language Model Alignment in TRL ⚡️

Update README.md

GLM-4.1V-9B-Thinking-Demo

Upload trimmed.mp4

Update tokenizer_config.json

Update README.md

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

Latest vision & multimodal releases in transformers

How to generate text: using different decoding methods for language generation with Transformers

Add license

Add license

Code snippet to use the model

License