view article Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 14 days ago • 69
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 128
view article Article AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan By evijit and 2 others • Jun 2 • 13
view article Article How to generate text: using different decoding methods for language generation with Transformers By patrickvonplaten • Mar 1, 2020 • 237
view post Post 640 Why do people sleep on DSE multimodal retrieval models? 👀They're just like ColPali, but highly scalable, fast and you can even make them more efficient with binarization or matryoshka with little degradation 🪆I made a small collection of them so you can get started merve/multimodal-dse-retrievers-67fe71a9c8f1ad26a48859c3Image taken from MCDSE blog https://huggingface.co/blog/marco/announcing-mcdse-2b-v1 See translation 🤗 1 1 + Reply