PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 13 days ago • 63
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 13 days ago • 296
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 108
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models Paper • 2502.10458 • Published 12 days ago • 27
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Paper • 2502.10391 • Published 9 days ago • 29
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon Paper • 2502.07445 • Published 12 days ago • 11
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 751
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents Paper • 2502.05957 • Published 14 days ago • 16
📀 Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 37
Nordic embedding training data Collection This is a collection of synthetic datasets for embedding model training in Danish, Swedish and Norwegian (bokmål). • 15 items • Updated 28 days ago • 4
Luganda Fine-Tuned LLMs Collection A collection of Large Language Models fine-tuned on Luganda. The number in the name shows the LoRA Rank used during training. • 7 items • Updated Jan 20 • 1