Daniel Bourke's picture

Daniel Bourke PRO

mrdbourke

·

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a Space 3 days ago

google/paligemma2-10b-mix

upvoted an article 3 days ago

SmolVLM2: Bringing Video Understanding to Every Device

upvoted an article 3 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

View all activity

Organizations

None yet

mrdbourke's activity

upvoted 2 articles 3 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

4 days ago

• 136

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

5 days ago

• 53

upvoted a paper 5 days ago

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 35

upvoted a collection 6 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 73

upvoted a collection 10 days ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 141

upvoted an article 12 days ago

Article

Open R1: Update #2

By

and 6 others •

13 days ago

• 184

upvoted an article 13 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted a collection 13 days ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 13 days ago • 63

upvoted a collection 15 days ago

Core ML Segment Anything 2

8 items • Updated Oct 4, 2024 • 29

upvoted 2 collections 24 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 28 days ago • 100

Mistral Small

5 items • Updated 24 days ago • 4

upvoted 2 articles 24 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 148

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 39

upvoted an article 25 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 751

upvoted 2 collections 25 days ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 3 days ago • 34

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 28 days ago • 360

upvoted an article 26 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago

• 770

upvoted a collection 3 months ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated Jan 10 • 85

upvoted 2 papers 3 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43