14 19 49

Tony Wu

tonywu71

https://tonywu71.notion.site/Hi-I-m-Tony-e937d2baf5ab4669904b04fd24513499?pvs=74

AI & ML interests

RAG, LLMs, ASR

Recent Activity

upvoted an article about 4 hours ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

upvoted an article 2 days ago

SigLIP 2: A better multilingual vision language encoder

View all activity

Organizations

tonywu71's activity

upvoted an article about 4 hours ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 282

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 99

upvoted an article 2 days ago

Article

SigLIP 2: A better multilingual vision language encoder

3 days ago

• 71

upvoted an article 4 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

5 days ago

• 53

upvoted a paper 17 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

upvoted an article 18 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

20 days ago

• 106

upvoted an article 19 days ago

Article

Open-source DeepResearch – Freeing our search agents

20 days ago

• 1.08k

upvoted an article about 1 month ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 142

upvoted a paper about 2 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 73

upvoted a paper 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

upvoted an article 4 months ago

Article

Visually Multilingual: Introducing mcdse-2b

•

Oct 27, 2024

• 38

upvoted a collection 4 months ago

Leaderboards and benchmarks ✨

Collection

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 90 items • Updated 18 days ago • 96

upvoted an article 5 months ago

Article

Document Similarity Search with ColPali

•

Sep 21, 2024

• 49

upvoted a paper 6 months ago

GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Paper • 2409.06595 • Published Sep 10, 2024 • 38

upvoted an article 7 months ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

• 71

upvoted an article 8 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 205

upvoted a paper 8 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 44