Anthony Ivan S

anthonyivn

anthonyivn2

AI & ML interests

None yet

Organizations

None yet

anthonyivn's activity

liked a model 6 days ago

microsoft/OmniParser-v2.0

Image-Text-to-Text • Updated 5 days ago • 4.54k • 881

upvoted 2 papers 12 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 13 days ago • 134

upvoted an article 18 days ago

Open-source DeepResearch – Freeing our search agents

Article • 20 days ago • 1.08k

liked a model 26 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 4.43M • 9.99k

upvoted a paper about 1 month ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41

upvoted 2 articles about 1 month ago

🪆 Introduction to Matryoshka Embedding Models

Article • Feb 23, 2024 • 77

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 148

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 95

liked a model about 1 month ago

jxm/cde-small-v2

Feature Extraction • Updated 20 days ago • 11.3k • 77

liked 4 models about 2 months ago

liked 2 models 2 months ago

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 667k • 354

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 10M • 765

liked a Space 2 months ago

Scaling test-time compute 📈

Enhance math problem solving by scaling test-time compute

Space • 518

liked a dataset 2 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jan 8 • 12.5B • 69.3k • 432

liked a model 3 months ago

nvidia/Hymba-1.5B-Base

Text Generation • Updated Jan 2 • 3.15k • 139

upvoted a paper 4 months ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 66