Anthonny Olime

Aviv-anthonnyolime

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

facebook/audiobox-aesthetics

liked a model 2 days ago

facebook/musicgen-small

liked a model 2 days ago

stabilityai/stable-audio-open-1.0

Aviv-anthonnyolime's activity

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 98

upvoted 2 papers 3 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 4 days ago • 136

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published 17 days ago • 26

upvoted 2 papers 5 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 12 days ago • 43

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 7 days ago • 133

upvoted 2 papers 6 days ago

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

Paper • 2502.08127 • Published 11 days ago • 49

Large Language Diffusion Models

Paper • 2502.09992 • Published 9 days ago • 75

upvoted an article 8 days ago

Article

Fixing Open LLM Leaderboard with Math-Verify

10 days ago • 24

upvoted an article 13 days ago

Article

Open R1: Update #2

and 6 others • 13 days ago • 184

upvoted a paper 17 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 19 days ago • 190

upvoted a collection 17 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 3 days ago • 239

upvoted 2 articles 23 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

27 days ago • 770

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

23 days ago • 36

upvoted 2 collections 25 days ago

image

Collection

241 items • Updated 4 days ago • 3

Papers - Google

Collection

53 items • Updated Nov 2, 2024 • 2

upvoted a paper 25 days ago

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

Paper • 2403.18818 • Published Mar 27, 2024 • 26

upvoted a paper 26 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 329

upvoted an article 26 days ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

30 days ago • 12

upvoted 2 papers about 1 month ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 77

LaDiMo: Layer-wise Distillation Inspired MoEfier

Paper • 2408.04278 • Published Aug 8, 2024 • 1