rotem israeli's picture

rotem israeli

irotem98

·

https://rotem154154.github.io

rotem154154

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

Qwen/Qwen3.5-35B-A3B-FP8

liked a dataset about 16 hours ago

karpathy/climbmix-400b-shuffle

liked a model 1 day ago

0xSero/gemma-4-21b-a4b-it-REAP

View all activity

Organizations

None yet

upvoted a paper 6 days ago

Think Anywhere in Code Generation

Paper • 2603.29957 • Published 8 days ago • 25

upvoted a paper 13 days ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published 26 days ago • 21

upvoted 2 collections about 1 month ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 4 days ago • 132

Qwen3-Embedding

6 items • Updated Dec 31, 2025 • 155

upvoted 2 papers about 1 month ago

Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

Paper • 2602.20161 • Published Feb 23 • 23

SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models

Paper • 2602.18993 • Published Feb 22 • 4

upvoted a paper about 2 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 12 • 80

upvoted a collection about 2 months ago

timm DINOv3

Meta AI's DINOv3 weights in timm. ViTs with `qkvb` have a zero QV bias present, otherwise bias is disabled. QKV bias are all 0 in original weights. • 18 items • Updated Sep 19, 2025 • 32

upvoted a paper about 2 months ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

upvoted a collection about 2 months ago

Dr.Kernel

8 items • Updated Feb 6 • 4

upvoted 2 papers 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

Paper • 2602.02092 • Published Feb 2 • 18

upvoted a collection 2 months ago

Qwen3-VL-Embedding

2 items • Updated Jan 8 • 65

upvoted 7 papers 2 months ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 222

TritonForge: Profiling-Guided Framework for Automated Triton Kernel Optimization

Paper • 2512.09196 • Published Dec 9, 2025 • 1

TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators

Paper • 2502.14752 • Published Feb 20, 2025 • 1

QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation

Paper • 2511.20100 • Published Nov 25, 2025 • 1

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8, 2025 • 31

Liger Kernel: Efficient Triton Kernels for LLM Training

Paper • 2410.10989 • Published Oct 14, 2024 • 2

ConCuR: Conciseness Makes State-of-the-Art Kernel Generation

Paper • 2510.07356 • Published Oct 8, 2025 • 2