Scaling Properties of Diffusion Models for Perceptual Tasks Paper • 2411.08034 • Published Nov 12, 2024 • 13
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published Oct 2, 2024 • 41
timm Attention Visualization Space 👁 Visualize attention maps for images using selected models • 14
Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 108
MobileNetV4 pretrained weights Collection Weights for MobileNet-V4 pretrained in timm • 17 items • Updated Sep 22, 2024 • 18
DiTFastAttn: Attention Compression for Diffusion Transformer Models Paper • 2406.08552 • Published Jun 12, 2024 • 25
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Paper • 2405.18392 • Published May 28, 2024 • 12
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26, 2024 • 79
2D Gaussian Splatting for Geometrically Accurate Radiance Fields Paper • 2403.17888 • Published Mar 26, 2024 • 28
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20, 2024 • 67
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6, 2024 • 63