Sidharth

sidhusmart

AI & ML interests

None yet

Recent Activity

updated a Space about 1 month ago

sidhusmart/nhs-buddy

Organizations

None yet

upvoted a collection 2 days ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated 6 days ago • 106

upvoted a paper 3 days ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published 6 days ago • 42

upvoted 2 papers over 1 year ago

Video ReCap: Recursive Captioning of Hour-Long Videos

Paper • 2402.13250 • Published Feb 20, 2024 • 27

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 37

upvoted a collection almost 2 years ago

📦 3D creation workflow

Going from a text prompt to a nice 3D model • 3 items • Updated May 5 • 30