3 44 5

Dan Jacobellis PRO

danjacobellis

https://danjacobellis.net

danjacobellis

AI & ML interests

Signal processing, information theory, data compression

Recent Activity

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

upvoted a paper 4 days ago

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

upvoted a paper 4 days ago

Continuous Diffusion Model for Language Modeling

View all activity

Organizations

None yet

danjacobellis's activity

upvoted a paper 2 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 3 days ago • 99

upvoted 3 papers 4 days ago

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

Paper • 2502.09838 • Published 10 days ago • 9

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 6 days ago • 48

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 10 days ago • 30

upvoted a paper 5 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 7 days ago • 133

upvoted a paper 24 days ago

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

Paper • 2501.17578 • Published 25 days ago • 1

upvoted 3 papers 26 days ago

upvoted 4 papers about 1 month ago

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 28

The Geometry of Tokens in Internal Representations of Large Language Models

Paper • 2501.10573 • Published Jan 17 • 9

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 89

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Paper • 2501.07783 • Published Jan 14 • 7

upvoted a collection about 2 months ago

NeMo Audio Codecs

Collection

A series of Neural Audio Codecs • 5 items • Updated Jan 17 • 11

upvoted 2 collections 2 months ago

DC-AE

Collection

Deep Compression Autoencoder • 17 items • Updated about 1 month ago • 15

Cosmos Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated Jan 17 • 39

upvoted 3 papers 2 months ago

Generalized Gaussian Model for Learned Image Compression

Paper • 2411.19320 • Published Nov 28, 2024 • 1

Learned Compression for Compressed Learning

Paper • 2412.09405 • Published Dec 12, 2024 • 13

I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

Paper • 2412.06676 • Published Dec 9, 2024 • 9

upvoted a paper 3 months ago

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Paper • 2411.17459 • Published Nov 26, 2024 • 11