DREAM: Efficient Dataset Distillation by Representative Matching Paper • 2302.14416 • Published Feb 28, 2023
MLLMs-Augmented Visual-Language Representation Learning Paper • 2311.18765 • Published Nov 30, 2023 • 1
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published May 7, 2025 • 28
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning Paper • 2509.01644 • Published Sep 2025 • 26
Autoregressive Speech Synthesis without Vector Quantization Paper • 2407.08551 • Published Jul 11, 2024 • 17
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Paper • 2406.05370 • Published Jun 8, 2024 • 19