CASA
Collection
CASA: Cross-Attention as Self-Attention for Efficient Vision-Language Fusion on long context streaming inputs
•
6 items
•
Updated
•
6
Totally Free + Zero Barriers + No Login Required