Two-Stream Transformer Architecture for Long Video Understanding Paper • 2208.01753 • Published Aug 2, 2022
Rethinking movie genre classification with fine-grained semantic clustering Paper • 2012.02639 • Published Dec 4, 2020
Geo-Sign: Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation Paper • 2506.00129 • Published May 30 • 1
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization Paper • 2310.03456 • Published Oct 5, 2023
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization Paper • 2307.12659 • Published Jul 24, 2023