Venue Category Estimation

Neural Multimodal Cooperative Learning Towards Micro-video Understanding

The prevailing characteristics of micro-videos result in the less descriptive power of each modality. The micro-video representations, several pioneer efforts proposed, are limited in implicitly exploring the consistency between different modality …