Abstract: In weakly supervised video anomaly detection (WVAD), where only video-level labels indicating the presence or absence of abnormal events are available, the primary challenge arises from the ...
A novel multimodal object detection framework with a sparse transformer and an explicit attention module. An illustration of our proposed multimodal object detection framework is shown in the following ...
2025/12/23: The AAAI camera-ready version has been updated on arXiv, and it includes an introduction to the 6.78M dataset. 2025/11/09: Good news! Our paper has been accepted by AAAI 2026. And in the ...
What started as a normal river metal detecting trip quickly turned into a wild adventure. Every signal brought a surprise, and not all of them were what we expected. This is one hunt we won’t forget.
Abstract: This paper improves upon the Pix2Seq object detector by extending it for videos. In the process, it introduces a new way to perform end-to-end video object detection that improves upon ...