Adaptive Proxy Anchor Loss For Deep Metric Learning
Nguyen Phan, Sen Tran, Ta Duc Huy, Soan T.M. Duong, Chanh D.Tr. Nguyen, Trung Bui, Steven Q.H. Truong
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:13:20
Video object detection aims to detect and track each object in a given video. However, due to the problem of appearance deterioration in the video, it is still challenging to obtain good results when we apply traditional image object detection methods to videos. in this paper, we propose a new feature aggregation method, called Dual Feature Aggregation (DualFeat) for video object detection. By effectively combining the temporal and spatial attention mechanisms, we make full use of the temporal and spatial information in videos. Meanwhile, we leverage a real-time tracker to track detected objects in video frames, where features are aggregated again with previously obtained features. Such a way helps to obtain more comprehensive and richer features, greatly improving the accuracy of video object detection. We perform experiments on the ILSVRC2017 dataset, and the experimental results also verify the effectiveness of our method.