Short-term anchor linking and long-term self-guided attention for video object detection
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10347/27098
Files in this item
Metadata
Title: | Short-term anchor linking and long-term self-guided attention for video object detection |
Author: | Cores Costa, Daniel Brea Sánchez, Víctor Manuel Mucientes Molina, Manuel Felipe |
Affiliation: | Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Información |
Subject: | Video object detection | Spatio-temporal features | Convolutional neural networks | |
Date of Issue: | 2021 |
Publisher: | Elsevier |
Citation: | Image and Vision Computing. Volume 110, June 2021, 104179 |
Abstract: | We present a new network architecture able to take advantage of spatio-temporal information available in videos to boost object detection precision. First, box features are associated and aggregated by linking proposals that come from the same anchor box in the nearby frames. Then, we design a new attention module that aggregates short-term enhanced box features to exploit long-term spatio-temporal information. This module takes advantage of geometrical features in the long-term for the first time in the video object detection domain. Finally, a spatio-temporal double head is fed with both spatial information from the reference frame and the aggregated information that takes into account the short- and long-term temporal context. We have tested our proposal in five video object detection datasets with very different characteristics, in order to prove its robustness in a wide number of scenarios. Non-parametric statistical tests show that our approach outperforms the state-of-the-art. Our code is available at https://github.com/daniel-cores/SLTnet |
Publisher version: | https://doi.org/10.1016/j.imavis.2021.104179 |
URI: | http://hdl.handle.net/10347/27098 |
DOI: | 10.1016/j.imavis.2021.104179 |
E-ISSN: | 0262-8856 |
Rights: | © 2021 The Authors. Published by Elsevier B.V. This work is licenced under a CC Attribution-NonCommercial-NoDerivatives 4.0 International licence (CC BY-NC-ND 4.0) |
Collections
-
- CiTIUS-Artigos [175]
The following license files are associated with this item: