Mgn: Multi-Glimpse Network for Action Recognition
In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018-04-01
Online
unknown
Zugriff:
Current state-of-the-art action recognition approaches rely on optical flow to extract the local motion information and ignore the importance of global description of the videos. In this paper, we present a novel architecture, named Multi-Glimpse Network (MGN), to boost the performance of action recognition by combining the local and global information of the videos. Specifically, MGN makes predictions through two important modules, Local Glimpse and Global Glimpse. Local Glimpse extracts the local spatiotemporal features of different periods using temporal sampling method. Global Glimpse aggregates the extracted local features to develop global description of the videos. These two modules are complementary and indispensable. Our MGN achieves competitive results on four video action benchmarks of UCF10l, HMDB51, ODAR and Penn.
Titel: |
Mgn: Multi-Glimpse Network for Action Recognition
|
---|---|
Autor/in / Beteiligte Person: | Gu, Huxiang ; Xiang, Shiming ; Yu, Tingzhao ; Pan, Chunhong ; Guo, Chaoxu |
Link: | |
Zeitschrift: | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018-04-01 |
Veröffentlichung: | IEEE, 2018 |
Medientyp: | unknown |
DOI: | 10.1109/icassp.2018.8462612 |
Schlagwort: |
|
Sonstiges: |
|