Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network

Hao Ren; Wu Ran; Xingson Liu; Haoran Ren; Hong Lu; Rui Zhang; Cheng Jin

doi:10.1109/ICME55011.2023.00177

2023 IEEE International Conference on Multimedia and Expo (ICME)

Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network

Year: 2023, Pages: 1008-1013

DOI Bookmark: 10.1109/ICME55011.2023.00177

Authors

Hao Ren, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Wu Ran, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Xingson Liu, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Haoran Ren, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Hong Lu, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Rui Zhang, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China
Cheng Jin, Fudan University,Shanghai Key Lab of Intelligent Information Processing School of Computer Science,Shanghai,China

Abstract

Weakly-supervised temporal action localization task aims to localize temporal boundaries of action instances by using only video-level labels. Existing methods primarily adopt Multi-Instance-Learning (MIL) scheme to handle this task. The effectiveness of MIL scheme depends heavily on the selection of top-k action snippets, which is unstable and requires manual tuning. To address these deficiencies, we propose an Adaptive Clustering and Refining Network (ACRNet). Specifically, we present an action-aware clustering strategy that is adaptable and requires no manual tuning to separate action and background snippets of diverse videos based on intra-class activation distribution. And a cluster refining step is included to eliminate false action snippets by considering inter-class activation distribution, which greatly improves robustness and localization accuracy. Extensive experiments on THUMOS14, ActivityNet 1.2&1.3 benchmarks show that our method achieves state-of-the-art performance.

Like what you’re reading?

Already a member?

Get this article FREE with a new membership!

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
IEEE Transactions on Pattern Analysis & Machine Intelligence
Weakly Supervised Temporal Action Localization Through Contrastive Learning
2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR)
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Unleashing the Potential of Adjacent Snippets for Weakly-supervised Temporal Action Localization
2023 IEEE International Conference on Multimedia and Expo (ICME)
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Action Unit Memory Network for Weakly Supervised Temporal Action Localization
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Weakly-supervised Temporal Action Localization with Adaptive Clustering and Refining Network

Authors

Abstract

Related Articles