A video summarization framework based on activity attention modeling using deep features for smart campus surveillance system

PeerJ Comput Sci. 2022 Mar 25:8:e911. doi: 10.7717/peerj-cs.911. eCollection 2022.

Abstract

Like other business domains, digital monitoring has now become an integral part of almost every academic institution. These surveillance systems cover all the routine activities happening on the campus while producing a massive volume of video data. Selection and searching the desired video segment in such a vast video repository is highly time-consuming. Effective video summarization methods are thus needed for fast navigation and retrieval of video content. This paper introduces a keyframe extraction method to summarize academic activities to produce a short representation of the target video while preserving all the essential activities present in the original video. First, we perform fine-grain activity recognition using a realistic Campus Activities Dataset (CAD) by modeling activity attention scores using a deep CNN model. In the second phase, we use the generated attention scores for each activity category to extract significant video frames. Finally, we evaluate the inter-frame similarity index used to reduce the number of redundant frames and extract only the representative keyframes. The proposed framework is tested on different videos, and the experimental results show the performance of the proposed summarization process.

Keywords: Dats science; Deep learning; Emerging technologies; Machine learning.

Grants and funding

This work was supported by the Princess Nourah bint Abdulrahman University Researchers Supporting Project number PNURSP2022R161, Princes Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.