Volume 19, Issue 2
  • ISSN: 1872-2121
  • E-ISSN: 2212-4047

Abstract

Background

This work addresses the automated classification of videos using artificial neural networks. To explore the concepts and measure results, the UCF101 data set is used: a collection of video clips taken from YouTube and annotated for action recognition. The study is carried out with the authors' own resources in order to assess the feasibility of independent research in the area.

Methods

This work was developed in the Python programming language using the Keras library with TensorFlow as the backend. The objective is to develop a network whose performance in classifying videos according to the actions performed is compatible with the state of the art.
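The abstract does not give implementation details, but a common first step in Keras/TensorFlow video pipelines of this kind is to reduce each variable-length clip to a fixed number of frames before it is fed to the network. The sketch below illustrates uniform frame sampling with NumPy only; the function name, clip length, and frame size are illustrative, not taken from the paper:

```python
import numpy as np

def sample_frames(num_frames: int, clip_len: int) -> np.ndarray:
    """Return `clip_len` frame indices spread evenly over a video."""
    return np.linspace(0, num_frames - 1, clip_len).round().astype(int)

# A stand-in for a decoded UCF101 clip: 250 frames of 112x112 RGB.
video = np.zeros((250, 112, 112, 3), dtype=np.uint8)

idx = sample_frames(len(video), 16)
clip = video[idx]  # fixed-size input tensor, shape (16, 112, 112, 3)
```

Sampling indices rather than decoding every frame keeps memory use bounded regardless of clip length, which matters on the limited hardware the paper describes.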

Results

Given the hardware limitations, there is a considerable gap between what could be implemented in this work and what is known as the state of the art.

Conclusion

Throughout the work, some aspects in which this limitation influenced the development are presented, but it is shown that such an undertaking is feasible and that expressive results are attainable: 98.6% accuracy is obtained on the UCF101 data set, compared with the 98 percentage points of the best result ever reported, while using considerably fewer resources. In addition, the importance of transfer learning in achieving expressive results, as well as the different performance of each architecture, is reviewed. Thus, this work may open doors to patent-based outcomes.
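The conclusion credits transfer learning, i.e., reusing a pretrained network's features while training only a small classification head. The principle can be sketched with a frozen random projection standing in for a pretrained backbone (all names, sizes, and data are illustrative; this is not the authors' model):

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen "backbone": a fixed projection standing in for pretrained
# convolutional features (its weights are never updated).
W_frozen = rng.standard_normal((64, 16)) / np.sqrt(64)

def backbone(x):
    return np.maximum(x @ W_frozen, 0.0)  # ReLU features

# Toy two-class problem.
X = rng.standard_normal((200, 64))
y = (X @ rng.standard_normal(64) > 0).astype(float)
feats = backbone(X)  # computed once; only the head trains below

# Trainable head: logistic regression on the frozen features.
w, b, lr = np.zeros(16), 0.0, 0.1
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(feats @ w + b)))
    g = (p - y) / len(y)        # gradient of mean cross-entropy
    w -= lr * feats.T @ g
    b -= lr * g.sum()

p = np.clip(1.0 / (1.0 + np.exp(-(feats @ w + b))), 1e-9, 1 - 1e-9)
loss = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
```

Because only the small head is optimized, training cost is a fraction of full fine-tuning, which is why transfer learning suits the resource-constrained setting the paper describes.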

/content/journals/eng/10.2174/0118722121248139231023111754
2025-02-01
2024-11-22
