A combined motion-audio school bullying detection algorithm
Ye, Liang; Wang, Peng; Wang, Le; Ferdinando, Hany; Seppänen, Tapio; Alasaarela, Esko (2018-07-05)
Ye, L., Wang, P., Wang, L., Ferdinando, H., Seppänen, T., & Alasaarela, E. (2018). A Combined Motion-Audio School Bullying Detection Algorithm. International Journal of Pattern Recognition and Artificial Intelligence, 32(12), 1850046. https://doi.org/10.1142/s0218001418500465
Preprint of an article published in International Journal of Pattern Recognition and Artificial Intelligence, Volume: 32, Issue: 12, 2018, Article Number: 1850046, https://doi.org/10.1142/s0218001418500465 © World Scientific Publishing Company, https://www.worldscientific.com/worldscinet/ijprai.
https://rightsstatements.org/vocab/InC/1.0/
https://urn.fi/URN:NBN:fi-fe2020042822798
Abstract
School bullying is a common social problem that affects children both mentally and physically, making the prevention of bullying a timeless topic all over the world. This paper proposes a method for detecting school bullying based on activity recognition and speech emotion recognition. In this method, motion and voice data are gathered by movement sensors and a microphone, and a set of motion and audio features is then extracted to distinguish bullying incidents from daily life events. The extracted motion features include both time-domain and frequency-domain features, while the audio features are computed with classical Mel-frequency cepstral coefficients (MFCCs). Feature selection is implemented using the wrapper approach. At the next stage, the motion and audio features are merged to form combined feature vectors for classification, and linear discriminant analysis (LDA) is used for further dimension reduction. A back-propagation neural network (BPNN) is trained to recognize bullying activities and distinguish them from normal daily life activities. The authors also propose an action transition detection method to reduce computational complexity for practical use, so that the bullying detection algorithm runs only when an action transition event has been detected. Simulation results show that the combined motion-audio feature vector outperforms separate motion features and acoustic features, achieving an accuracy of 82.4% and a precision of 92.2%. Moreover, with the action transition method, the computation cost can be reduced by half.
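The feature fusion and action-transition gating described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the exact motion features, the transition criterion, or any thresholds, so the function names (`motion_features`, `combine_features`, `is_action_transition`, `run_gated_detection`), the three motion features, the standard-deviation transition test, and the `threshold` value are all assumptions made for illustration. The audio vector stands in for precomputed MFCCs, and `classify` stands in for the trained classifier.

```python
import numpy as np


def motion_features(window):
    """Toy time- and frequency-domain features for one motion window.

    window: array of shape (n_samples, 3) holding 3-axis acceleration.
    The choice of features here is an assumption, not the paper's set.
    """
    magnitude = np.linalg.norm(window, axis=1)
    # Time-domain: mean and standard deviation of acceleration magnitude.
    time_feats = [magnitude.mean(), magnitude.std()]
    # Frequency-domain: spectral energy with the DC component removed.
    spectrum = np.abs(np.fft.rfft(magnitude - magnitude.mean()))
    freq_feats = [np.sum(spectrum ** 2) / len(magnitude)]
    return np.array(time_feats + freq_feats)


def combine_features(motion_vec, audio_vec):
    """Merge motion and audio features into one combined feature vector."""
    return np.concatenate([motion_vec, audio_vec])


def is_action_transition(prev_window, curr_window, threshold):
    """Hypothetical transition test: activity level changed by > threshold.

    Activity level is taken as the std of the acceleration magnitude.
    """
    prev_level = np.linalg.norm(prev_window, axis=1).std()
    curr_level = np.linalg.norm(curr_window, axis=1).std()
    return abs(curr_level - prev_level) > threshold


def run_gated_detection(windows, audio_vecs, classify, threshold=0.5):
    """Run the classifier only on windows where a transition is detected.

    Returns a list of (window_index, classifier_output) pairs.
    """
    results = []
    for i in range(1, len(windows)):
        if is_action_transition(windows[i - 1], windows[i], threshold):
            fused = combine_features(motion_features(windows[i]), audio_vecs[i])
            results.append((i, classify(fused)))
    return results
```

In this sketch, a stream that alternates between long still periods and bursts of movement triggers the classifier only at the boundaries between them, which is the intuition behind the reported reduction in computation. The dimension-reduction step (LDA) and the neural network itself are omitted and would sit inside `classify`.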