Skip to main content

Human Scream Detection Through Three-Stage Supervised Learning and Deep Learning

  • Conference paper
  • First Online:
Inventive Systems and Control

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 204))

  • 1345 Accesses

  • 4 Citations

Abstract

As for situation responsiveness, audio and video signals are very important. Audio is significant because it can enlighten us concerning situations, character, time, and place. The study describes a distress audio (scream or shout) event classification system which precisely classifies an audio event as ambient noise, screams, and shouts. Scream is a high-pitch vocalized sound in the absence of phonological structure. This study, researched the classification system using a three-phase SVM-based classifier model for segregating human distress sound from noise and then scream from a shout. The training of SVM-based classifier is done with audio MFCC as feature vectors, appropriately chosen from a set of 400 audio sets, which are selected according to a two-phase process. A classifier is trained and tested with each feature subset. SVM-based classifier analyses and predicts the sound which works with linear kernel and radial basis function (rbf) kernel. The obtained classification performance then again passes through the Multilayer Perceptron Model. When the model gets any sound then it tries to identify patterns in it using perceptron weights and biases if it can’t get success it slightly changes its weights for getting the correct result if it successfully detects a scream in sound then it calls the emergency function. Our results demonstrate that the system can generate a 90% accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 169.00
Price excludes VAT (USA)
Softcover Book
USD 219.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. M.K. Nandwana, A. Ziaei, J.H. Hansen, Robust unsupervised detection of human screams in noisy acoustic environments, in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (IEEE, 2015), pp. 161–165

    Google Scholar 

  2. G. Valenzise, L. Gerosa, M. Tagliasacchi, F. Antonacci, A. Sarti, Scream and gunshot detection and localization for audio-surveillance systems, in 2007 IEEE Conference on Advanced Video and Signal Based Surveillance (IEEE, 2007), pp. 21–26

    Google Scholar 

  3. S. Ntalampiras, I. Potamitis, N. Fakotakis, On acoustic surveillance of hazardous situations, in IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 165–168 (2009)

    Google Scholar 

  4. M.A. Sehili, B. Lecouteux, M. Vacher, F. Portet, D. Istrate, B. Dorizzi, J. Boudy, Sound environment analysis in smart home, in International Joint Conference on Ambient Intelligence (Springer, Berlin, Heidelberg, 2012), pp. 208–223

    Google Scholar 

  5. H.D. Tran, H. Li, Sound event recognition with probabilistic distance SVMs. IEEE Trans. Audio Speech Lang. Process. 19(6), 1556–1568 (2010)

    Article  Google Scholar 

  6. W. Huang, T.K. Chiew, H. Li, T.S. Kok, J. Biswas, Scream detection for home applications, in 2010 5th IEEE Conference on Industrial Electronics and Applications (IEEE, 2010), pp. 2115–2120

    Google Scholar 

  7. S. Ntalampiras, I. Potamitis, N. Fakotakis, On acoustic surveillance of hazardous situations, in 2009 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE, 2009), pp. 165–168

    Google Scholar 

  8. L. Gerosa, G. Valenzise, M. Tagliasacchi, F. Antonacci, A. Sarti, Scream and gunshot detection in noisy environments, in 2007 15th European Signal Processing Conference (IEEE, 2007), pp. 1216–1220

    Google Scholar 

  9. J. Pohjalainen, P. Alku, T. Kinnunen, Shout detection in noise, in 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (IEEE, 2011), pp. 4968–4971

    Google Scholar 

  10. .K. Mittal, B. Yegnanarayana, Production features for detection of shouted speech, in 2013 IEEE 10th Consumer Communications and Networking Conference (CCNC) (IEEE, 2013), pp. 106–111

    Google Scholar 

  11. A. Sharma, S. Kaul, Two-stage supervised learning-based method to detect screams and cries in urban environments. IEEE/ACM Trans. Audio Speech Lang. Process. 24(2), 290–299 (2015)

    Article  Google Scholar 

  12. S. Chung, Y. Chung, Scream sound detection based on SVM and GMM, in RTET-17, 2017

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ashutosh Shankhdhar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shankhdhar, A., Rachit, Kumar, V., Mathur, Y. (2021). Human Scream Detection Through Three-Stage Supervised Learning and Deep Learning. In: Suma, V., Chen, J.IZ., Baig, Z., Wang, H. (eds) Inventive Systems and Control. Lecture Notes in Networks and Systems, vol 204. Springer, Singapore. https://doi.org/10.1007/978-981-16-1395-1_28

Download citation

Publish with us

Policies and ethics