Baby Crying Sound Classification using Convolutional Neural Network

Authors

  • Naufal Fikri Muhammad, Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Malaysia
  • Raimi Dewan, IJN-UTM Cardiovascular Engineering Centre, Institute of Human Centered Engineering, Universiti Teknologi Malaysia, Malaysia
  • Jaysuman Pusppanathan, Department of Biomedical Engineering and Health Sciences, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Malaysia
  • Faishal Adilah Suryanata, Advanced Radio Frequency & Microwave Research Group, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Malaysia

DOI:

https://doi.org/10.11113/humentech.v3n1.66

Keywords:

Baby cry, Convolutional neural network, Machine learning, Mel-frequency cepstral coefficient, Sound classification

Abstract

Crying is a newborn's earliest form of communication. Without appropriate training or expertise, such as that of nurses, paediatricians, and childcare professionals, many individuals cannot recognise what a baby intends to convey through its cry, so interpreting a cry accurately can be challenging. This study presents a method for classifying baby crying sounds using a Convolutional Neural Network (CNN). The dataset covers five classes (belly pain, burping, discomfort, hungry, and tired) for a total of 3,495 one-second-long audio clips. The research methodology involves preprocessing the audio data, extracting Mel-Frequency Cepstral Coefficients (MFCC) as features, and training the CNN model. To determine the better architecture, two configurations of the CNN model are evaluated. Their settings are identical except for the layer sizes: the first configuration uses 100, 200, and 100 neurons in its respective layers, while the second uses 256, 512, and 256. The evaluation shows that the second configuration, with its larger layers, achieves higher accuracy (86%) than the first configuration (84%). The study demonstrates the effectiveness of CNNs in classifying baby cries and highlights the importance of model architecture in achieving accurate classification results. Future research could explore larger and more diverse datasets to improve generalizability.
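
As an illustration of the MFCC feature-extraction step named in the abstract, the following is a minimal NumPy sketch, not the authors' implementation. The abstract does not state the sampling rate, frame length, hop size, number of mel filters, or number of coefficients, so the values below (16 kHz, 512-sample frames, 256-sample hop, 40 filters, 13 coefficients) are assumptions chosen only for the example, and the function names are hypothetical. The input is a synthetic one-second tone standing in for a cry clip.

```python
import numpy as np


def hz_to_mel(hz):
    """Convert frequency in Hz to the mel scale (common HTK-style formula)."""
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sr):
    """Build triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):      # rising edge of the triangle
            fb[m - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):     # falling edge of the triangle
            fb[m - 1, k] = (right - k) / max(right - center, 1)
    return fb


def mfcc(signal, sr=16000, n_fft=512, hop=256, n_mels=40, n_mfcc=13):
    """Return an (n_frames, n_mfcc) array of MFCCs. All defaults are assumed."""
    # Slice the signal into overlapping frames and apply a Hann window.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # Power spectrum of each frame.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2 / n_fft
    # Mel-band energies, then log compression (small offset avoids log(0)).
    log_mel = np.log(power @ mel_filterbank(n_mels, n_fft, sr).T + 1e-10)
    # Unnormalised DCT-II over the mel axis, keeping the first n_mfcc terms.
    n = np.arange(n_mels)
    basis = np.cos(np.pi / n_mels * (n[None, :] + 0.5) * np.arange(n_mfcc)[:, None])
    return log_mel @ basis.T


# Synthetic one-second clip (a 440 Hz tone) in place of real cry audio.
sr = 16000
t = np.arange(sr) / sr
clip = np.sin(2.0 * np.pi * 440.0 * t)
features = mfcc(clip, sr=sr)
print(features.shape)  # (61, 13) with the assumed settings
```

Under these assumed settings each one-second clip becomes a 61 × 13 matrix, which could then be passed to a CNN as a single-channel input. In practice an audio library would normally handle this step; the hand-written version is shown only to make the individual stages visible.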

Published

06-02-2024

How to Cite

Muhammad, N. F., Dewan, R., Pusppanathan, J., & Suryanata, F. A. (2024). Baby Crying Sound Classification using Convolutional Neural Network. Journal of Human Centered Technology, 3(1), 67–74. https://doi.org/10.11113/humentech.v3n1.66

Section

Articles
