Speech Segregation in Background Noise Based on Deep Learning

Awotunde, Joseph Bamidele and Ogundokun, Roseline Oluwaseun and Ayo, Femi Emmanuel and Matiluko, Opeyemi Emmanuel (2020) Speech Segregation in Background Noise Based on Deep Learning. IEEE Access, 8. pp. 169568-169575. ISSN 2169-3536

[thumbnail of Speech_Segregation_in_Background_Noise_Based_on_Deep_Learning.pdf]
Preview
Text
Speech_Segregation_in_Background_Noise_Based_on_Deep_Learning.pdf - Published Version

Download (4MB) | Preview

Abstract

The most important way several people communicate is through speech. Speech is used to convey other information such as speaker communication, emotion, and attitude. Therefore, it is the most convenient and natural means of communication. The concept of speech segregation or processing involves sorting out wanted speech from noises in the background. Recently, a supervised learning approach was formulated for speech segregation problems. The latest trend in speech processing comprises the utilization of deep learning systems to increase the computational speed and performance of speech processing tasks. Hence, this study employed the use of a convolutional neural network to segregate speech in background noise. The convolutional neural network was used to explain the features of presenter auditory and consecutive subtleties. An unadapted speaker model was originally utilized to separate the two vocalizations gestures; they were then applied to the assessed signal-to-noise ratio (SNR) participation. The participation of SNR was thereafter applied to modify the speaker prototypes for re-estimating the speech signals that iterated twice before convergence. The developed method was tested on the TIMIT dataset. The results showed the strength of the developed method for speech segregation in background noise. Also, the findings of the study suggested that the method enhanced isolation performance and congregated reasonably fast. It was deduced that the system is simple and performs better in comparison to ultramodern speech processing methods in some input SNR conditions.

Item Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
Depositing User: Unnamed user with email opmat01@yahoo.com
Date Deposited: 23 Oct 2024 15:12
Last Modified: 23 Oct 2024 15:12
URI: https://ecrtd-digital-library.org/id/eprint/1

Actions (login required)

View Item
View Item
Search Screen...
For better search result, please place phrase searches inside quotes (" ") and capitalize proper nouns (eg. America, Nigeria, United Kingdom)