Author(s)
Term
4. term
Education
Publication year
2021
Submitted on
2021-06-03
Pages
51 pages
Abstract
Within the field of signal processing, a commonly occurring problem is that of denoising signals. This problem is especially relevant within the domain of speech processing. In recent years, deep learning models have shown state of the art performance in speech enhancement applications, surpassing previous methods in both objective and subjective performance. Deep learning models have previously suffered from reduced performance on unknown noise levels, however, a recent discovery within the field of image processing, indicates that bias-free models can generalise better across noise levels. This report does not seek to create a new state of the art within speech enhancement, but instead investigates the implications of these bias-free models. For this, four different types of convolutional neural networks were selected and evaluated for their performance under both bias-free and conventional configurations. Generally, bias-free networks are not found to have any significant improvement in generalisation over regular networks. However, UNet achieved significantly better performance, in a bias-free configuration, within known SNR ranges and marginally better outside known SNR ranges. A denoising CNN with a conventional configuration performed best overall.
Documents
Colophon: This page is part of the AAU Student Projects portal, which is run by Aalborg University. Here, you can find and download publicly available bachelor's theses and master's projects from across the university dating from 2008 onwards. Student projects from before 2008 are available in printed form at Aalborg University Library.
If you have any questions about AAU Student Projects or the research registration, dissemination and analysis at Aalborg University, please feel free to contact the VBN team. You can also find more information in the AAU Student Projects FAQs.