Spectral Speech Enhancement using Deep Neural Networks: - Design, Analysis \& Evaluation -
Student thesis: Master thesis (including HD thesis)
- Anders Post Jacobsen
- Morten Kolbæk
4. term, Signal Processing and Computing, Master (Master Programme)
Speech enhancement is an important issue within a wide range of applications such as mobile phones, speech recognition and hearing aids. In various acoustic environments, especially at low \acp{SNR}, the goal of speech enhancement methods is to solve the cocktail party problem. Regarding intelligibility, different machine learning methods that aim to estimate an ideal binary mask have revealed promising results. This master's thesis covers the work of speech enhancement by use of the machine learning method \ac{DNN}. In particular, a MATLAB implementation of a system based on \acp{DNN} for estimating an ideal binary mask was carried out. Simulations have revealed that the proposed \ac{DNN} based speech enhancement algorithm can enhance noisy speech in terms of an intelligibility predictor (STOI) and a quality predictor (PESQ). Likewise, it has been found that by using a soft mask, instead of a binary mask, additional improvement in STOI and PESQ can be achieved. The project is suggested and motivated by both Aalborg University and Oticon A/S.
Language | English |
---|---|
Publication date | 2 Jun 2015 |
Number of pages | 160 |
Images
