Author(s)
Term
10th term
Education
Publication year
2007
Submitted on
2007-06-07
Pages
103 pages
Abstract
This thesis addresses the challenge of estimating wideband speech (0-8000 Hz) from narrowband speech (0-3400 Hz). The missing upper spectral components are estimated from the narrowband speech using statistical approaches. Using the source-filter model, the estimation problem is divided into estimating a wideband envelope and a wideband excitation signal. These two estimates are then combined to obtain an artificially extended wideband speech signal. Three methods, based on Vector Quantization, Gaussian Mixture Models and Hidden Markov Models respectively, have been developed for estimating the wideband envelope. Results show that the latter two outperform the method based on Vector Quantization in both objective measures and audible quality. The excitation is estimated by simple spectral replication. A new perceptual training procedure, which uses Mel Frequency Cepstral Coefficients for estimating the envelope, is proposed. A formal listening test concludes that speech extended with the proposed method is preferred over band-limited narrowband speech with a level of significance of more than 99
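The abstract does not say how the spectral replication of the excitation is implemented. As an illustration only, the sketch below assumes one common variant, spectral folding, in which a zero is inserted after every narrowband sample so that a mirrored copy of the baseband spectrum appears in the new upper band. The function names, the 8 kHz to 16 kHz sampling rates and the 1 kHz test tone are hypothetical choices for this example and are not taken from the thesis.

```python
import cmath
import math


def spectral_fold(narrowband):
    """Double the sampling rate by inserting a zero after every sample.

    The baseband spectrum is kept and a mirrored copy appears in the
    new upper band (4-8 kHz when going from 8 kHz to 16 kHz).
    """
    wideband = []
    for sample in narrowband:
        wideband.append(sample)
        wideband.append(0.0)
    return wideband


def dft_magnitude(signal, k):
    """Magnitude of DFT bin k (naive O(N) evaluation of a single bin)."""
    n = len(signal)
    acc = sum(s * cmath.exp(-2j * math.pi * k * i / n)
              for i, s in enumerate(signal))
    return abs(acc)


# Hypothetical excitation: a 1 kHz tone sampled at 8 kHz, 80 samples long.
FS_NARROW = 8000
tone = [math.cos(2 * math.pi * 1000 * i / FS_NARROW) for i in range(80)]

# After folding, the 160 samples are interpreted at 16 kHz.
extended = spectral_fold(tone)

# Bin spacing is 16000 / 160 = 100 Hz: bin 10 is 1 kHz, bin 70 is 7 kHz.
original_peak = dft_magnitude(extended, 10)  # the original tone
mirrored_peak = dft_magnitude(extended, 70)  # its mirror image around 4 kHz
```

In a complete system the folded excitation would still have to be shaped by the estimated wideband envelope; that step is outside this sketch.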