AAU Student Projects is unavailable between June 15th 1.30pm and 17th 1.30pm due to planned system maintenance. The projects cannot be downloaded during this period.
AAU Student Projects - visit Aalborg University's student projects portal
An executive master's programme thesis from Aalborg University
Book cover


Fractional-Order Noise Estimation for Speech Enhancement in Acoustic Environments with 1/f Noise: A Grünwald-Letnikov Modification of MCRA

Author

Term

4. Term

Publication year

2026

Submitted on

Abstract

This thesis introduces GL-MCRA, an improved method for estimating background noise in speech recordings. Standard MCRA uses recursive averaging with exponentially decaying memory, which is a poor match for 1/f noise whose correlations decay according to a power law. GL-MCRA replaces the first-order update with a Grünwald–Letnikov power-law weighted sum over past power measurements gated by a speech-presence detector. A fractional (non-integer) order r (0.1–0.9) controls how quickly the weights decay, giving the estimator long-memory behavior that better matches 1/f noise. We evaluated the method on the NOIZEUS corpus under seven noise types and four SNR levels. At r=0.9, GL-MCRA achieved higher segmental SNR improvement than standard MCRA in all conditions, with statistically significant gains of 0.23–0.48 dB (5.4–11.5%, p<0.001). Perceptual evaluation of speech quality supported the objective findings. In a mean opinion score (MOS) listening test with 17 participants, listeners reported a condition-dependent benefit. The main limitation of GL-MCRA is its higher computational complexity compared to standard MCRA.

Denne afhandling introducerer GL-MCRA, en forbedret metode til at estimere baggrundsstøj i taleoptagelser. Den klassiske MCRA-metode bygger på rekursiv middelværdi, som giver en eksponentielt aftagende hukommelse. Det passer dårligt til 1/f-støj, hvor korrelationerne aftager efter en potenslov. GL-MCRA erstatter den førsteordens opdatering med en Grünwald–Letnikov-vægtet sum, der vægter tidligere effektmålinger efter en potenslov og er styret af en taletilstedeværelses-detektor. En fraktionel (ikke-heltallig) orden r (0,1–0,9) bestemmer, hvor hurtigt vægtene aftager, og giver en langtidshukommelse, der bedre matcher 1/f-støj. Vi evaluerede metoden på NOIZEUS-korpusset under syv støjtyper og fire SNR-niveauer. Ved r=0,9 gav GL-MCRA større forbedring i segmental SNR end standard MCRA i alle betingelser, med statistisk signifikante gevinster på 0,23–0,48 dB (5,4–11,5 %, p<0,001). Perceptuelle vurderinger af talekvalitet understøttede de objektive fund. I en MOS-lytningstest med 17 deltagere rapporterede lyttere en betingelsesafhængig gevinst. Den primære begrænsning ved GL-MCRA er højere beregningsmæssig kompleksitet end standard MCRA.

[This abstract has been rewritten with the help of AI based on the project's original abstract]