AAU Student Projects - visit Aalborg University's student projects portal
A master's thesis from Aalborg University
Book cover


SNR Dependent Models for Parkinsons Disease Detection

Author

Term

4. Term

Publication year

2019

Abstract

Parkinson’s disease (PD) is a common neurodegenerative disorder, and analyzing short speech recordings offers a low‑cost aid for diagnosis. This thesis investigates the impact of environmental noise, expressed as signal‑to‑noise ratio (SNR), on automatic PD detection from speech, with a focus on how mismatch between training and test SNR affects performance. The central hypothesis is that better matching between training and test conditions at the model level can improve accuracy. Using a GMM‑UBM classification framework with PLP‑based speech features, SNR‑dependent models are built and performance is studied when training and test conditions are matched or mismatched across multiple SNR levels by combining a PD speech database with additive noise recordings. The system employs a universal background model, model adaptation, and log‑likelihood‑ratio scoring. The study quantifies how SNR mismatch degrades recognition and assesses whether SNR‑dependent modeling can mitigate this effect. Specific results are not provided in the excerpt; it describes the approach, feature extraction, modeling choices, data, and evaluation setup.

Parkinsons sygdom (PD) er en almindelig neurodegenerativ lidelse, og analyse af korte taleoptagelser kan bruges som et lavomkostnings redskab til støtte for diagnosticering. Denne afhandling undersøger betydningen af miljøstøj, målt som signal‑støj‑forhold (SNR), for automatisk PD‑detektion ud fra tale, med særligt fokus på, hvordan misforhold mellem trænings‑ og test‑SNR påvirker ydeevnen. Hypotesen er, at en bedre matchning mellem trænings‑ og testforhold på modelniveau kan forbedre nøjagtigheden. Inden for en GMM‑UBM‑klassifikationsramme og med PLP‑baserede taleegenskaber opbygges SNR‑afhængige modeller, og præstationen studeres, når trænings‑ og testbetingelser enten er matchet eller ikke matchet på tværs af flere SNR‑niveauer, ved at kombinere en PD‑taledatabase med additive støjoptagelser. Systemet anvender en universel baggrundsmodel, modeltilpasning og log‑likelihood‑ratio‑scoring. Undersøgelsen kvantificerer, hvordan SNR‑mismatch degraderer genkendelsen, og vurderer om SNR‑afhængig modellering kan afbøde denne effekt. De konkrete resultater er ikke angivet i uddraget; her beskrives tilgang, featureekstraktion, modeller, data og evalueringsopsætning.

[This apstract has been generated with the help of AI directly from the project full text]