Glimpse Proportion Maximization for Speech Intelligibility Enhancement

Author

Bejarano, Valentina Correa

Term

4. semester

Education

Computer Engineering, Master

Publication year

2025

Submitted on

2025-08-25

Pages

Abstract

This thesis investigates the use of Glimpse Proportion (GP) maximisation as an optimisation objective for Near-End Listening Enhancement (NELE). Unlike conventional methods based on the Speech Intelligibility Index (SII) or related metrics, the proposed approach employs a differentiable formulation of GP, enabling gradient-based optimisation under energy-preservation constraints. The method, introduced as GlimpseP, applies frequency-dependent, timeinvariant spectral weighting and is evaluated across multiple datasets (DANTALE II, AEMST, TIMIT) and noise conditions (stationary and competing speaker). Results show consistent improvements in objective intelligibility metrics, with particular advantages in fluctuating noise where glimpsing cues are most perceptually relevant. Compared to established baselines such as FractileASII, the proposed method demonstrates comparable or superior performance while maintaining robustness across conditions. These findings confirm the potential of GP as a perceptually grounded optimisation target for NELE.

Keywords

Glimpse Proportion ; NELE ; Gradient-Based

Documents

Download
View record in AAU Student Projects

A master's thesis from Aalborg University

Glimpse Proportion Maximization for Speech Intelligibility Enhancement