Term
4. term
Education
Publication year
2021
Submitted on
2021-06-11
Pages
18 pages
Abstract
Knowing bacterial interactions in waste water treatment plants (WWTPs) can help manage the cleaning process, optimize it, avoid eutrophication and avoid pollution of effluent waters. In this paper we propose to find bacterial interactions in WWTPs by clustering pairwise univariate time series consisting of bacterial abundances sampled from activated sludge. We do this by modifying a deep clustering method called DPSOM to take pairs of bacteria as input. We then propose to split these pairs into subsequences called windows and performing the clustering on these windows. These cluster of windows are then used to produce clusters of the original full-length pairs. To help understand the clustering of the pairs we provide visual explanations, with the LIME framework, of which features in a pair contribute to that pair's clustering. As the dataset contains no ground truth in terms of interactions, we propose to evaluate our model using non-standard clustering metrics such as Pearson correlation coefficient and cluster-based prediction in addition to the LIME explanations.
Documents
Colophon: This page is part of the AAU Student Projects portal, which is run by Aalborg University. Here, you can find and download publicly available bachelor's theses and master's projects from across the university dating from 2008 onwards. Student projects from before 2008 are available in printed form at Aalborg University Library.
If you have any questions about AAU Student Projects or the research registration, dissemination and analysis at Aalborg University, please feel free to contact the VBN team. You can also find more information in the AAU Student Projects FAQs.