Author(s)
Term
4. Term
Publication year
2022
Submitted on
2022-06-17
Pages
117 pages
Abstract
Deep learning is an area that is only sparsely explored for metagenomic bin- ning. The existing deep learning-based approaches usually preprocess raw DNA sequences into input features such as com- position and abundance and perform rep- resentation learning and clustering in two steps. The utilization of unprocessed DNA sequences as input shows promis- ing results for gene prediction. Joint deep clustering leads to better results for im- age clustering than basic approaches like k-means clustering. In this report, we in- vestigate the potential of joint end-to-end unsupervised learning and the utilization of unprocessed contigs as inputs for the task of metagenomic binning. We propose two new binners: Deep Con- volutional Metagenomic Binner (DCMB) and Deep Stacked Metagenomic Bin- ner (DSMB). Both binners utilize KL divergence-based joint deep clustering. The DCMB takes unprocessed contigs and the DSMB uses abundance and composi- tion as inputs. The performance of both binners is bench- marked on the CAMI Low dataset and compared to metagenomic binners VAMB, MetaBat2, and SolidBin. The results show that metagenomic information requires preprocessing to obtain meaningful repre- sentations and that joint end-to-end learn- ing slightly improves the number of recov- ered bins.
Keywords
metagenomics ; binning ; deep learning ; joint clustering ; end-to-end learning ; KL divergence ; VAMB ; metagenomic binning ; DCMB ; DSMB ; DVMB
Documents
Colophon: This page is part of the AAU Student Projects portal, which is run by Aalborg University. Here, you can find and download publicly available bachelor's theses and master's projects from across the university dating from 2008 onwards. Student projects from before 2008 are available in printed form at Aalborg University Library.
If you have any questions about AAU Student Projects or the research registration, dissemination and analysis at Aalborg University, please feel free to contact the VBN team. You can also find more information in the AAU Student Projects FAQs.