Visualisation Tool for Convolutional Neural Networks

Author

Marossy, Aron Jozsef

Term

4. term

Education

Innovative Communication Technologies and Entrepreneurship, Master

Publication year

2018

Abstract

This thesis addresses the limited insight that non-experts typically have into convolutional neural networks (CNNs) and investigates how a visualization tool can make their behavior more transparent. The central research question is how to design a tool for visualizing and understanding CNNs, with three sub-aims: detecting and communicating potential performance issues, making the tool accessible to newcomers, and linking input images to network activations. The scope is limited to feed-forward, static CNN architectures and focuses on improving model performance rather than training speed. The methodology combines a review of state-of-the-art techniques and existing software with analysis, requirements definition, conceptual design, and iterative agile development within an abductive research approach. The outcome is a local Python application with a browser-based interface that lets users load an existing network and dataset to visualize the architecture, layer and node parameters, and the network’s response to specific inputs. The tool also aims to flag typical causes of suboptimal performance and suggest possible remedies. The thesis motivates the need with known pitfalls and discusses practical challenges in a rapidly evolving field. The excerpt does not report quantitative evaluation results but documents the development of a working prototype and its intended uses.

Dette speciale adresserer det begrænsede indblik, som almindelige brugere og ikke-eksperter får i convolutionelle neurale netværk (CNN’er), og undersøger hvordan et visualiseringsværktøj kan gøre netværkenes adfærd mere transparent. Hovedspørgsmålet er, hvordan man designer et værktøj til at visualisere og forstå CNN’er, med fokus på tre delmål: at opdage og formidle potentielle ydelsesproblemer, at gøre værktøjet tilgængeligt for begyndere og at knytte inputbilleder til netværksaktiveringer. Projektet afgrænses til feed-forward, statiske CNN-arkitekturer og beskæftiger sig med forbedring af modelydelse frem for træningshastighed. Metodisk kombineres en gennemgang af state-of-the-art teknikker og eksisterende værktøjer med analyse, kravspecificering, konceptuel design og en iterativ, agil udviklingsproces under en abduktiv forskningsstil. Resultatet er en lokal Python-applikation med webbaseret grænseflade, hvor brugeren indlæser et eksisterende netværk og et datasæt for at få visualiseringer af arkitektur, lag- og nodeparametre samt netværkets respons på specifikke input. Værktøjet sigter også mod at identificere typiske årsager til suboptimal ydelse og foreslå mulige forbedringer. Specialet fremhæver motivationscaser, kendte faldgruber og praktiske udfordringer i et hurtigt skiftende felt. Uddraget indeholder ikke kvantitativ evaluering, men dokumenterer udviklingen af en funktionsdygtig prototype og dens tilsigtede anvendelser.

[This abstract has been generated with the help of AI directly from the project full text]

Documents

Download PDF
View record in AAU Student Projects

A master's thesis from Aalborg University

Visualisation Tool for Convolutional Neural Networks