Structural Simplicity in Neural Networks
Title: Structural Simplicity in Neural Networks
DNr: NAISS 2024/22-1047
Project Type: NAISS Small Compute
Principal Investigator: Joel Nilsson <joel.nilsson@liu.se>
Affiliation: Linköpings universitet
Duration: 2024-08-12 – 2025-09-01
Classification: 10202
Keywords:
Abstract
The aim of this project is to understand, from a new point of view, the ability of deep neural networks to learn successfully from data and to generalize. Neural networks (trained in the presence or absence of explicit regularization) tend to represent a given input-output relation in a structurally “simple” manner, in the sense that many parts of the network encode the same, most informative feature(s) of the input. We speculate that the degree of simplicity, which is related to the ability of the models to learn the high-information modes of the data and disregard the noise, can be quantified using graph-theoretic measures on trained neural networks. So far we have observed this tendency toward simplicity in the structure of small-scale networks of different architectures, e.g., ANNs and CNNs, trained on real data. To test the hypothesis further, we need to run experiments on medium- to large-scale networks, which requires significant computational resources.
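The abstract does not name the specific graph-theoretic measures; as a minimal sketch of what such a quantification might look like, the hypothetical simplicity_score below (the function name, the alignment threshold, and the choice of measure are all assumptions for illustration, not the project's actual method) treats the hidden units of one trained layer as graph nodes, links pairs whose incoming weight vectors are nearly parallel, and reports the fraction of units in the largest aligned cluster: a layer in which many units encode the same feature would form one large component.

    import numpy as np
    import networkx as nx

    def simplicity_score(W: np.ndarray, threshold: float = 0.9) -> float:
        """Crude structural-simplicity proxy for one trained layer.

        W has shape (n_units, n_inputs): row i is the incoming weight
        vector of hidden unit i. Units whose weight vectors are nearly
        parallel arguably "encode the same feature"; we link such pairs
        and report the fraction of units in the largest aligned cluster.
        This is an illustrative assumption, not the project's measure.
        """
        # Cosine similarity between all pairs of incoming weight vectors.
        norms = np.linalg.norm(W, axis=1, keepdims=True)
        unit = W / np.clip(norms, 1e-12, None)
        sim = np.abs(unit @ unit.T)  # |cos| treats sign-flipped units as aligned

        # Build a graph with an edge wherever alignment exceeds the threshold.
        G = nx.Graph()
        G.add_nodes_from(range(W.shape[0]))
        i, j = np.where(np.triu(sim, k=1) > threshold)
        G.add_edges_from(zip(i.tolist(), j.tolist()))

        # Fraction of units in the largest "same feature" component.
        largest = max(nx.connected_components(G), key=len)
        return len(largest) / W.shape[0]

    # Example: a layer whose rows are noisy copies of one feature detector
    # should score near 1.0, while an i.i.d. random layer scores near 1/32.
    rng = np.random.default_rng(0)
    feature = rng.normal(size=64)
    redundant = feature + 0.05 * rng.normal(size=(32, 64))
    random_layer = rng.normal(size=(32, 64))
    print(simplicity_score(redundant))     # close to 1.0
    print(simplicity_score(random_layer))  # close to 1/32

On this proxy, a high score would correspond to the structurally “simple” regime described above, while a score near 1/n_units would indicate units encoding mutually independent features.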