Training and Understanding Modern Deep Networks
Title: |
Training and Understanding Modern Deep Networks |
DNr: |
Berzelius-2024-343 |
Project Type: |
LiU Berzelius |
Principal Investigator: |
Hossein Azizpour <azizpour@kth.se> |
Affiliation: |
Kungliga Tekniska högskolan |
Duration: |
2024-10-01 – 2025-04-01 |
Classification: |
10207 |
Keywords: |
|
Abstract
This proposal is for the scientific studies within the group of Hossein Azizpour. In Azizpour's group we try to understand modern deep networks and train them for impactful applications. Therefore, there are two main types of projects in the group. One is on the fundamentals of deep learning and the other on the application side.
Fundamental of deep networks: here three main tracks are pursued for the foundations of trustworthy deep network: (i) interpreting and understanding trained deep networks especially from the lens of a functional analysis, (ii) explaining individual decisions of a trained deep network in a human-interpretable way, and (iii) quantification and robustness to uncertainty in the decisions of a trained deep network.
Applications: on the application side, we apply the findings of the fundamental research on a few impactful applications including: (i) general computer vision, (ii) breast cancer diagnosis and prognosis specifically estimating risk of cancer and explaining the predictions, (iii) earth observation, especially uncertainty in the developments for detecting urbanization and forest fires, and (iv) protein structure modelling, especially the interpretable models and uncertainty of predictions, (v) modeling air flow in simulations and from real-world measurements using deep generative models.
As such all of the directions are either purely empirical or require empirical validations of the theories. Such empirical investigations for modern deep networks such as large ResNets and visual transformers can only be enabled with the help of a large GPU cluster such as Berzelius.
This proposal is for continuation of all the ongoing projects within Azizpour's group. The active projects that is requested to be continued is:
- Berzelius-2024-116
We have asked to increase the proposed quota to 12000 GPU/h since the projects on using deep generative models on both fluid flows and protein data are computationally demanding in both the method (generative models) and the size of the data.