General Prerequisites: Only elementary linear algebra and probability are assumed in this course; knowledge from the following Prelims courses is also helpful: linear algebra, probability, analysis, constructive mathematics, and statistics and data analysis. It is recommended that students have familiarity with some of: more advanced statistics, optimisation (B6.3, C6.2), networks (C5.4), and numerical linear algebra (C6.1), though none of these courses is required, as the material is self-contained.
Course Overview: A course on theories of deep learning.
Learning Outcomes: Students will become familiar with the variety of architectures for deep nets, including the scattering transform, and with ingredients such as types of nonlinear transform, pooling, convolutional structure, and how nets are trained. Students will then focus on a variety of theoretical perspectives on why deep networks perform as observed, with examples such as: dictionary learning and the transferability of early layers, energy decay with depth, Lipschitz continuity of the net, how depth overcomes the curse of dimensionality, the construction of adversarial examples, the geometry of nets viewed through random matrix theory, and the learning of invariances.
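As a minimal, purely illustrative sketch (not drawn from the course materials), the following Python snippet assembles the ingredients named above (convolutional structure, a pointwise nonlinearity, and pooling) and performs a single gradient-based training step; the use of PyTorch, and all layer sizes and hyperparameters, are assumptions chosen for the example.

    # Illustrative only: a tiny convolutional net with the ingredients
    # listed in the learning outcomes, trained for one step on toy data.
    import torch
    import torch.nn as nn

    net = nn.Sequential(
        nn.Conv2d(1, 8, kernel_size=3, padding=1),  # convolutional structure
        nn.ReLU(),                                  # pointwise nonlinear transform
        nn.MaxPool2d(2),                            # pooling
        nn.Flatten(),
        nn.Linear(8 * 14 * 14, 10),                 # linear classifier head
    )

    x = torch.randn(32, 1, 28, 28)       # toy batch of 32 greyscale 28x28 images
    y = torch.randint(0, 10, (32,))      # toy class labels
    loss = nn.CrossEntropyLoss()(net(x), y)

    opt = torch.optim.SGD(net.parameters(), lr=0.1)
    opt.zero_grad()
    loss.backward()                      # backpropagation through the net
    opt.step()                           # one step of stochastic gradient descent

The first-layer filters learned by nets of this kind are the objects that the dictionary-learning perspective above seeks to explain, and the composition of linear maps with nonlinearities is what the Lipschitz-continuity analysis bounds.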
Course Synopsis: Deep learning is the dominant method for machines to perform classification tasks at reliability rates exceeding those of humans, as well as for outperforming world champions in games such as Go. Alongside the proliferating application of these techniques, practitioners have developed a good understanding of the properties that make these deep nets effective, such as initial layers learning weights similar to those in dictionary learning, while deeper layers instantiate invariance to transforms such as dilation, rotation, and modest diffeomorphisms. A number of mathematical theories are now being developed to accompany these observations; this course will explore these varying perspectives.