Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Last update: Jan 02, 2023

Overview

Deep Vision and Graphics

This repo supplements course "Deep Vision and Graphics" taught at YSDA @fall'21. The course is the successor of "Deep Learning" course taught at YSDA in 2015-2021. New course focuses more on applications of deep learning for computer vision.

Lecture and seminar materials for each week are in ./week* folders. Homeworks are in ./homework* folders.

General info

Telegram chat room (russian).
YSDA deadlines & admin stuff can be found at the YSDA LMS (ysda students only).
Any technical issues, ideas, bugs in course materials, contribution ideas - add an issue

Syllabus

week01 Intro, recap of Neural network basics, optimization, backprop, biological networks
week02 Images, linear filtering, convolutional networks, batchnorms, augmentations
week03 ConvNet architectures and how to find them, sparse convolutions in 3D, ConvNets for videos, transfer learning
week04 Dense prediction: semantic segmentation, superresolution/image synthesis, perceptual losses
week05 Non-convolutional architectures: transformers (some recap of their use in NLP), mixers, FFT convolutions
week06 Visualizing and understanding deep architectures, adversarial examples
week07 Object detection, instance/panoptic segmentation, 2D/3D human pose estimation
week08 Representation learning: face recognition, verification tasks, self-supervised learning, image captioning
week09 Latent models (GLO, AEs, flow models, diffusion models, VQ-VAE, generative transformers, CLIP, DALL-E)
week10 Generative adversarial networks
week11 Shape and motion estimation: spatial transformers, optical flow, stereo, monodepth, point cloud generation, implicit and semi-implicit shape representations
week12 New view synthesis: multi-plane images, neural radiance fields, mesh-based and point-based representations for NVS, neural renderers

Contributors & course staff

Course materials and teaching performed by

Victor Lempitsky - all main track lectures
Victor Yurchenko - seminars, homeworks, admin stuff
Fedor Ratnikov - seminars, homeworks, admin staff
To be continued

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

Related tags

Overview

Deep Vision and Graphics

General info

Syllabus

Contributors & course staff

Owner

Yandex School of Data Analysis

Wordplay, an artificial Intelligence based crossword puzzle solver.

This package contains deep learning models and related scripts for RoseTTAFold

Implementing DeepMind's Fast Reinforcement Learning paper

Research on controller area network Intrusion Detection Systems

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Learning Facial Representations from the Cycle-consistency of Face (ICCV 2021)

Rotary Transformer

Happywhale - Whale and Dolphin Identification Silver🥈 Solution (26/1588)

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Modeling CNN layers activity with Gaussian mixture model

Accelerated deep learning R&D

Fast, flexible and fun neural networks.

auto-tuning momentum SGD optimizer

TVNet: Temporal Voting Network for Action Localization

Convolutional neural network web app trained to track our infant’s sleep schedule using our Google Nest camera.

3D ResNet Video Classification accelerated by TensorRT

A modified version of DeepMind's Alphafold2 to divide CPU part (MSA and template searching) and GPU part (prediction model)

Unsupervised Image Generation with Infinite Generative Adversarial Networks

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

Transformer Tracking (CVPR2021)