Codes_APN

Official code for the CVPR21 paper: Normal Learning in Videos with Attention Prototype Network (https://arxiv.org/abs/2108.11055)

Figure: Overview of our approach, based on the APU and CAU modules.

Introduction

Frame reconstruction (of the current or a future frame) based on an Auto-Encoder (AE) is a popular approach to video anomaly detection. With models trained on normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced a memory bank into the AE to encode diverse normal patterns across the training videos. However, they are memory-consuming and cannot cope with unseen scenarios in the testing data. In this work, we propose a self-attention prototype unit (APU) that encodes the normal latent space as prototypes in real time, free from extra memory cost. In addition, we introduce a circulative attention mechanism into our backbone to form a novel feature-extracting learner, namely the Circulative Attention Unit (CAU). It enables fast adaptation to new scenes with only a few update iterations. Extensive experiments are conducted on various benchmarks. The superior performance over the state of the art demonstrates the effectiveness of our method.
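The idea that anomalous frames reconstruct worse than normal ones can be illustrated with a small scoring sketch. This is not taken from this repository: the function names `psnr` and `anomaly_scores` are hypothetical, and per-video min-max normalisation of PSNR is a convention common in the reconstruction-based literature, assumed here for illustration only. The toy "reconstructions" are just noisy copies of random frames.

```python
import numpy as np

def psnr(frame, recon, max_val=1.0):
    """Peak signal-to-noise ratio between a frame and its reconstruction."""
    mse = np.mean((frame - recon) ** 2)
    # Clamp the error so a perfect reconstruction does not divide by zero.
    return 10.0 * np.log10(max_val ** 2 / max(mse, 1e-12))

def anomaly_scores(frames, recons):
    """Min-max normalise PSNR over one video; a higher score means more anomalous."""
    p = np.array([psnr(f, r) for f, r in zip(frames, recons)])
    span = p.max() - p.min()
    if span == 0:
        return np.zeros_like(p)
    return 1.0 - (p - p.min()) / span

# Toy data (assumption): five 8x8 frames, frame 2 is reconstructed badly.
rng = np.random.default_rng(0)
frames = rng.random((5, 8, 8))
noise_level = np.array([0.01, 0.01, 0.2, 0.01, 0.01])
recons = frames + noise_level[:, None, None] * rng.standard_normal(frames.shape)

scores = anomaly_scores(frames, recons)
print(scores.round(3))
```

The frame with the largest reconstruction error receives the highest score. In practice the reconstructions would come from the trained model rather than from added noise, and the scores would be compared against frame-level labels.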

Performance

Our method achieves state-of-the-art (SOTA) results on many video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
               Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}
