Pytorch implementation of face attention network

Last update: Dec 09, 2022

Related tags

Overview

Face Attention Network

Pytorch implementation of face attention network as described in Face Attention Network: An Effective Face Detector for the Occluded Faces. The baseline is RetinaNet followed by this repo.

Requirements

Python3
Pytorch0.4
torchvision
tensorboardX

Installation

Install packages.

sudo apt-get install tk-dev python-tk
pip install cffi
pip install cython
pip install pandas
pip install tensorboardX

Build NMS.

cd Face_Attention_Network/lib
sh build.sh

Create folders.

cd Face_Attention_Network/
mkdir ckpt mAP_txt summary weight

Datasets

You should prepare three CSV or TXT files including train annotations file, valid annotations file and label encoding file.

Annotations format

Two examples are as follows:

$image_path/img_1.jpg x1 y1 x2 y2 label
$image_path/img_2.jpg . . . . .

Images with more than one bounding box should use one row per box. When an image does not contain any bounding box, set them '.'.

Label encoding file

A TXT file (classes.txt) is needed to map label to ID. Each line means one label name and its ID. One example is as follows:

face 0

Pretrained Model

We use resnet18, 34, 50, 101, 152 as the backbone. You should download them and put them to /weight.

resnet18: https://download.pytorch.org/models/resnet18-5c106cde.pth
resnet34: https://download.pytorch.org/models/resnet34-333f7ec4.pth
resnet50: https://download.pytorch.org/models/resnet50-19c8e357.pth
resnet101: https://download.pytorch.org/models/resnet101-5d3b4d8f.pth
resnet152: https://download.pytorch.org/models/resnet152-b121ed2d.pth

Training

python train.py --csv_train <$path/train.txt> --csv_val <$path/val.txt> --csv_classes <$path/classes.txt> --depth <50> --pretrained resnet50-19c8e357.pth --model_name <model name to save>

Visualization Result

Detection result

Attention map at different level (P3~P7)

Pytorch implementation of face attention network

Related tags

Overview

Face Attention Network

Requirements

Installation

Datasets

Annotations format

Label encoding file

Pretrained Model

Training

Visualization Result

Reference

Owner

Hooks

Zero-Cost Proxies for Lightweight NAS

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification

An implementation of Fastformer: Additive Attention Can Be All You Need in TensorFlow

Fine-tuning StyleGAN2 for Cartoon Face Generation

Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"

GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers

Weakly Supervised End-to-End Learning (NeurIPS 2021)

PyTorch implementation for the paper Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime

Deep Learning Package based on TensorFlow

Code for ICML 2021 paper: How could Neural Networks understand Programs?

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision

Pytorch code for "DPFM: Deep Partial Functional Maps" - 3DV 2021 (Oral)

Simple PyTorch hierarchical models.

Spatial Single-Cell Analysis Toolkit

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

End-to-End Object Detection with Fully Convolutional Network

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

Experiments with the Robust Binary Interval Search (RBIS) algorithm, a Query-Based prediction algorithm for the Online Search problem.