Official PyTorch implementation of "The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation" (ICCV 21).

Last update: Dec 25, 2022

Related tags

Deep Learning center-group

Overview

CenterGroup

This the official implementation of our ICCV 2021 paper

The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation,
Guillem Brasó, Nikita Kister, Laura Leal-Taixé
We introduce CenterGroup, an attention-based framework to estimate human poses from a set of identity-agnostic keypoints and person center predictions in an image. Our approach uses a transformer to obtain context-aware embeddings for all detected keypoints and centers and then applies multi-head attention to directly group joints into their corresponding person centers. While most bottom-up methods rely on non-learnable clustering at inference, CenterGroup uses a fully differentiable attention mechanism that we train end-to-end together with our keypoint detector. As a result, our method obtains state-of-the-art performance with up to 2.5x faster inference time than competing bottom-up methods.

@article{Braso_2021_ICCV,
    author    = {Bras\'o, Guillem and Kister, Nikita and Leal-Taix\'e, Laura},
    title     = {The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation},
    journal = {ICCV},
    year      = {2021}
}

Main Results

With the code contained in this repo, you should be able to reproduce the following results.

Results on COCO val2017

Method	Detector	Multi-Scale Test	Input size	AP	AP.5	AP .75	AP (M)	AP (L)
CenterGroup	HigherHRNet-w32	✘	512	69.0	87.7	74.4	59.9	75.3
CenterGroup	HigherHRNet-w48	✘	640	71.0	88.7	76.5	63.1	75.2
CenterGroup	HigherHRNet-w32	✔	512	71.9	89.0	78.0	63.7	77.4
CenterGroup	HigherHRNet-w48	✔	640	73.3	89.7	79.2	66.4	76.7

Results on COCO test2017

Method	Detector	Multi-Scale Test	Input size	AP	AP .5	AP .75	AP (M)	AP (L)
CenterGroup	HigherHRNet-w32	✘	512	67.6	88.6	73.6	62.0	75.6
CenterGroup	HigherHRNet-w48	✘	640	69.5	89.7	76.0	65.0	76.2
CenterGroup	HigherHRNet-w32	✔	512	70.3	90.0	76.9	65.4	77.5
CenterGroup	HigherHRNet-w48	✔	640	71.4	90.5	78.1	67.2	77.5

Results on CrowdPose test

Method	Detector	Multi-Scale Test	Input size	AP	AP .5	AP .75	AP (E)	AP (M)	AP (H)
CenterGroup	HigherHRNet-w48	✘	640	67.6	87.6	72.7	74.2	68.1	61.1
CenterGroup	HigherHRNet-w48	✔	640	70.3	89.1	75.7	77.3	70.8	63.2

Installation

Please see docs/INSTALL.md

Model Zoo

Please see docs/MODEL_ZOO.md

Evaluation

To evaluate a model you have to specify its configuration file, its checkpoint, and the number of GPUs you want to use. All of our configurations and checkpoints are available here) For example, to run CenterGroup with a HigherHRNet32 detector and a single GPU you can run the following:

NUM_GPUS=1
./tools/dist_test.sh configs/centergroup2/coco/higherhrnet_w32_coco_512x512 models/centergroup/centergroup_higherhrnet_w32_coco_512x512.pth $NUM_GPUS 1234

If you want to use multi-scale testing, please add the --multi-scale flag, e.g.:

./tools/dist_test.sh configs/centergroup2/coco/higherhrnet_w32_coco_512x512 models/centergroup/centergroup_higherhrnet_w32_coco_512x512.pth $NUM_GPUS 1234 --multi-scale

You can also modify any other config entry with the --cfg-options entry. For example, to disable flip-testing, which is used by default, you can run:

./tools/dist_test.sh configs/centergroup2/coco/higherhrnet_w32_coco_512x512 models/centergroup/centergroup_higherhrnet_w32_coco_512x512.pth $NUM_GPUS 1234 --cfg-options model.test_cfg.flip_test=False

You may need to modify the checkpoint's path, depending on where you downloaded it, and the entry data_root in the config file, depending on where you stored your data.

Training HigherHRNet with Centers

TODO

Training CenterGroup

TODO

Demo

TODO

Acknowledgements

Our code is based on mmpose, which reimplemented HigherHRNet's work. We thank the authors of these codebases for their great work!

Official PyTorch implementation of "The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation" (ICCV 21).

Related tags

Overview

CenterGroup

Main Results

Results on COCO val2017

Results on COCO test2017

Results on CrowdPose test

Installation

Model Zoo

Evaluation

Training HigherHRNet with Centers

Training CenterGroup

Demo

Acknowledgements

Owner

Dynamic Vision and Learning Group

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention

Object tracking using YOLO and a tracker(KCF, MOSSE, CSRT) in openCV

MatchGAN: A Self-supervised Semi-supervised Conditional Generative Adversarial Network

Robotics environments

Learning Spatio-Temporal Transformer for Visual Tracking

Official Matlab Implementation for "Tiny Obstacle Discovery by Occlusion-aware Multilayer Regression", TIP 2020

PlenOctree Extraction algorithm

Creating Multi Task Models With Keras

Retinal vessel segmentation based on GT-UNet

Public repository created to store my custom-made tools for Just Dance (UbiArt Engine)

WHENet - ONNX, OpenVINO, TFLite, TensorRT, EdgeTPU, CoreML, TFJS, YOLOv4/YOLOv4-tiny-3L

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Segmentation-Aware Convolutional Networks Using Local Attention Masks

SSD: A Unified Framework for Self-Supervised Outlier Detection [ICLR 2021]

A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"

The Official PyTorch Implementation of DiscoBox.

Official implementation for the paper: Multi-label Classification with Partial Annotations using Class-aware Selective Loss