Multiview Dataset Toolkit

Multi-view cameras are a natural way to obtain a complete point cloud. However, to date there is only one multi-view 3D hand pose dataset, NYU. Furthermore, NYU is primarily used as a depth map dataset; although RGB images are also provided, they are of low resolution and quality. FreiHAND also records data with a multi-view setup, but the released images are not from corresponding viewpoints. In that sense, it is better regarded as a single-view dataset containing multiple views than as a true multi-view dataset.
To fill this gap, we present a new multi-view RGB-D 3D hand pose dataset. We use four RealSense D415 cameras placed at different viewpoints to record 4 RGB-D sequences from 4 subjects, at a resolution of 640 × 480. We use a 21-joint model to annotate the hand pose. Additionally, we provide hand masks, 2D and 3D joint locations, hand meshes in the form of MANO parameters, real complete hand point clouds, and full camera parameters. In particular, we provide extrinsic camera parameters, so users can easily combine information across views.
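
To illustrate how intrinsic and extrinsic parameters let you relate the four views, here is a minimal sketch under a standard pinhole model. It is not taken from toolkit.py: the function names, the intrinsic matrix values, the camera-to-world convention for the 4x4 extrinsic matrices, and the millimeter depth unit are all assumptions made for illustration, so check them against the calibration files shipped with the dataset.

```python
import numpy as np

def backproject(u, v, depth, K):
    """Lift pixel (u, v) with a depth value to a 3D point in the camera frame."""
    fx, fy = K[0, 0], K[1, 1]
    cx, cy = K[0, 2], K[1, 2]
    return np.array([(u - cx) * depth / fx, (v - cy) * depth / fy, depth])

def transform(T, p):
    """Apply a 4x4 rigid transform to a 3D point."""
    return (T @ np.append(p, 1.0))[:3]

def project(p, K):
    """Project a camera-frame 3D point to pixel coordinates."""
    uvw = K @ p
    return uvw[:2] / uvw[2]

# Hypothetical calibration values, NOT the dataset's real parameters.
K = np.array([[600.0, 0.0, 320.0],
              [0.0, 600.0, 240.0],
              [0.0, 0.0, 1.0]])
cam0_to_world = np.eye(4)            # assumed convention: camera -> world
cam1_to_world = np.eye(4)
cam1_to_world[:3, 3] = [100.0, 0.0, 0.0]  # camera 1 shifted 100 mm along x

# Pixel at the image center of camera 0, 500 mm away (assumed unit).
p_cam0 = backproject(320, 240, 500.0, K)
p_world = transform(cam0_to_world, p_cam0)
p_cam1 = transform(np.linalg.inv(cam1_to_world), p_world)
uv_cam1 = project(p_cam1, K)
print(p_cam0, uv_cam1)
```

If the dataset stores world-to-camera matrices instead, the inverse is applied on the other side of the chain; the rest of the sketch is unchanged.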

Basic setup

download data
install basic requirements

pip install numpy matplotlib scikit-image transforms3d tqdm opencv-python trimesh pyrender

example code

python toolkit.py

Provided data

color images from four views
depth images from four views
intrinsic and extrinsic camera parameters
21 hand joints
- 0 wrist
- 1 mcp index, 2 pip index, 3 dip index, 4 tip index
- 5 mcp middle, 6 pip middle, 7 dip middle, 8 tip middle
- 9 mcp ring, 10 pip ring, 11 dip ring, 12 tip ring
- 13 mcp pinky, 14 pip pinky, 15 dip pinky, 16 tip pinky
- 17 mcp thumb, 18 pip thumb, 19 dip thumb, 20 tip thumb
MANO parameters
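
The joint ordering listed above can be written down as a lookup table, which is handy when indexing the 21 joint annotations or drawing a skeleton. The sketch below is illustrative only: `FINGERS`, `JOINT_NAMES`, and `BONES` are hypothetical names, not identifiers from toolkit.py, and the bone list assumes the usual hand skeleton in which the wrist connects to each finger's first joint.

```python
# Finger order follows the joint list above: index, middle, ring, pinky, thumb.
FINGERS = ("index", "middle", "ring", "pinky", "thumb")
SEGMENTS = ("mcp", "pip", "dip", "tip")

# Index 0 is the wrist; each finger then occupies four consecutive indices.
JOINT_NAMES = ["wrist"] + [f"{s} {f}" for f in FINGERS for s in SEGMENTS]

# Assumed kinematic tree: wrist -> mcp -> pip -> dip -> tip for every finger.
BONES = []
for k in range(len(FINGERS)):
    base = 1 + 4 * k
    BONES.append((0, base))
    BONES.extend((base + i, base + i + 1) for i in range(3))

for idx, name in enumerate(JOINT_NAMES):
    print(idx, name)
```

With a `(21, 3)` array of 3D joints, `joints[JOINT_NAMES.index("tip thumb")]` would then select the thumb tip, and iterating over `BONES` gives the line segments for a skeleton plot.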

Access the dataset

data usage in toolkit.py
- drawMesh
- drawPose4view
- getBetterDepth

Info for our camera calibration

here

Terms of use

@InProceedings{Local2021,
  author    = {Ziwei Yu and Linlin Yang and Shicheng Chen and Angela Yao},
  title     = {Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation},
  booktitle = {British Machine Vision Conference (BMVC)},
  year      = {2021},
  url       = {https://github.com/ShichengChen/multiviewDataset}
}
