Single-Shot Motion Completion with Transformer

Last update: Dec 29, 2022

Related tags

Overview

Single-Shot Motion Completion with Transformer

👉 [Preprint] 👈

Abstract

Motion completion is a challenging and long-discussed problem, which is of great significance in film and game applications. For different motion completion scenarios (in-betweening, in-filling, and blending), most previous methods deal with the completion problems with case-by-case designs. In this work, we propose a simple but effective method to solve multiple motion completion problems under a unified framework and achieves a new state of the art accuracy under multiple evaluation settings. Inspired by the recent great success of attention-based models, we consider the completion as a sequence to sequence prediction problem. Our method consists of two modules - a standard transformer encoder with self-attention that learns long-range dependencies of input motions, and a trainable mixture embedding module that models temporal information and discriminates key-frames. Our method can run in a non-autoregressive manner and predict multiple missing frames within a single forward propagation in real time. We finally show the effectiveness of our method in music-dance applications.

State-of-the-art on Lafan1 dataset

With the help of Transformer, we achieve a new SOTA result on Lafan1 dataset.

Lengths = 30	L2Q	L2P	NPSS
Zero-Vel	1.51	6.60	0.2318
Interp.	0.98	2.32	0.2013
ERD-QV	0.69	1.28	0.1328
Ours	0.61	1.10	0.1222

Some results (blue appearaces represent keyframes):

Dance Infilling on Anidance Dataset

We also evaluate our method on the Anidance dataset:

Infilling on the test set (black skeletons are the keyframes):

(From Left to Right: Ours, Interp. and Ground Truth)

Infilling on random keyframes (keyframes are randomly chosen from the test set with a random order for simulating in-the-wild scenario):

(From Left to Right: Ours, Interp. and Ground Truth)

Dance blending

Our method can also work on complex dance movement completion:

Code

Coming soon

Citation

@misc{duan2021singleshot,
      title={Single-Shot Motion Completion with Transformer}, 
      author={Yinglin Duan and Tianyang Shi and Zhengxia Zou and Yenan Lin and Zhehui Qian and Bohan Zhang and Yi Yuan},
      year={2021},
      eprint={2103.00776},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Single-Shot Motion Completion with Transformer

Related tags

Overview

Single-Shot Motion Completion with Transformer

Abstract

State-of-the-art on Lafan1 dataset

Dance Infilling on Anidance Dataset

Dance blending

Code

Citation

Owner

FuxiCV

[ICLR 2021] Is Attention Better Than Matrix Decomposition?

Integrated physics-based and ligand-based modeling.

A big endian Gentoo port developed on a Pine64.org RockPro64

An implementation of the WHATWG URL Standard in JavaScript

Transfer SemanticKITTI labeles into other dataset/sensor formats.

A simple algorithm for extracting tree height in sparse scene from point cloud data.

Graph Analysis From Scratch

A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Implementation for Stankevičiūtė et al. "Conformal time-series forecasting", NeurIPS 2021.

A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.

EigenGAN Tensorflow, EigenGAN: Layer-Wise Eigen-Learning for GANs

Hide screen when boss is approaching.

Kaggle-titanic - A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.

NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size

PyTorch implementation for paper "Full-Body Visual Self-Modeling of Robot Morphologies".

Tensorflow implementation of Character-Aware Neural Language Models.

Measures input lag without dedicated hardware, performing motion detection on recorded or live video

FastCover: A Self-Supervised Learning Framework for Multi-Hop Influence Maximization in Social Networks by Anonymous.

Collaborative forensic timeline analysis

Continuous Security Group Rule Change Detection & Response at scale