Official PyTorch Implementation of Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity

Last update: Nov 16, 2022

UnRigidFlow

This is the official PyTorch implementation of UnRigidFlow (IJCAI2019).

Here are two sample results from our unsupervised models, one on KITTI 15 and one on Cityscapes (about 10 MB per GIF).

If you find this repo useful in your research, please consider citing:

@inproceedings{Liu:2019:unrigid,
title = {Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity},
author = {Liang Liu and Guangyao Zhai and Wenlong Ye and Yong Liu},
booktitle = {International Joint Conference on Artificial Intelligence, IJCAI},
year = {2019}
}

Requirements

This codebase was developed and tested with Python 3.5, PyTorch >= 0.4.1, OpenCV 3.4, CUDA 9.0, and Ubuntu 16.04.
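To check an installed package against the minimum versions listed above, a small comparison helper can be useful. This is a minimal sketch and not part of the repository: `version_tuple` and `at_least` are hypothetical helper names, and the parser assumes version strings begin with dot-separated integers (suffixes such as `+cu90` or `.post2` are ignored).

```python
import re


def version_tuple(version):
    """Turn '1.0.1.post2' or '0.4.1+cu90' into a tuple of leading ints."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError("no numeric version in {!r}".format(version))
    return tuple(int(part) for part in match.group(0).split("."))


def at_least(installed, minimum):
    """Return True if `installed` is the same as or newer than `minimum`."""
    a = version_tuple(installed)
    b = version_tuple(minimum)
    # Pad the shorter tuple with zeros so that '1.0' equals '1.0.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

With PyTorch installed, `at_least(torch.__version__, "0.4.1")` checks the stated minimum, and `at_least(torch.__version__, "1.0")` indicates whether the PyTorch >= 1.0 notes in this section apply.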

Most of the Python packages can be installed with:

pip3 install -r requirements.txt

In addition, the optimized correlation layer with its CUDA kernel must be compiled manually with:

cd <correlation_package>
python3 setup.py install

Then add <correlation_package> to $PYTHONPATH.
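To confirm that the compiled extension is visible to Python once $PYTHONPATH has been updated, a quick check like the following may help. This is a minimal sketch: the module name `correlation_cuda` is a placeholder assumption, not a name confirmed by this repository, so substitute whatever module your build actually installs.

```python
import importlib.util
import os


def is_importable(module_name):
    """Return True if Python can locate a top-level module by name."""
    return importlib.util.find_spec(module_name) is not None


def pythonpath_entries():
    """Return the directories currently listed in $PYTHONPATH."""
    raw = os.environ.get("PYTHONPATH", "")
    return [entry for entry in raw.split(os.pathsep) if entry]


if __name__ == "__main__":
    # Hypothetical module name; replace it with the one your build produces.
    name = "correlation_cuda"
    print("PYTHONPATH entries:", pythonpath_entries())
    print("{} importable: {}".format(name, is_importable(name)))
```

If the module is reported as not importable, the directory containing the built extension is probably missing from the printed list.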

Note that if you use PyTorch >= 1.0, you need to make some changes; see NVIDIA/flownet2-pytorch#98.

In short: replace #include <torch/torch.h> with #include <torch/extension.h>, add #include <ATen/cuda/CUDAContext.h>, and replace every at::globalContext().getCurrentCUDAStream() with at::cuda::getCurrentCUDAStream().
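The three source changes for PyTorch >= 1.0 can also be applied programmatically. The sketch below operates on the text of a single source file. `patch_source` is a hypothetical helper, not part of this repository. Where the `CUDAContext.h` include goes is not specified above, so placing it directly after the `torch/extension.h` include (or at the top of the file if that include is absent), and only in files that use the stream accessor, is an assumption.

```python
OLD_INCLUDE = "#include <torch/torch.h>"
NEW_INCLUDE = "#include <torch/extension.h>"
CUDA_CONTEXT_INCLUDE = "#include <ATen/cuda/CUDAContext.h>"
OLD_STREAM = "at::globalContext().getCurrentCUDAStream()"
NEW_STREAM = "at::cuda::getCurrentCUDAStream()"


def patch_source(text):
    """Apply the three PyTorch >= 1.0 edits to one source file's text."""
    # 1. Swap the old header for the extension header.
    text = text.replace(OLD_INCLUDE, NEW_INCLUDE)
    # 2. Replace every call to the old stream accessor.
    text = text.replace(OLD_STREAM, NEW_STREAM)
    # 3. Add the CUDA context header once, only where the new accessor
    #    is used (placement is an assumption, see the note above).
    if NEW_STREAM in text and CUDA_CONTEXT_INCLUDE not in text:
        if NEW_INCLUDE in text:
            text = text.replace(
                NEW_INCLUDE, NEW_INCLUDE + "\n" + CUDA_CONTEXT_INCLUDE, 1
            )
        else:
            text = CUDA_CONTEXT_INCLUDE + "\n" + text
    return text


if __name__ == "__main__":
    sample = (
        "#include <torch/torch.h>\n"
        "auto stream = at::globalContext().getCurrentCUDAStream();\n"
    )
    print(patch_source(sample))
```

The function is idempotent, so running it twice on the same file leaves the result unchanged. Back up the sources before writing the patched text to disk.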

Training and Evaluation

We focus mainly on the KITTI benchmark. To train the model, you need to download all of the KITTI raw data and calibration files. To validate the models, you also need the KITTI 2012 and KITTI 2015 training files with their calibration files [1], [2].

Complete training consists of three steps:

Train the flow model separately:

python3 train.py -c configs/KITTI_flow.json

Train the depth model separately:

python3 train.py -c configs/KITTI_depth_stereo.json

Train the flow and depth models jointly:

python3 train.py -c configs/KITTI_rigid_flow_stereo.json

For evaluation, add the --e option to the commands above and modify the corresponding model path.
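The three training steps and their evaluation variants can be scripted so that they run in order. The following is a minimal sketch that uses only the commands and config paths listed above. `build_command` and `run_all` are hypothetical helper names, and the model path for evaluation still has to be edited by hand, since this sketch does not touch it.

```python
import subprocess

# Config files for the three training steps, in the order given above.
TRAINING_CONFIGS = [
    "configs/KITTI_flow.json",
    "configs/KITTI_depth_stereo.json",
    "configs/KITTI_rigid_flow_stereo.json",
]


def build_command(config, evaluate=False):
    """Build the train.py command line for one config file."""
    cmd = ["python3", "train.py", "-c", config]
    if evaluate:
        cmd.append("--e")
    return cmd


def run_all(evaluate=False, dry_run=True):
    """Print the three commands in order and, unless dry_run, execute them."""
    for config in TRAINING_CONFIGS:
        cmd = build_command(config, evaluate=evaluate)
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sequence if a step exits with an error.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default: only prints the commands.
    run_all(evaluate=False, dry_run=True)
```

Call `run_all(dry_run=False)` from the repository root to execute the steps, or `run_all(evaluate=True, dry_run=False)` to evaluate after setting the model paths.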

Pre-trained Models

You can download our pre-trained models. We provide the following:

KITTI_flow: The optical flow network trained separately on KITTI raw data (from scratch).
KITTI_stereo_depth: The stereo depth network on KITTI raw data.
KITTI_flow_joint: The optical flow network jointly trained with stereo depth on KITTI raw data.

Acknowledgement

This repository borrows snippets from several great projects, including PWC-Net, monodepth, UnFlow, UnDepthFlow, and DF-Net. Although most of these are TensorFlow implementations, we are grateful to their authors for sharing the code, which saved us a lot of time.
