3D-Reconstruction: Monocular Multi-View 3D Reconstruction Based on Deep Learning

Last update: Dec 26, 2022

Overview

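Monocular Multi-View 3D Reconstruction Based on Deep Learning

Part I 3D Reconstruction

Code: Part1

Technical documentation: [Markdown] [PDF]

Source images: Original Images

Point clouds: Point Cloud Results-1

Result images: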

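Part II Point-Cloud-to-Point-Cloud Window Recognition Based on Computer Vision Methods

Code: Part2

Technical documentation: [Markdown] [PDF]

Point clouds: Point Cloud Results-2

Algorithm flowchart: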

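Part III Image-to-Point-Cloud Semantic Segmentation Based on ResNeSt

Code: Part3

Technical documentation: [Markdown] [PDF]

Segmentation output: Semantic Segmentation Results

Point clouds: Point Cloud Results-3

Result images:

References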

AA-RMVSNet [arXiv] [CVF] [PDF]

Wei Z, Zhu Q, Min C, et al. AA-RMVSNet: Adaptive aggregation recurrent multi-view stereo network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 6187-6196.

Cascade-MVSNet [arXiv] [CVF] [PDF]

Gu X, Fan Z, Zhu S, et al. Cascade cost volume for high-resolution multi-view stereo and stereo matching[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 2495-2504.

TransMVSNet [arXiv] [PDF]

Ding Y, Yuan W, Zhu Q, et al. TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers[J]. arXiv preprint arXiv:2111.14600, 2021.

LoFTR [arXiv] [CVF] [PDF]

Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 8922-8931.

PatchmatchNet [arXiv] [CVF] [PDF]

Wang F, Galliani S, Vogel C, et al. PatchmatchNet: Learned Multi-View Patchmatch Stereo[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 14194-14203.

ResNeSt [arXiv] [PDF]

Zhang H, Wu C, Zhang Z, et al. ResNeSt: Split-attention networks[J]. arXiv preprint arXiv:2004.08955, 2020.

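Acknowledgements

Sparse reconstruction uses COLMAP to obtain the camera parameters.

The dense reconstruction code is derived mainly from AA-RMVSNet.

Point cloud cropping and visualization were done with CloudCompare and MeshLab.

Surface reconstruction calls Open3D.

The Cascade+Transformer code is based mainly on kwea123's pytorch-lightning implementation of Cascade-MVSNet and on LoFTR.

Some ideas in the window recognition algorithm draw on Color Space's rectangle detection algorithm; the image processing techniques are based mainly on Gonzalez's Digital Image Processing (3rd edition).

The semantic segmentation part calls PyTorch-Encoding.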
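The sparse reconstruction step above relies on COLMAP for camera parameters. As a rough illustration only (not this repository's code), the sketch below reads intrinsics from COLMAP's text export `cameras.txt`, assuming each data line has the form `CAMERA_ID MODEL WIDTH HEIGHT PARAMS[]` and that only the `SIMPLE_PINHOLE` and `PINHOLE` models are present. The `Camera` class, `parse_cameras_txt`, and the sample values are hypothetical names and data introduced here.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Camera:
    """One camera entry from a COLMAP cameras.txt file (hypothetical helper)."""
    camera_id: int
    model: str
    width: int
    height: int
    params: List[float]

    def intrinsic_matrix(self) -> List[List[float]]:
        """Return the 3x3 intrinsic matrix K as nested lists."""
        if self.model == "SIMPLE_PINHOLE":  # params: f, cx, cy
            f, cx, cy = self.params[:3]
            fx = fy = f
        elif self.model == "PINHOLE":  # params: fx, fy, cx, cy
            fx, fy, cx, cy = self.params[:4]
        else:
            # Distortion models are out of scope for this sketch.
            raise ValueError(f"unsupported camera model: {self.model}")
        return [[fx, 0.0, cx], [0.0, fy, cy], [0.0, 0.0, 1.0]]


def parse_cameras_txt(text: str) -> Dict[int, Camera]:
    """Parse the contents of cameras.txt into a dict keyed by camera id."""
    cameras: Dict[int, Camera] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks and header comments
            continue
        fields = line.split()
        cam = Camera(
            camera_id=int(fields[0]),
            model=fields[1],
            width=int(fields[2]),
            height=int(fields[3]),
            params=[float(v) for v in fields[4:]],
        )
        cameras[cam.camera_id] = cam
    return cameras


# Made-up sample data, not taken from this project's reconstructions.
sample = """# Camera list with one line of data per camera:
#   CAMERA_ID, MODEL, WIDTH, HEIGHT, PARAMS[]
1 PINHOLE 1600 1200 1000.0 1001.5 800.0 600.0
2 SIMPLE_PINHOLE 640 480 500.0 320.0 240.0
"""
cams = parse_cameras_txt(sample)
K1 = cams[1].intrinsic_matrix()
K2 = cams[2].intrinsic_matrix()
```

MVSNet-style networks generally need per-view extrinsics and a depth range in addition to these intrinsics, which this sketch does not cover.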


Releases(7)

7(Feb 16, 2022)

White mesh generated by NeuS
Source code(tar.gz)
Source code(zip)
dongbeiya_neus.ply(11.21 MB)
gym_north_neus.ply(21.28 MB)
gym_south_neus.ply(16.59 MB)
6(Feb 16, 2022)

White mesh generated by COLMAP and MeshLab
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(19.11 MB)
dongbeiya.png(8.45 MB)
gym_north.ply(31.93 MB)
gym_north.png(8.73 MB)
gym_south.ply(26.97 MB)
gym_south.png(9.32 MB)
5(Dec 29, 2021)

Original images for reconstruction
Source code(tar.gz)
Source code(zip)
PIC2.zip(755.68 MB)
PIC2.z01(900.00 MB)
PIC2.z02(900.00 MB)
dby.zip(735.16 MB)
dby.z02(900.00 MB)
dby.z01(900.00 MB)
4(Dec 19, 2021)

Semantic Segmentation Results of Problem 3
Source code(tar.gz)
Source code(zip)
filtered_segmentation_result_dongbeiya.zip(661.17 MB)
filtered_segmentation_result_gym.zip(786.65 MB)
segmentation_result_dongbeiya.zip(64.31 MB)
segmentation_result_dongbeiya_block.zip(53.27 MB)
segmentation_result_gym.zip(4.72 MB)
3(Dec 19, 2021)

Point Cloud Results of Problem 3
Source code(tar.gz)
Source code(zip)
2(Dec 19, 2021)

Point Cloud Results of Problem 2
Source code(tar.gz)
Source code(zip)
gym_south_window.ply(627.30 MB)
gym_north_window.ply(808.62 MB)
dongbeiya_window.ply(1800.53 MB)
gym_window.ply(1603.31 MB)
1(Dec 19, 2021)

Point Cloud Results of Problem 1
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(731.13 MB)
gym_south.ply(696.19 MB)
gym_north.ply(707.89 MB)
gym.ply(1404.08 MB)

Owner

HMT_Curo

GitHub Repository

