当前位置:网站首页>Fast and robust multi person 3D pose estimation from multiple views
Fast and robust multi person 3D pose estimation from multiple views
2022-04-23 02:15:00 【The code is too hard to knock. Hello】
Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views
subject :《 Fast robustness, multi view, multi person 3D Attitude estimation 》
author :

source :CVPR 2019
research contents :
Many people - Multiview - Unsupervised
Innovation points :
① A multi-channel matching algorithm is proposed , To find the... Detected across multiple views 2D The periodic correspondence of posture . The proposed matching algorithm can prune the error detection and deal with the partial overlap between views , Without knowing the real number of people in the scene ;
② Combine geometric and visual cues , To match the... Detected across views 2D posture .
Existing problems and technologies :
Through graph structure model [1] Direct reasoning 3D China and 2D Detect all assumptions of geometric compatibility . Its disadvantage is that the structural state space of the problem is huge , When the number of cameras is small , The method is not robust ( Sensitive to the number of cameras ), Therefore, this paper : Multi channel matching algorithm based on convex optimization .
Existing multi-channel matching 2D Attitude algorithm , Polar constraints are often used ( Judge 2 individual 2D Whether the posture is the same 3D Projection of attitude ), The disadvantages are as follows ① For occlusion and truncation positions , Estimation inaccuracy ;② Circular consistency constraints may be violated for each pair of views [2]
The main challenge : Find noise and incomplete 2D Find cross view correspondence in posture ( Who belongs to 2D Pose Match the view of )
In this paper, :
Multi channel matching algorithm based on convex optimization ( For all views detected 2D Pose In order to solve the problem of body clustering )
The same person with different views 2D Pose Consistent with key points
【 Periodic consistency constraint : Solve the global consistency correspondence generated by multi view matching ; Geometric consistency + Physical similarity ( From this, we can get 3D Posture ) So as to reduce fuzzy matching 】
For everyone and matching 2D Pose Deduce 3D Pose( The size of state space of multi attitude joint is reduced )
The framework of this paper :

3.1 Multi view matching process
(1) Parameters involved :
① Suppose there is V A camera ;
② In view i Bounding box detected in pi;
③ For a pair of views , Use Aij To represent the affinity matrix , Its elements represent affinity scores ;
④ The correspondence between the two sets of bounding boxes is estimated by the partial permutation matrix Pij Express
;
⑤
By N Composed of two joints 2D posture .
(2) Problem description
The affinity matrix Aij As input , Output the best partial permutation matrix Pij, To maximize the corresponding functions and cycle through multiple views ( Affinity matrix A It combines appearance similarity and geometric compatibility )
(3) Implementation process :
A: Affinity matrix of fusion
① Use Re-ID The network obtains the descriptor of the bounding box ( Such as : Black clothes 、 Long hair, etc ) after , And from “pool5” The feature vector is extracted as the descriptor of each bounding box . then , We calculate the Euclidean distance between the descriptors of the bounding box pair , And use sigmoid Function maps the distance to The value in is used as the appearance affinity score of this bounding box pair .
② Measure by distance : Geometric consistency calculation (Xi,Xj), Satisfy Dg It's very small .

![]()
Sum up , The fusion affinity matrix is obtained A by

![]()
B: Find the partial permutation matrix P( The estimated relationship between the two sets of bounding boxes )
① Suppose the permutation matrix P Satisfy

However P Can be broken down into P=YYT(Y Express 2D Bounding box and 3D The correspondence between )
②
Optimize it : No longer required P Semi positive definite , Only required P symmetry
(if P Symmetrical and Pii=Ii,0<P<1)
Solving this problem only ADMM Alternating method of algorithmic device , Introduce auxiliary variables Q Rewrite this question

Expand Lagrange
![]()
3.2 3DPS structure
(1) Parameters involved :① The joints i Location ti;
②2D View Vi
(2) Yes 3Dpose An estimate of the possibility :

Maximize P(T|I) General strategy :
Transform the state space into 3 Three dimensional mesh of dimensional space , Apply the maximum product method , But its complexity increases with the increase of grid space .
This article maximizes P(T|I) Strategy :
use 3D The state space is 2D Triangulation of joint pairs 3D coordinate ( Just check the joints in both views , Its reality 3D The location is included in the proposal ), Its state space parameters are reduced , And the accuracy increases .
Data sets :
(1)Campus Data sets :
3 people , Outside ,3 The camera
assessment :PCP( Correctly estimate the percentage of parts ), Measure body parts 3D Location accuracy .
(2)Shelf Data sets
4 people , Rack removal ,5 The camera , More occlusion .
(3)CMU Panoptic Data sets
Many people , indoor , A hundred Cameras , The method used in this paper for qualitative evaluation .
[1] In the structure diagram :
Nodes represent the joints of the body 3D Location , Edge coding, pairwise relationship between them .
The state space of each joint is usually represented by discretization 3D Spatial 3D grid . The possibility of a joint in a certain position is applied to all 2D The joint detector of the view gives , The paired potentials between the joints are constrained by the skeleton or at 2D The body parts detected in the view give . then , The maximum a posteriori estimation is used to infer the of multiple people 3D posture
[2] Two view 2 Match the pose of another person in the view
版权声明
本文为[The code is too hard to knock. Hello]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230212010140.html
边栏推荐
- Chinese scientists reveal a new mechanism for breaking through the bottleneck of rice yield
- 类初始化和实例初始化面试题
- LeetCode 447. Number of boomerangs (permutation and combination problem)
- What is an API interface?
- 每日一题冲刺大厂第十六天 NOIP普及组 三国游戏
- Develop a chrome plug-in from 0 (2)
- How does Axure set the content of the text box to the current date when the page is loaded
- 009_ Redis_ Getting started with redistemplate
- The importance of ERP integration to the improvement of the company's system
- LeetCode 283. Move zero (simple, array) Day12
猜你喜欢

011_ Redistemplate operation hash

What should I pay attention to when using proxy IP?

Network jitter tool clumsy

013_基于Session实现短信验证码登录流程分析

每日一题冲刺大厂第十六天 NOIP普及组 三国游戏

每日一题(2022-04-22)——旋转函数

【dpdk】10. Dpdk DNS learning notes

Why is one plus one equal to two

009_Redis_RedisTemplate入门

LeetCode 447. Number of boomerangs (permutation and combination problem)
随机推荐
Analyze the three functions of static proxy IP.
010_ StringRedisTemplate
006_redis_jedis快速入门
SO库依赖问题
leetcode:27. Remove element [count remove]
手写内存池以及原理代码分析【C语言】
Consider defining a bean of type ‘com.netflix.discovery.AbstractDiscoveryClientOptionalArgs‘
Open3d point cloud processing
Numerical remapping method (remap)
App optimization and advanced scoreboard Part 2 [Mui + flask + mongodb]
Halo open source project learning (I): project launch
C standard library - < time h>
What businesses use physical servers?
Talk about biology live broadcast: Dr. Wang Ziyuan, a lake view biology, exploring hepatitis B with gene therapy
Today will finally write system out. Println()
Startup of openstack service
数仓建表111111
001_ Redis set survival time
php 2022年4月20面试题整理
用TensorFlow实现线性回归(包括过程中出现的问题及解决方法)