当前位置:网站首页>Fast and robust multi person 3D pose estimation from multiple views
Fast and robust multi person 3D pose estimation from multiple views
2022-04-23 02:15:00 【The code is too hard to knock. Hello】
Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views
subject :《 Fast robustness, multi view, multi person 3D Attitude estimation 》
author :
source :CVPR 2019
research contents :
Many people - Multiview - Unsupervised
Innovation points :
① A multi-channel matching algorithm is proposed , To find the... Detected across multiple views 2D The periodic correspondence of posture . The proposed matching algorithm can prune the error detection and deal with the partial overlap between views , Without knowing the real number of people in the scene ;
② Combine geometric and visual cues , To match the... Detected across views 2D posture .
Existing problems and technologies :
Through graph structure model [1] Direct reasoning 3D China and 2D Detect all assumptions of geometric compatibility . Its disadvantage is that the structural state space of the problem is huge , When the number of cameras is small , The method is not robust ( Sensitive to the number of cameras ), Therefore, this paper : Multi channel matching algorithm based on convex optimization .
Existing multi-channel matching 2D Attitude algorithm , Polar constraints are often used ( Judge 2 individual 2D Whether the posture is the same 3D Projection of attitude ), The disadvantages are as follows ① For occlusion and truncation positions , Estimation inaccuracy ;② Circular consistency constraints may be violated for each pair of views [2]
The main challenge : Find noise and incomplete 2D Find cross view correspondence in posture ( Who belongs to 2D Pose Match the view of )
In this paper, :
Multi channel matching algorithm based on convex optimization ( For all views detected 2D Pose In order to solve the problem of body clustering )
The same person with different views 2D Pose Consistent with key points
【 Periodic consistency constraint : Solve the global consistency correspondence generated by multi view matching ; Geometric consistency + Physical similarity ( From this, we can get 3D Posture ) So as to reduce fuzzy matching 】
For everyone and matching 2D Pose Deduce 3D Pose( The size of state space of multi attitude joint is reduced )
The framework of this paper :
3.1 Multi view matching process
(1) Parameters involved :
① Suppose there is V A camera ;
② In view i Bounding box detected in pi;
③ For a pair of views , Use Aij To represent the affinity matrix , Its elements represent affinity scores ;
④ The correspondence between the two sets of bounding boxes is estimated by the partial permutation matrix Pij Express ;
⑤ By N Composed of two joints 2D posture .
(2) Problem description
The affinity matrix Aij As input , Output the best partial permutation matrix Pij, To maximize the corresponding functions and cycle through multiple views ( Affinity matrix A It combines appearance similarity and geometric compatibility )
(3) Implementation process :
A: Affinity matrix of fusion
① Use Re-ID The network obtains the descriptor of the bounding box ( Such as : Black clothes 、 Long hair, etc ) after , And from “pool5” The feature vector is extracted as the descriptor of each bounding box . then , We calculate the Euclidean distance between the descriptors of the bounding box pair , And use sigmoid Function maps the distance to The value in is used as the appearance affinity score of this bounding box pair .
② Measure by distance : Geometric consistency calculation (Xi,Xj), Satisfy Dg It's very small .
Sum up , The fusion affinity matrix is obtained A by
B: Find the partial permutation matrix P( The estimated relationship between the two sets of bounding boxes )
① Suppose the permutation matrix P Satisfy
However P Can be broken down into P=YYT(Y Express 2D Bounding box and 3D The correspondence between )
②
Optimize it : No longer required P Semi positive definite , Only required P symmetry
(if P Symmetrical and Pii=Ii,0<P<1)
Solving this problem only ADMM Alternating method of algorithmic device , Introduce auxiliary variables Q Rewrite this question
Expand Lagrange
3.2 3DPS structure
(1) Parameters involved :① The joints i Location ti;
②2D View Vi
(2) Yes 3Dpose An estimate of the possibility :
Maximize P(T|I) General strategy :
Transform the state space into 3 Three dimensional mesh of dimensional space , Apply the maximum product method , But its complexity increases with the increase of grid space .
This article maximizes P(T|I) Strategy :
use 3D The state space is 2D Triangulation of joint pairs 3D coordinate ( Just check the joints in both views , Its reality 3D The location is included in the proposal ), Its state space parameters are reduced , And the accuracy increases .
Data sets :
(1)Campus Data sets :
3 people , Outside ,3 The camera
assessment :PCP( Correctly estimate the percentage of parts ), Measure body parts 3D Location accuracy .
(2)Shelf Data sets
4 people , Rack removal ,5 The camera , More occlusion .
(3)CMU Panoptic Data sets
Many people , indoor , A hundred Cameras , The method used in this paper for qualitative evaluation .
[1] In the structure diagram :
Nodes represent the joints of the body 3D Location , Edge coding, pairwise relationship between them .
The state space of each joint is usually represented by discretization 3D Spatial 3D grid . The possibility of a joint in a certain position is applied to all 2D The joint detector of the view gives , The paired potentials between the joints are constrained by the skeleton or at 2D The body parts detected in the view give . then , The maximum a posteriori estimation is used to infer the of multiple people 3D posture
[2] Two view 2 Match the pose of another person in the view
版权声明
本文为[The code is too hard to knock. Hello]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204230212010140.html
边栏推荐
- Explain JS prototype and prototype chain in detail
- Shardingsphere broadcast table and binding table
- Halo open source project learning (I): project launch
- How to set computer IP?
- 012_ Access denied for user ‘root‘@‘localhost‘ (using password: YES)
- PTA: 浪漫倒影 [二叉树重建] [深度优先遍历]
- Leetcode39 combined sum
- 手写内存池以及原理代码分析【C语言】
- If 404 page is like this | daily anecdotes
- 005_ redis_ Set set
猜你喜欢
Daily question (April 22, 2022) - rotation function
006_redis_jedis快速入门
Shardingsphere read write separation
What is a proxy IP pool and how to build it?
day18--栈队列
PTA: 点赞狂魔
每日一题冲刺大厂第十六天 NOIP普及组 三国游戏
Campus transfer second-hand market source code
Micro build low code zero foundation introductory course
005_redis_set集合
随机推荐
我国科学家揭示突破水稻产量瓶颈新机制
A simple and open source navigation website source code
007_ Redis_ Jedis connection pool
【无标题】
LeetCode 349. Intersection of two arrays (simple, array) Day12
Introduction to micro build low code zero Foundation (lesson 2)
一加一为什么等于二
010_StringRedisTemplate
中金财富跟中金公司是一家公司吗,安全吗
从0开始开发一个chrome插件(2)
R language advanced | generalized vector and attribute analysis
009_Redis_RedisTemplate入门
leetcode:27. Remove element [count remove]
Summary of I / O knowledge points
IAR嵌入式开发STM32f103c8t6之点亮LED灯
005_redis_set集合
每日一题冲刺大厂第十六天 NOIP普及组 三国游戏
[chrome extender] content_ Cross domain problem of script
Daily question (April 22, 2022) - rotation function
What problems will you encounter when dialing VPS?