当前位置:网站首页>Planning code ROS migration POMDP prediction planning (I)
Planning code ROS migration POMDP prediction planning (I)
2022-04-23 02:42:00 【Summer melts the season】
One . Markov Markov Principle :
First , The general meaning of Markov process is The future is only about the present , It has nothing to do with the past . The definition is as follows :
F t n , t 1 , t 2 … … t n − 1 ( x n ∣ x 1 , x 2 … … x n 1 ) = F t n t n − 1 ( x n ∣ x n 1 ) F_{t_{n}, t_{1}, t_{2} \ldots \ldots t_{n-1}}\left(x_{n} \mid x_{1}, x_{2} \ldots \ldots x_{n} 1\right)=F_{t_{n} t_{n-1}}\left(x_{n} \mid x_{n} 1\right) Ftn,t1,t2……tn−1(xn∣x1,x2……xn1)=Ftntn−1(xn∣xn1)
such “ The state of the next moment is related to the current state , It has nothing to do with the state of the last moment ” The nature of , It is called no aftereffect or Markov nature . The process with this property is called Markov process .
There are two important concepts in Markov process : Transfer distribution function 、 Transfer probability
markov processes X t X_{t} Xt , Called conditional probability F s , t = P { X t ≤ y ∣ X s = x } F_{s, t}=P\left\{X_{t} \leq y \mid X_{s}=x\right\} Fs,t=P{
Xt≤y∣Xs=x} Is the transfer distribution function of the process .
call P ( X t n = x n ∣ X t n = x n − 1 ) P\left(X_{t_{n}}=x_{n} \mid X_{t_{n}}=x_{n-1}\right) P(Xtn=xn∣Xtn=xn−1) Is the transition probability .
Two . Partial Markov decision principle (POMDP):
MDP Principle : Online too much , Not here .
POMDP Principle :
There are involved POMDP The paper of , Will explain its basic principle , such as :
link : DESPOT: Online POMDP Planning with Regularization.
Pay attention to the following points :
1. comparison MDP,POMDP Can't get s t a t e state state, Can only get o b s e r v a t i o n observation observation;
2. By introducing o b s e r v a t i o n observation observation, Added o b s e r v a t i o n observation observation be based on s t a t e state state and a c t i o n action action The distribution of ;
3. h i s t o r y history history Expressed as :
h t = { a 1 , o 1 , … , a t , o t } h_{t}=\left\{a_{1}, o_{1}, \ldots, a_{t}, o_{t}\right\} ht={
a1,o1,…,at,ot}
h t , a t + 1 = { a 1 , o 1 , … , a t , o t , a t + 1 } h_{t} ,a_{t+1}=\left\{a_{1}, o_{1}, \ldots, a_{t}, o_{t}, a_{t+1}\right\} ht,at+1={
a1,o1,…,at,ot,at+1}
And is defined as a c t i o n action action and o b s e r v a t i o n observation observation Of sequence;
4. b e l i e f belief belief Is defined as s t a t e state state be based on h i s t o r y history history The distribution of ;
Detailed basic explanation :POMDP Basics .
3、 ... and . Use despot To solve the POMDP The overall framework :
1. Input b( Confidence space ), adopt buliddespot The model gets l(lowerbound)
2. Find the biggest l Of a( action )
3. If the strategy of confidence space gets more reward than the one chosen a Big , Use at this time b The original strategy chooses a
4. Executive action a
5. Accept observation and measurement z
6. According to the observation , action , And the current state , Get the state of the next moment
among ,builddespot Framework :
1. from K Get a series of
2. Build a new one despot Model , Take a new point as the root node
3. initialization U,L,l,u
4. obtain e(b0)=u(b0)-l(b0)
5. When e(b0)>e, And the running time is less than the maximum running time :
6.b=EXPLORE(D,b)
7.BACKUP(D,b)
8.e(b0)=u(b0)-l(b0)
return l
版权声明
本文为[Summer melts the season]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204220748368172.html
边栏推荐
猜你喜欢

Fast and robust multi person 3D pose estimation from multiple views

Day 3 of learning rhcsa

Fashion MNIST 数据集分类训练

How to build an integrated industrial Internet plus hazardous safety production management platform?
![[XJTU计算机网络安全与管理]第二讲 密码技术](/img/b0/263e8dcbfeb2ce9f504a9c8eb76b07.png)
[XJTU计算机网络安全与管理]第二讲 密码技术

ROP Emporium x86_64 7~8题

Using go language to build web server

Servlet template engine usage example

How big the program development of single chip microcomputer project can be, it represents your level of knocking code

一、序列模型-sequence model
随机推荐
This is how the power circuit is designed
JZ35 replication of complex linked list
字符串去掉空格问题
Efficient music format conversion tool Music Converter Pro
Rhcsa day 4 operation
Les derniers noeuds K de la liste jz22
Fast and robust multi person 3D pose estimation from multiple views
一个国产图像分割项目重磅开源!
认识进程(多线程_初阶)
双亲委派模型【理解】
Water diversion into chengluo Valley p1514
Handwritten memory pool and principle code analysis [C language]
Rich intelligent auxiliary functions and exposure of Sihao X6 security configuration: it will be pre sold on April 23
A domestic image segmentation project is heavy and open source!
Using go language to build web server
PIP install shutil reports an error
How can enterprises with major hazard installations ensure the completion of the digital construction task of double prevention mechanism by the end of the year
[XJTU計算機網絡安全與管理]第二講 密碼技術
MySQL复杂查询使用临时表/with as(类似表变量)
Huashu "deep learning" and code implementation: 01 Linear Algebra: basic concepts + code implementation basic operations