当前位置:网站首页>Complex AB experiment
Complex AB experiment
2022-08-10 06:50:00 【Coco-Lele】
1. 基本问题
Classification of inspection indicators
ABTest indicators can be divided into two categories:绝对值指标、Proportional value indicator.The variance is calculated differently for the two.
The proportional values can be divided into two categories according to the different denominators:The denominator is person times(留存率、转化率等)、The denominator is the number of actions(曝光点击率).
The denominator is person times,The split unit and the analysis unit are the same,可以用 z z z检验;The denominator is the number of actions,The units of analysis are not independent,要用 d e l t a delta delta检验.
Cumulative value over multiple days
The performance of indicators over multiple days is aggregated and calculated.For example, the per capita frequency of a certain behavior,Then the denominator is the total number of times that behavior occurred during the experiment,The numerator is the number of deduplicated people who entered the group during the experiment.
优点:To ensure independence between samples;增加样本量,Significance can increase with accumulation.
Multi-day accumulation of retention rate:Calculate the retention rate of new recruits every day,Then weighted according to the number of people.
不能用AB的情况
- When the intervention variable cannot be controlled(For example, the impact of watching live broadcasts on users,Some people cannot be forced to watch,Some people don't see it)
- Too much traffic is being used
- Policies can harm the user experience
AB实验步骤
Determine the experimental strategy;Develop experimental indicators of observation;计算样本量(显著性水平/统计功效/The minimum level of improvement in the indicator that needs to be observed/指标方差);Experimental development online;数据回收.
AB不显著
- Whether the minimum sample size is reached
- DIDEliminate fixed differences
- Check the experiment link,See if everyone is reached by the strategy(渗透率低,可以PSM)
2. delta检验
见上篇,Applicable when the split unit and analysis unit are different.
3. 贝叶斯检验
优点:
- Sample size does not need to be considered.
- The distribution of the posterior parameters can be obtained,Then quantify the probability of the improvement of the index、The size of the metric lift.
贝叶斯派 VS 频率派 基本理论:
先验分布 π ( θ ) \pi(\theta) π(θ) + 样本数据 P ( X ∣ θ ) P(X|\theta) P(X∣θ) = 后验分布 π ( θ ∣ X ) \pi(\theta|X) π(θ∣X)
共轭先验分布:贝塔分布 与 二项分布
θ \theta θ~ b e t a ( α , β ) beta(\alpha, \beta) beta(α,β), X X X~ B i n o m i a l ( n , p ) Binomial(n, p) Binomial(n,p), 则 θ ∣ X \theta|X θ∣X~ b e t a ( x + α , n − x + β ) beta(x+\alpha, n-x+\beta) beta(x+α,n−x+β)
4. Different hypothesis tests
z z z检验:Large sample data mean test(Distributions are not differentiated,中心极限定理;Does not distinguish whether the variance is known or not,n>30时t分布和z分布相似)
t t t检验:Small sample normal data mean test(小于30,方差未知)
F F F检验:方差齐性检验;单因素方差分析,Test the effect of the value of each level of a categorical variable.
卡方检验:The essence is to test whether the sample frequency is consistent with the expectation.Can be used to test the correlation between two sets of discrete variables(列联表);Test the similarity between the actual distribution and the expected distribution,Nonparametric tests are mostly used for categorical variables.
X 2 = Σ ( X − E ) 2 / E X^2=\Sigma(X-E)^2/E X2=Σ(X−E)2/E
令 E = n p E=np E=np,The square of the normal distribution can be obtained.k k k- s s s检验:Whether the sample satisfies a specific distribution;Look at the maximum value of the difference between the sample cumulative distribution and the theoretical cumulative distribution.
DID
y = α 1 ∗ t r e a t m e n t + α 2 ∗ p o s t + α 3 ∗ t r e a t m e n t ∗ p o s t + u y=\alpha_1*treatment + \alpha2 * post + \alpha_3*treatment*post+u y=α1∗treatment+α2∗post+α3∗treatment∗post+u
α 3 \alpha_3 α3represents the net effect of the policy
平行趋势检验
参考文献
ABexperimental interview
https://www.jiqizhixin.com/articles/2020-09-18-2
https://blog.csdn.net/deephub/article/details/112167937
边栏推荐
- 大佬,oracle单表增量同步时候源库服务器额外占用内存近2g,这不正常吧
- 2022河南萌新联赛第(五)场:信息工程大学 B - 交通改造
- IDLE开发wordCount程序(第五弹)
- 复现dns外带数据结合sqlmap
- SCS【2】单细胞转录组 之 cellranger
- 腾讯云宋翔:Kubernetes集群利用率提升实践
- 【Day10】进程管理命令
- 2022河南萌新联赛第(五)场:信息工程大学 F - 分割草坪
- 2022 Henan Mengxin League No. 5: University of Information Engineering J-AC Automata
- Quickly grasp game resources in one hour and remote hot update
猜你喜欢
随机推荐
3. Transactions [mysql advanced]
2022 Henan Mengxin League Game (5): University of Information Engineering K - Matrix Generation
英国国家卫生服务遭受攻击,系统出现大面积故障
深入理解LTE网络的CDRX
The constraints of the database learning table
【论文解读】滴滴智能派单-KDD2018 Large-Scale Order Dispatch in On-Demand Ride-Hailing
2022 Henan Mengxin League No. 5: University of Information Engineering B - Transportation Renovation
各位大佬 oracle cdc 默认配置 偶发会30秒才抓取到数据 这个怎么优化啊
High quality WordPress download station 5 play theme template
【Event Preview on August 9】Prometheus Summit
Qt滚动条(QScrollBar)圆角样式问题跟踪
2022河南萌新联赛第(五)场:信息工程大学 J - AC自动机
【强化学习】《Easy RL》- Q-learning - CliffWalking(悬崖行走)代码解读
QScroller的QScrollerProperties参数研究
[Reinforcement Learning] "Easy RL" - Q-learning - CliffWalking (cliff walking) code interpretation
强化学习_03_表格方法实践(CartPole-v0 And MontoCarlo)
【无标题】
mysql数据库月增长量问题
Win32屏幕坐标转换Qt坐标
DGIOT三千万电表集抄压测









