Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis

Last update: Jan 27, 2022

Related tags

Deep Learning CRT

Overview

Readme File for "Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis" by Ham, Imai, and Janson. (2022)

All scripts were written and edited by Dae Woong Ham on 01/27/2022

Code Overview

Plotting previous empirical results (Fig 1, Fig 2)

"Section2_AMCE_plots/immigration_Fig1.R" produces Figure 1 AMCE plots based on original AMCE estimates
"Section2_AMCE_plots/gender_Fig2.R" produces Figure 2 AMCE plots based on original AMCE estimates

All simulation plots (Fig 3, 4, 5, 6, 7)

All simulations are plotted through "Simulations/all_simulation_plots.R" file
All simulation scripts are executed through "source/left_fig_simulation.sh" or "source/right_fig_simulation.sh"
"Simulations/Section4/Figure3_leftplot.R"/"Simulations/Section4/Figure3_rightplot.R" produces results of Fig 3 # 50 and 33 hours of computing time respectively
"Simulations/Appendix/Figure4_and_6_leftplot.R"/"Simulations/Section4/Figure4_and_6_rightplot.R" produces results of Fig 4 and 6 # 50 and 33 hours of computing time respectively
"Simulations/Appendix/Figure5_leftplot.R"/"Simulations/Section4/Figure5_rightplot.R" produces results of Fig 5 # 50 and 33 hours of computing time respectively
"Simulations/Appendix/Figure7.R" produces results of Fig 7 # less than 5 minutes of computing time on FAS computing cluster

Obtaining new p-values (Section 5 and Table 1)

All p-values in Section 5 are summarized and obtained in "Section5_ApplicationResults/pval_analysis.R"
"Section5_ApplicationResults/Immigration/main_analysis/obs_test_stat.R"/"Section5_ApplicationResults/Immigration/main_analysis/resampled_test_stats.R" produces observed and resampled test statistics to produce p-value in Table 1 row 1 column 1. # 30 minutes of total computing time
"Section5_ApplicationResults/Immigration/main_analysis/AMCE_pval.do" produces AMCE p-value in Table 1 row 1 column 2. #less than 5 seconds of total computing time
"Section5_ApplicationResults/Immigration/main_analysis/profile_order_effect.R"/"Section5_ApplicationResults/Immigration/main_analysis/profile_order_effect/resampled_test_stats.R" produces observed and resampled test statistics to produce p-value in Table 1 row 1 column 3. # 10 minutes of total computing time
"Section5_ApplicationResults/Immigration/main_analysis/carryover_effect_obs_test_stat.R"/"Section5_ApplicationResults/Immigration/main_analysis/carryover_effect_resampled_test_stats.R" produces observed and resampled test statistics to produce p-value in Table 1 row 1 column 4. # 30 minutes of total computing time
"Section5_ApplicationResults/Immigration/main_analysis/fatigue_effect_obs_test_stat.R"/"Section5_ApplicationResults/Immigration/main_analysis/fatigue_effect_resampled_test_stats.R" produces observed and resampled test statistics to produce p-value in Table 1 row 1 column 5. # 24 minutes of computing time
To obtain p-value for second row repeat above but for "Section5_ApplicationResults/Gender/..." # Approximate computation time is listed in the individual files
Each application also contains "../lasso_obs_test_stat.R"/"../lasso_resampled_test_stats.R" to produce supplementary main effect analysis in Section 5
"Section5_ApplicationResults/Immigration/with_ethnocentrism/" contains files to produce p-value when including ethnocentrism in Section 5.1
"Section5_ApplicationResults/gender/supplementary_analysis/" contains files to produce p-value when performing robustness analysis using second most significant interaction in Appendix
"Section5_ApplicationResults/gender/main_analysis/presidential_lasso_explore.R" contains script to find which interaction is strongest in Presidential dataset

Other folders

"data" folder contains all relevant datasets in both Immigration and gender conjoint examples and all the saved results of p-values in simulations and test statistics for Section 5
"Figures" folder contains all figures
"source" folder contains all helper and main functions to run above scripts (including data cleaning, obtaining test statistics, generating simulation datasets). In particular "source/hiernet_source.R" contains the main function to compute all HierNet test statistics in the paper.

Environment

R version 4.1.0
200 cores for all scripts that required parallel computing
All parallel computations in this paper were run on the FASRC Cannon cluster supported by the FAS Division of Science Research Computing Group at Harvard University

Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis

Related tags

Overview

Readme File for "Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis" by Ham, Imai, and Janson. (2022)

All scripts were written and edited by Dae Woong Ham on 01/27/2022

Code Overview

Plotting previous empirical results (Fig 1, Fig 2)

All simulation plots (Fig 3, 4, 5, 6, 7)

Obtaining new p-values (Section 5 and Table 1)

Other folders

Environment

Owner

Collaborative forensic timeline analysis

Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.

Dataset for the Research2Clinics @ NeurIPS 2021 Paper: What Do You See in this Patient? Behavioral Testing of Clinical NLP Models

Fully-automated scripts for collecting AI-related papers

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

Official implementation of Pixel-Level Bijective Matching for Video Object Segmentation

Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting (ICCV, 2021)

Sibur challange 2021 competition - 6 place

2021-AIAC-QQ-Browser-Hyperparameter-Optimization-Rank6

Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral)

🔥 Cogitare - A Modern, Fast, and Modular Deep Learning and Machine Learning framework for Python

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

Classification of ecg datas for disease detection

Deep Image Matting implementation in PyTorch

A free, multiplatform SDK for real-time facial motion capture using blendshapes, and rigid head pose in 3D space from any RGB camera, photo, or video.

基于Paddle框架的arcface复现

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Open source person re-identification library in python

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch