CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Last update: Dec 22, 2022

Overview

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

This repo contains code for our paper "Counterfactual Samples Synthesizing for Robust Visual Question Answering" This repo contains code modified from here,many thanks!

Prerequisites

Make sure you are on a machine with a NVIDIA GPU and Python 2.7 with about 100 GB disk space.
h5py==2.10.0
pytorch==1.1.0
Click==7.0
numpy==1.16.5
tqdm==4.35.0

Data Setup

You can use

bash tools/download.sh

to download the data
and the rest of the data and trained model can be obtained from BaiduYun(passwd:3jot) or GoogleDrive unzip feature1.zip and feature2.zip and merge them into data/rcnn_feature/
use

bash tools/process.sh

to process the data

Training

Run

CUDA_VISIBLE_DEVICES=0 python main.py --dataset cpv2 --mode q_v_debias --debias learned_mixin --topq 1 --topv -1 --qvp 5 --output [] --seed 0

to train a model

Testing

Run

CUDA_VISIBLE_DEVICES=0 python eval.py --dataset cpv2 --debias learned_mixin --model_state []

to eval a model

Citation

If you find this code useful, please cite the following paper:

@inproceedings{chen2020counterfactual,
title={Counterfactual Samples Synthesizing for Robust Visual Question Answering},
author={Chen, Long and Yan, Xin and Xiao, Jun and Zhang, Hanwang and Pu, Shiliang and Zhuang, Yueting},
booktitle={CVPR},
year={2020}
}

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Related tags

Overview

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Prerequisites

Data Setup

Training

Testing

Citation

Owner

A flexible framework of neural networks for deep learning

Equivariant Imaging: Learning Beyond the Range Space

This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search Engines"

DeepHawkeye is a library to detect unusual patterns in images using features from pretrained neural networks

PyTorch implementation of paper A Fast Knowledge Distillation Framework for Visual Recognition.

Deploy optimized transformer based models on Nvidia Triton server

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

Deep Residual Learning for Image Recognition

Simple tutorials on Pytorch DDP training

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

OpenMMLab Pose Estimation Toolbox and Benchmark.

Implementation of TimeSformer, a pure attention-based solution for video classification

AISTATS 2019: Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Deep learning with dynamic computation graphs in TensorFlow

TargetAllDomainObjects - A python wrapper to run a command on against all users/computers/DCs of a Windows Domain

Code for the paper "Implicit Representations of Meaning in Neural Language Models"

SARS-Cov-2 Recombinant Finder for fasta sequences

Evaluating AlexNet features at various depths