Code for the ACM MM 2021 paper “Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning”

Setting up and using the repo

  1. Get the dataset. Follow the steps in data/README.md, which also cover obtaining the pretrained BERT embeddings and visual representations.

  2. Install CUDA 11.0 if it is not already available.

  3. Install Anaconda if it is not already available, then create a new environment and install the dependencies: PyTorch 1.7.1, torchvision, and AllenNLP.

wget https://repo.anaconda.com/archive/Anaconda3-5.2.0-Linux-x86_64.sh
bash Anaconda3-5.2.0-Linux-x86_64.sh
conda update -n base -c defaults conda
conda create --name MCC python=3.6
source activate MCC

conda install numpy pyyaml setuptools cmake cffi tqdm scipy ipython mkl mkl-include cython typing h5py pandas nltk spacy numpydoc scikit-learn jpeg

conda install pytorch==1.7.1 torchvision==0.8.2 cudatoolkit=11.0 -c pytorch

pip install -r allennlp-requirements.txt
pip install --no-deps allennlp==0.8.0
python -m spacy download en_core_web_sm


# Optional: pillow-simd is a drop-in Pillow replacement that speeds up image processing
pip uninstall pillow && CC="cc -mavx2" pip install -U --force-reinstall pillow-simd
  4. That’s it! Whenever you want to use the environment later, run source activate MCC. A quick sanity check of the install is sketched below.
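
The following commands are not part of the original instructions, just a minimal sanity check assuming the installs above succeeded; they should print the PyTorch version with CUDA visible, and import AllenNLP and the spaCy model without errors.

# expect "1.7.1 True" (PyTorch version and CUDA visibility)
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
# AllenNLP 0.8.0 should import cleanly
python -c "import allennlp; print('allennlp OK')"
# the spaCy model downloaded in the previous step should load
python -c "import spacy; spacy.load('en_core_web_sm'); print('spacy OK')"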

Train/Evaluate models

Please refer to models/README.md.

Acknowledgement

  • We thank the r2c and tab-vcr repos, whose preprocessing code we adapt.

Cite

@inproceedings{zhang2021multi,
  title={Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning},
  author={Zhang, Xi and Zhang, Feifei and Xu, Changsheng},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={1793--1802},
  year={2021}
}
