This repo contains a PyTorch implementation of MLP-Mixer: An all-MLP Architecture for Vision.
Code to accompany our paper "How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change".
PyTorch implementation of the paper DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks.
A neuroanatomy-based augmented reality experience powered by computer vision that features 3D visuals of the Atlas Brain Map slices.
TensorFlow implementation of the MIRNet architecture as proposed in Learning Enriched Features for Real Image Restoration and Enhancement.
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning [CVPR'21, Oral]
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks (SDPoint)
The goal of this challenge is to build a model/agent that moves objects in a room to restore them to a given initial configuration.
This repository provides the code for our CVPR 2021 paper Deep Two-View Structure-from-Motion Revisited.
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
The development of areas such as computer vision, image processing, and computer graphics allows the introduction of technologies such as Augmented Reality.
Global Filter Networks is a transformer-style architecture that learns long-term spatial dependencies in the frequency domain with log-linear complexity.
A program that searches aerial photos for people lost in the forest using the RetinaNet neural network.