NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR 2021)
Self-Learned Video Rain Streak Removal: When Cyclic Consistency Meets Temporal Correspondence (CVPR 2020)
AutoGiphyMovie lets you search Giphy for GIFs, convert them to videos, attach a soundtrack, and stitch it all together into a movie!
Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos.
A basic Flask server that serves fixed Twitter video embeds to desktop Discord, using either the Twitter API or youtube-dl to grab tweet video information.
PyTorch implementation for high-resolution (e.g., 2048x1024) photorealistic video-to-video translation.
SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos
PNS-Net: This repository provides code for the paper "Progressively Normalized Self-Attention Network for Video Polyp Segmentation", published at the MICCAI 2021 conference (arXiv Version | Chinese version). If you have any questions about our paper,
A model for locally controlled, stochastic video synthesis based on poking a single pixel in a static scene
This is a PyTorch reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution (VSR).
Cross-platform command-line AV1 / VP9 / HEVC / H264 encoding framework with per-scene quality encoding
This repository contains a Python and C++ implementation of Robust Consistent Video Depth, as described in the paper
ASPset-510 (Australian Sports Pose Dataset) is a large-scale video dataset for the training and evaluation of 3D human pose estimation models.
AugLy is a data augmentation library that currently supports four modalities (audio, image, text, and video) and over 100 augmentations.
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation