Benchmark

Map Compressibility Assessment for LiDAR Registration

This repo contains the released code and datasets used for our IROS 2021 paper, "Map Compressibility Assessment for LiDAR Registration."

18 October 2021

Benchmark

Benchmark tools for Compressive LiDAR-to-map registration

18 October 2021

Benchmark

Benchmarks for reading Parquet into Arrow

18 October 2021

Question Answering

Repository for the Bias Benchmark for QA dataset

17 October 2021

DeepMind

DeepMind Alchemy task environment: a meta-reinforcement learning benchmark

15 October 2021

Benchmark

The self-supervised goal-reaching benchmark introduced in "Discovering and Achieving Goals via World Models"

15 October 2021

Generator

A benchmark for concept generalization

ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations that measures how well they generalize to unseen concepts.

13 October 2021

Natural Language Processing

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

09 October 2021

Deep Learning

Four neural-network-based domain adaptation techniques (DeepCORAL, DDC, CDAN and CDAN+E), evaluated on the Office31 benchmark dataset

01 October 2021

Benchmark

Benchmark a WebSocket server's message throughput

01 October 2021

Benchmark

Gait Recognition in the Wild: A Benchmark

30 September 2021

Benchmark

Reproducible nvim completion framework benchmarks

30 September 2021

Benchmark

A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models

30 September 2021

Benchmark

RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

28 September 2021

PyTorch

CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

27 September 2021

Question Answering

TruthfulQA: Measuring How Models Imitate Human Falsehoods

26 September 2021

Benchmark

Benchmark for Answering Existential First Order Queries with Single Free Variable

24 September 2021

Machine Learning

Merlion: A Machine Learning Framework for Time Series Intelligence

23 September 2021

Speech To Text

Pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

23 September 2021

Tracking

SiamSA: Robust Siamese Object Tracking for Unmanned Aerial Manipulator

23 September 2021

Scripts

A Python script that benchmarks the download speeds for the connections defined in one or more WireGuard config files

wireguard-config-benchmark is a Python script that benchmarks the download speeds for the connections defined in one or more WireGuard config files.

23 September 2021

Benchmark

Standard implementations of FedLab and its provided benchmarks

17 September 2021

Benchmark

Phy-Q: A Benchmark for Physical Reasoning

Humans are well-versed in reasoning about the behaviors of physical objects when choosing actions to accomplish tasks, but such reasoning remains a major challenge for AI.

17 September 2021

Benchmark

Load and performance benchmark tool in Python

11 September 2021

Benchmark

Model Quantization Benchmark in Python

11 September 2021

Dataset

Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

07 September 2021

Testing

An easy to use, scriptable and scalable performance testing tool

Locust is an easy to use, scriptable and scalable performance testing tool. You define the behaviour of your users in regular Python code.

06 September 2021

Autonomous Driving

AutoLay: Benchmarking Monocular Layout Estimation

04 September 2021

Benchmark

CPU benchmark that calculates pi, built with Python 3

The program calculates pi to 10,000 decimal places. The time spent on the calculation is the test result, taken as the average of 10 attempts. Lower is better.

02 September 2021
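
The timing scheme described in the entry above (compute pi to a fixed number of decimal places, repeat, and average the elapsed time) can be sketched roughly as follows. This is a minimal illustration and not the listed project's code: the function names `arctan_inv`, `pi_digits` and `benchmark` are hypothetical, and Machin's formula with integer arithmetic is an assumed choice, since the listing does not say which algorithm the project uses.

```python
import time
from statistics import mean


def arctan_inv(x: int, unity: int) -> int:
    """Return arctan(1/x) scaled by `unity`, using the integer Taylor series."""
    total = term = unity // x
    x2 = x * x
    n = 3
    sign = -1
    while term:
        term //= x2                  # next odd power of 1/x, still scaled
        total += sign * (term // n)  # alternating series term
        n += 2
        sign = -sign
    return total


def pi_digits(digits: int) -> str:
    """Return pi truncated to `digits` decimal places (Machin's formula)."""
    guard = 10  # extra digits to absorb truncation error (assumed margin)
    unity = 10 ** (digits + guard)
    pi_scaled = 4 * (4 * arctan_inv(5, unity) - arctan_inv(239, unity))
    s = str(pi_scaled // 10 ** guard)
    return s[0] + "." + s[1:]


def benchmark(digits: int = 10_000, attempts: int = 10) -> float:
    """Average wall-clock seconds per calculation; lower is better."""
    timings = []
    for _ in range(attempts):
        start = time.perf_counter()
        pi_digits(digits)
        timings.append(time.perf_counter() - start)
    return mean(timings)


if __name__ == "__main__":
    print(f"average time: {benchmark():.4f} s")
```

Averaging over several attempts smooths out scheduling noise on a busy machine; the defaults mirror the 10,000 places and 10 attempts mentioned in the description.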

Benchmark

A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

01 September 2021

Transformer

Transfer Learning Shootout for PyTorch's model zoo

Transfer Learning shootout for PyTorch's model zoo (torchvision).

29 August 2021

Benchmark

A benchmark platform containing diverse weak supervision tasks

Wrench is a benchmark platform containing diverse weak supervision tasks.

27 August 2021

Neural Network

Bag of Tricks for Training Deeper Graph Neural Networks

Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study

24 August 2021

Benchmark

A MNIST-like fashion product database. Benchmark

A dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples.

23 August 2021

Benchmark

A Universal Commonsense Reasoning Model on a New Multitask Benchmark

This repository is for the paper: Unicorn on Rainbow: A Universal Commonsense Reasoning Model on a New Multitask Benchmark.

22 August 2021

Graph

New Benchmarks for Learning on Non-Homophilous Graphs

[WWW 2021 GLB] New Benchmarks for Learning on Non-Homophilous Graphs

19 August 2021

Benchmark

A simulation-based inference benchmark framework

This repository contains a simulation-based inference benchmark framework

15 August 2021

Benchmark

OpenMMLab Semantic Segmentation Toolbox and Benchmark

MMSegmentation is an open source semantic segmentation toolbox based on PyTorch.

14 August 2021

Machine Learning

Benchmarking Model and System Performance of Federated Learning

FedScale: Benchmarking Model and System Performance of Federated Learning

13 August 2021

Benchmark

A Versatile Benchmark for Comprehensive Forgery Analysis

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

13 August 2021

Task

A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

12 August 2021

Benchmark

Towards Reproducible and Deployable Model Quantization Benchmark

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

10 August 2021

Graph

A Federated Learning System and Benchmark for Graph Neural Networks

A Research-oriented Federated Learning Library and Benchmark Platform for Graph Neural Networks.

08 August 2021

Question Answering

A Benchmark Dataset for Understanding Disfluencies in Question Answering

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

07 August 2021

Benchmark

The Korean Language Understanding Evaluation (KLUE) benchmark

KLUE-baseline contains the baseline code for the Korean Language Understanding Evaluation (KLUE) benchmark.

06 August 2021

Machine Learning

A collection of GNN-based fake news detection models

This repo includes the PyTorch Geometric implementation of a series of Graph Neural Network (GNN) based fake news detection models.

04 August 2021

Benchmark

Multi-View Partial Point Clouds for Completion and Registration

MVP Benchmark for Multi-View Partial Point Cloud Completion and Registration

02 August 2021

Benchmark

A Large-Scale Benchmark for General Object Grasping

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020).

02 August 2021

Benchmark

Visual reinforcement learning benchmark for controllability

BridgeWalk is a partially-observed reinforcement learning environment with dynamics of varying stochasticity.

02 August 2021

Benchmark

A Chinese Biomedical Language Understanding Evaluation Benchmark

AI (Artificial Intelligence) plays an indispensable role in the biomedical field, helping improve medical technology.

02 August 2021

A collection of 110 posts