Image

WIDERFace

YEAR: 2020 LICENSE: Not found UpdateTime: 2021-01-15

32,203张图像,并对393,703张像样本图像中所描述的在尺度、姿势和遮挡方面具有高度可变性的面孔进行标记。较宽的人脸数据集基于61个事件类进行组织。 

Image

WIDER

LICENSE: Not found UpdateTime: 2021-01-15

WIDER包含61个事件类别和大约50574个用事件 类标签注释的图像。 

Image

IMDB-WIKI_faces

LICENSE: Not found UpdateTime: 2021-01-15

来自IMDb的20,284名名人和Wikipedia的62,328名名人共460,723张人脸图像,因此总计523,051张。  

Image

Objectron

YEAR: 2020 LICENSE: C-UDA-1.0 UpdateTime: 2021-01-19

The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. Includes 15000 annotated videos and 4M annotated images.

Image

Products-10K

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

The largest production recognition dataset containing 10,000 products frequently bought by online customers in JD.com

Medical

MedICaT

YEAR: 2020 LICENSE: CC-BY-NC-ND 4.0 UpdateTime: 2021-01-19

MedICaT is a dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references. Consists of: 217,060 figures from 131,410 open access papers, 7507 subcaption and subfigure annotations for 2069 compound figures, Inline references for ~25K figures in the ROCO dataset.

Question answering

ClarQ

YEAR: 2020 LICENSE: CC BY-NC 4.0 UpdateTime: 2021-01-19

ClarQ: A large-scale and diverse dataset for Clarification Question Generation. Consists of ~2M examples distributed across 173 domains of stackexchange.

Image

HAA500

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

HAA500, a manually annotated human-centric atomic action dataset for action recognition on 500 classes with over 591k labeled frames.

NLP

CLUE benchmark

YEAR: 2020 LICENSE: Various UpdateTime: 2021-01-19

CLUE: A Chinese Language Understanding Evaluation Benchmark. CLUE is an open-ended, community-driven project that brings together 9 tasks spanning several well-established single-sentence/sentence-pair classification tasks, as well as machine reading comprehension, all on original Chinese text.

Image

KeypointNet

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

KeypointNet is a large-scale and diverse 3D keypoint dataset that contains 83,231 keypoints and 8,329 3D models from 16 object categories, by leveraging numerous human annotations, based on ShapeNet models.

Image

Face/Head segmentation dataset

YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19

The dataset contains over 16.5k (16557) fully pixel-level labeled segmentation images.

Image

Ruralscapes Dataset

YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19

Ruralscapes Dataset for Semantic Segmentation in UAV Videos. Ruralscapes is a dataset with 20 high quality (4K) videos portraying rural areas.

Image

TAO

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

TAO is a federated dataset for Tracking Any Object, containing 2,907 high resolution videos, captured in diverse environments, which are half a minute long on average.

Image

HiEve

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

Human-centric Video Analysis in Complex Events. HiEve dataset includes the currently largest number of poses (>1M), the largest number of complex-event action labels (>56k), and one of the largest number of trajectories with long terms (with average trajectory length >480).

Image

Fashionpedia

YEAR: 2020 LICENSE: CC BY 4.0 UpdateTime: 2021-01-19

Fashionpedia is a dataset which consists of two parts: (1) an ontology built by fashion experts containing 27 main apparel categories, 19 apparel parts, 294 fine-grained attributes and their relationships; (2) a dataset with 48k everyday and celebrity event fashion images annotated with segmentation masks and their associated per-mask fine-grained attributes, built upon the Fashionpedia ontology.

Image

Condensed Moives

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

A large-scale video dataset, featuring clips from movies with detailed captions. Over 3,000 diverse movies from a variety of genres, countries and decades.

Image

AVID

YEAR: 2020 LICENSE: MIT UpdateTime: 2021-01-19

AViD is a large-scale video dataset with 467k videos and 887 action classes. The collected videos have a creative-commons license.

NLP

Social Bias Inference Corpus

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

Social Bias Inference Corpus (SBIC) contains 150k structured annotations of social media posts, covering over 34k implications about a thousand demographic groups.

Self-driving

DDAD

YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19

DDAD (Dense Depth for Autonomous Driving) is a new autonomous driving benchmark from TRI (Toyota Research Institute) for long range (up to 250m) and dense depth estimation in challenging and diverse urban conditions. It contains monocular videos and accurate ground-truth depth (across a full 360 degree field of view) generated from high-density LiDARs mounted on a fleet of self-driving cars operating in a cross-continental setting.

NLP

GoEmotions

YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19

GoEmotions, the largest manually annotated dataset of 58k English Reddit comments, labeled for 27 emotion categories or Neutral.