数据导航
WIDERFace
YEAR: 2020 LICENSE: Not found UpdateTime: 2021-01-15
32,203张图像,并对393,703张像样本图像中所描述的在尺度、姿势和遮挡方面具有高度可变性的面孔进行标记。较宽的人脸数据集基于61个事件类进行组织。
IMDB-WIKI_faces
LICENSE: Not found UpdateTime: 2021-01-15
来自IMDb的20,284名名人和Wikipedia的62,328名名人共460,723张人脸图像,因此总计523,051张。
Objectron
YEAR: 2020 LICENSE: C-UDA-1.0 UpdateTime: 2021-01-19
The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. Includes 15000 annotated videos and 4M annotated images.
Products-10K
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
The largest production recognition dataset containing 10,000 products frequently bought by online customers in JD.com
MedICaT
YEAR: 2020 LICENSE: CC-BY-NC-ND 4.0 UpdateTime: 2021-01-19
MedICaT is a dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references. Consists of: 217,060 figures from 131,410 open access papers, 7507 subcaption and subfigure annotations for 2069 compound figures, Inline references for ~25K figures in the ROCO dataset.
ClarQ
YEAR: 2020 LICENSE: CC BY-NC 4.0 UpdateTime: 2021-01-19
ClarQ: A large-scale and diverse dataset for Clarification Question Generation. Consists of ~2M examples distributed across 173 domains of stackexchange.
HAA500
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
HAA500, a manually annotated human-centric atomic action dataset for action recognition on 500 classes with over 591k labeled frames.
- Source:https://www.cse.ust.hk/haa/
CLUE benchmark
YEAR: 2020 LICENSE: Various UpdateTime: 2021-01-19
CLUE: A Chinese Language Understanding Evaluation Benchmark. CLUE is an open-ended, community-driven project that brings together 9 tasks spanning several well-established single-sentence/sentence-pair classification tasks, as well as machine reading comprehension, all on original Chinese text.
KeypointNet
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
KeypointNet is a large-scale and diverse 3D keypoint dataset that contains 83,231 keypoints and 8,329 3D models from 16 object categories, by leveraging numerous human annotations, based on ShapeNet models.
Face/Head segmentation dataset
YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19
The dataset contains over 16.5k (16557) fully pixel-level labeled segmentation images.
Ruralscapes Dataset
YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19
Ruralscapes Dataset for Semantic Segmentation in UAV Videos. Ruralscapes is a dataset with 20 high quality (4K) videos portraying rural areas.
TAO
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
TAO is a federated dataset for Tracking Any Object, containing 2,907 high resolution videos, captured in diverse environments, which are half a minute long on average.
- Source:http://taodataset.org/
HiEve
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
Human-centric Video Analysis in Complex Events. HiEve dataset includes the currently largest number of poses (>1M), the largest number of complex-event action labels (>56k), and one of the largest number of trajectories with long terms (with average trajectory length >480).
- Source:http://humaninevents.org/
Fashionpedia
YEAR: 2020 LICENSE: CC BY 4.0 UpdateTime: 2021-01-19
Fashionpedia is a dataset which consists of two parts: (1) an ontology built by fashion experts containing 27 main apparel categories, 19 apparel parts, 294 fine-grained attributes and their relationships; (2) a dataset with 48k everyday and celebrity event fashion images annotated with segmentation masks and their associated per-mask fine-grained attributes, built upon the Fashionpedia ontology.
Condensed Moives
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
A large-scale video dataset, featuring clips from movies with detailed captions. Over 3,000 diverse movies from a variety of genres, countries and decades.
AVID
YEAR: 2020 LICENSE: MIT UpdateTime: 2021-01-19
AViD is a large-scale video dataset with 467k videos and 887 action classes. The collected videos have a creative-commons license.
Social Bias Inference Corpus
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
Social Bias Inference Corpus (SBIC) contains 150k structured annotations of social media posts, covering over 34k implications about a thousand demographic groups.
DDAD
YEAR: 2020 LICENSE: CC-BY-NC-SA 4.0 UpdateTime: 2021-01-19
DDAD (Dense Depth for Autonomous Driving) is a new autonomous driving benchmark from TRI (Toyota Research Institute) for long range (up to 250m) and dense depth estimation in challenging and diverse urban conditions. It contains monocular videos and accurate ground-truth depth (across a full 360 degree field of view) generated from high-density LiDARs mounted on a fleet of self-driving cars operating in a cross-continental setting.
GoEmotions
YEAR: 2020 LICENSE: Non-commercial UpdateTime: 2021-01-19
GoEmotions, the largest manually annotated dataset of 58k English Reddit comments, labeled for 27 emotion categories or Neutral.