Explore Datasets

Camelyon 16
All ground-truth annotations were carefully prepared under the supervision of expert pathologists. Additional slides stained with cytokeratin immunohistochemistry were used when revising the slides.

The CAMELYON16 challenge dataset (more information at camelyon16.grand-challenge.org) consists of 400 digitized hematoxylin and eosin (H&E)-stained sentinel lymph node sections (270 training and 130 testing images) sampled from 400 patients. All metastases were exhaustively and accurately annotated under the supervision of expert pathologists using H&E- and immunohistochemistry-stained slides.
Images
407
Annotations
7,602
Start Project
ImageNet
ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. Currently we have an average of over five hundred images per node. We hope ImageNet will become a useful resource for researchers, educators, students and all of you who share our passion for pictures.
Images
12,845,700
Annotations
9,418,054
Start Project
CIFAR 10
The CIFAR-10 dataset consists of 60,000 32x32 colour images in 10 classes, with 6,000 images per class. There are 50,000 training images and 10,000 test images.
Images
70,000
Annotations
59,995
Start Project
CIFAR 100
CIFAR-100 is just like CIFAR-10, except that it has 100 classes containing 600 images each. There are 500 training images and 100 testing images per class. The 100 classes in CIFAR-100 are grouped into 20 superclasses.
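As a quick sanity check on the figures above, the following Python sketch recomputes the CIFAR-100 totals from the per-class counts. The constant and function names are illustrative only and are not part of any dataset loader; the classes-per-superclass value assumes the 100 classes are split evenly across the 20 superclasses.

```python
# Figures taken from the dataset description; all names here are hypothetical.
NUM_CLASSES = 100
TRAIN_PER_CLASS = 500
TEST_PER_CLASS = 100
NUM_SUPERCLASSES = 20


def cifar100_summary():
    """Derive dataset-wide totals from the per-class counts."""
    images_per_class = TRAIN_PER_CLASS + TEST_PER_CLASS
    return {
        "images_per_class": images_per_class,
        "train_images": NUM_CLASSES * TRAIN_PER_CLASS,
        "test_images": NUM_CLASSES * TEST_PER_CLASS,
        "total_images": NUM_CLASSES * images_per_class,
        # Assumes an even split of classes across superclasses.
        "classes_per_superclass": NUM_CLASSES // NUM_SUPERCLASSES,
    }


if __name__ == "__main__":
    for key, value in cifar100_summary().items():
        print(f"{key}: {value}")
```

The derived total of 60,000 images agrees with the image count listed for this dataset.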
Images
60,000
Annotations
59,997
Start Project
FaceScrub
A Dataset With Face Images of 530 People
Images
50,092
Start Project
MNIST Digits
The MNIST database of handwritten digits has a training set of 60,000 examples and a test set of 10,000 examples. It is a subset of a larger set available from NIST. The digits have been size-normalized and centered in a fixed-size image.
Images
69,977
Annotations
63,104
Start Project
VOC2010
Images
11,319
Annotations
32,045
Start Project
VOC2012
Images
17,112
Annotations
33,374
Start Project
Microsoft COCO
Microsoft COCO is a large image dataset designed for object detection, segmentation, and caption generation.
Images
409,425
Annotations
78,805
Coming soon
Open Images
Open Images is a dataset of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories.
Images
8,332,243
Coming soon
Kitti
Images
47,962
Annotations
47,628
Coming soon
Spacenet
SpaceNet is a corpus of commercial satellite imagery and labeled training data being made available at no cost to the public to foster innovation in the development of computer vision algorithms to automatically extract information from remote sensing data.
Images
404,534
Coming soon
Udacity
The Udacity dataset contains the car driving imagery from all Udacity challenges, captured from 3 different camera angles, together with ROS-compatible telemetry data (GPS, steering, brake, throttle, gear, speed, IMU, etc.), all interpolated and normalized.
Images
843,483
Annotations
843,092
Coming soon