6 Mins Read

Important Computer Vision Datasets

Hamed Mohamadi

January 26, 2023

Table of contents

Important Computer Vision Datasets

CIFAR-10 and CIFAR-100 datasets

Important Computer Vision Datasets

One of the most rapidly developing artificial intelligence domains is computer vision. In computer vision, datasets are very important. As a result of the advances made in this field, a large amount of data is now available for training computer vision models. This article reviews some of the most famous CV datasets.

MNIST dataset

MNIST is an introductory computer vision dataset containing 70,000 images of handwritten digits. Data preparation was carried out by Professor Yann Lechon. The data is divided into two sets, a training set of 60,000, and a test set of 10,000 images. All figures are placed in the center of the image.

MNIST fashion dataset

This dataset contains grayscale, 28×28 pixels images similar to MNIST images. Various fashion topics are covered, including t-shirts, pants, coats, sandals, blouses, sports shoes, bags, and boots. The data was provided by the Zalando (fashion and clothing store) research team.

CIFAR-10 and CIFAR-100 datasets

The Canadian Institute for Advanced Research has released both CIFAR-10 and CIFAR-100 datasets. CIFAR-10 consists of 60,000 images in 10 categories including airplanes, cars, birds, cats, deer, dogs, frogs, horses, ships, and trucks. CIFAR-100 is a similar dataset with a total of 60,000 images in 100 categories. Since both datasets are available in 32×32 pixels and have 50,000 training images and 10,000 test images with equally divided data ratios, even beginners can easily use them.

ImageNet dataset

This dataset was created for a computer vision competition called the “ImageNet Large Scale Visual Recognition Challenge” which challenges teams in five sections. ImageNet is based on the WordNet lexical database and contains over 1.4 million images in over 220,000 categories which makes it the largest collection of publicly available images.

PASCAL VOC dataset

This dataset has been made available to the public by Pascal Research Institute. There are four different types of images in this dataset: household images, vehicle images, animal images, and human images divided into 20 object classes. While the PASCAL VOC dataset has fewer categories and numbers than ImageNet, it can be used for a broader range of image segmentation and object detection applications.

IMDB-Wiki dataset

This dataset contains 520,000 images of faces taken from IMDB and Wikipedia. It also includes information about the person including name, date of birth, and gender, as well as the position of the face in the image. Gender detection and age estimation are usually performed using this dataset.

Reference

http://yann.lecun.com/exdb/mnist/

https://github.com/zalandoresearch/fashion-mnist

https://www.cs.toronto.edu/~kriz/cifar.html

https://data.vision.ee.ethz.ch/cvl/rrothe/imdb-wiki/

https://www.image-net.org/

http://host.robots.ox.ac.uk/pascal/VOC/

Public

Ensemble Machine Learning

Ensemble machine learning is a powerful technique that leverages the strengths of multiple weak learning models, also known as...

Parisa Sabzeh

February 19, 2023

5 Mins Read

Public

Activation Functions in Neural Network

Activation functions are the main components of neural network nodes. This article examines the various types of activation functions...

Parisa Sabzeh

February 19, 2023

4 Mins Read

Public

Various Types of Regularization

Regularization is a technique used in Machine Learning and Deep Learning models to prevent overfitting. This paper introduces L1,...

Hamed Mohamadi

February 17, 2023

3 Mins Read

Public

Machine Learning Engineers Should Use Docker

Docker is a platform that enables developers to easily create, deploy, and run applications in containers, and has gained...

Hamed Mohamadi

February 17, 2023

4 Mins Read

Public

TPU, GPU, CPU: Which Is Better for Deep Learning?

In this paper, we compare the performance of CPU, GPU, and TPU processors to see which one is better...

Hamed Mohamadi

February 11, 2023

3 Mins Read

Public

The History of AI (part 2)

In the second part of a series of articles about the history of artificial intelligence, we look at important...

Parisa Sabzeh

January 28, 2023

3 Mins Read

Surfing on Categories

Subscribe to our newsletter and get the latest practical content.

You can enter your email address and subscribe to our newsletter and get the latest practical content. You can enter your email address and subscribe to our newsletter.

How does AIEX work?

Revolutionary Indsutry Transformation

Revolutionary Indsutry Transformation

Important Computer Vision Datasets

Important Computer Vision Datasets

MNIST dataset

MNIST fashion dataset

CIFAR-10 and CIFAR-100 datasets

ImageNet dataset

PASCAL VOC dataset

IMDB-Wiki dataset

Reference

About

Contact Info

info@aiex.ai