50+ Object Detection Datasets from different industry domains
Last Updated on October 10, 2020 by Editorial Team
Author(s): Abhishek Annamraju
Computer Vision
A list of object detection and image segmentation datasets (With colab notebooks for training and inference) to explore and experiment with different algorithms on!
Computer Vision is such a fast-paced field that everyday loads of new techniques and algorithms are presented in different conferences and journals. When it comes to object detection, theoretically you learn about multitudes of algorithms like Faster-rcnn, Mask-rcnn, Yolo, SSD, Retinenet, Cascaded-rcnn, Peleenet, EfficientDet, CornerNetβ¦. This list is never-ending!
It is always beneficial to consolidate your learning experience by applying it on different datasets!!!!
This way you tend to understand the algorithms better, plus you get an intuition over which algorithms work on what kind of datasets.
Our opensource team at Monk Computer Vision Org compiled a list of object detection, image segmentation and action recognition datasets and created short tutorials over each of them for you to utilize these datasets and try out different object detection algorithms
Mentioned below is a shortlist of object detection datasets, brief details on the same, and steps to utilize them. The datasets are from the following domains
β
Agriculture
β
Advance Driver Assistance and Self Driving Car Systems
β
Fashion, Retail, and Marketing
β
Wildlife
β
Sports
β
Satellite Imaging
β
Medical Imaging
β
Security and Surveillance
β
Underwater Imaging
β¦.. and much more!!!!!
The complete list at one place is available with associated usage instructions and training codes onΒ github
Agriculture-related datasets
A) Winegrape Detection Dataset
* GoalβββTo detect grape clusters in vineyards
* ApplicationβββTo monitor growth and analyze yield
* Detailsβββ300 images with 4400 bounding boxes over 5 classes of grapes
* How to utilize the dataset and build a custom detector using YoloV3Β pipeline
B) Global Wheat Detection Dataset
* GoalβββTo detect wheat crop from in fields
* ApplicationβββTo monitor growth and analyze yield
* Details β3430 images with 100K+ annotations
* How to utilize the dataset and build a custom detector using EfficientDet-D4 pipeline
Advance Driver Assistance and Self Driving Car Systems relatedΒ Datasets
A) LISA Traffic Sign Detection Dataset
* GoalβββTo detect and classify traffic signs in dash cam images
* ApplicationβββTraffic sign recognition acts as a rule setter for autonomous driving
* Detailsβββ7855 annotations on 6610 frames over 47 US sign types
* How to utilize the dataset and build a custom detector using EfficientDet-D3 pipeline
* This repository has one more dataset
βββLISA Vehicle Detection Data
B) Object Detection in Low Lighting Conditions
* GoalβββTo detect on-road objects in low-lighting conditionsβββfog, dark, haze, rains, etc
* ApplicationβββThis is a crucial component in self-driving vehicles as it pertains to a safer vehicle if itβs capable of detecting objects in adverse conditions
* Details β15K+ annotations on 7500 frames over 12 different object types
* How to utilize the dataset and build a custom detector using EfficientDet-D3 pipeline
C) LARA Traffic Lights Detection Dataset
* GoalβββTo detect traffic lights and classify them as red, green, and yellow
* ApplicationβββThis does rule-setting for adas and self-driving car systems at road network junctions
* Detailsβββ11K frames with 20K+ annotations over three classes of traffic lights
* How to utilize the dataset and build a custom detector using Mmdet-Faster-Rcnn-fpn50 pipeline
D) Person Detection using InfraredΒ Images
* GoalβββTo detect people in infrared imagery
* ApplicationβββAutonomous vehicles are equipped with infrared cams to detect objects in adverse conditions
* Detailsβββ30 video sequences with 1K+ annotations
* How to utilize the dataset and build a custom detector using Mx-RcnnΒ pipeline
E) Pothole Detection Dataset
* GoalβββTo detect potholes from on-road imagery
* ApplicationβββDetecting road terrain and potholes results in smooth driving.
* Detailsβββ700 images with 3K+ annotations on potholes
* How to utilize the dataset and build a custom detector using M-RcnnΒ pipeline
F) Nexet Vehicle Detection Dataset
* GoalβββTo detect vehicles on-road imagery
* ApplicationβββDetecting vehicles is a prime component in autonomous driving
* Detailsβββ7000 images with 15K+ annotations on 6 types of vehicles
* How to utilize the dataset and build a custom detector using Tensorflow Object Detection API
G) BDD100K AdasΒ Dataset
* GoalβββTo detect on-road objects
* ApplicationβββDetecting vehicles, traffic signs, and people is a prime component in autonomous driving
* Details β100K images with 250K+ annotations on 10 types of objects
* How to utilize the dataset and build a custom detector using Tensorflow Object Detection API
H) Linkopings Traffic SignsΒ Dataset
* GoalβββTo detect traffic signs in images
* ApplicationβββDetecting traffic signs is the first step towards understanding traffic rules
* Details β3K images with 5K+ annotations on 40+ types of traffic signs
* How to utilize the dataset and build a custom detector using MmdetβββCascade MaskΒ Rcnn
Fashion, Retail, and Marketing relatedΒ Dataset
A) Billboard Detection (Subsampling OpenImages Dataset)Β Dataset
* GoalβββTo detect billboards in images
* ApplicationβββDetecting billboards forms a crucial part in auto-analyzing marketing campaigns across the city
* Detailsβββ2K images with 5K+ annotations on billboards
* How to utilize the dataset and build a custom detector using Retinanet
B) DeepFashion2 Fashion element Detection Dataset
* GoalβββTo detect fashion products, clothing, and accessories in images
* ApplicationβββFashion detection has huge applications from data sorting to recommendation engines
* Detailsβββ490K images with around 100s of annotation objects classes
* How to utilize the dataset and build a custom detector using CornetNet-Lite Pipeline
* Another Fashion related dataset is Taobao Commodity Dataset
C) Qmul-OpenLogo Logo Detection Dataset
* GoalβββTo detect different logos in natural images
* ApplicationβββAnalyzing frequency of logo appearance in videos and natural scenes is crucial in marketing
* Detailsβββ16K training images with logos from all kinds of brandsβββfood, vehicles, restaurant-chains, delivery services, airlines, etc
* How to utilize the dataset and build a custom detector using mx-rcnnΒ pipeline
Sports-Related Datasets
A) Football Detection Dataset (Subsampling from OpenImages Dataset)
* GoalβββTo detect football across frames in videos
* ApplicationβββDetecting football positions is crucial in auto-analysing situations such as offsides, etc
* DetailsβββAround 3K training images.
* How to utilize the dataset and build a custom detector using yolo-v3Β pipeline
B) Playing Card Type Detection
* GoalβββTo detect playing card in natural images and classify the card type
* ApplicationβββPossible application is in analyzing winning odds in different card games
* Detailsβββ500+ images over 52 card class types
* How to utilize the dataset and build a custom detector using mx-rcnnΒ pipeline
C) Soccer Player Detection in ThermalΒ Imagery
* GoalβββTo localize and track players using thermal imagery
* ApplicationβββTracking players in the game is a crucial part in generating analytics
* Details β3K+ images over 5K+ annotations.
* How to utilize the dataset and build a custom detector using mmdet faster-rcnn pipeline
Security and Surveillance RelatedΒ Datasets
A) MIO-TCD Vehicle Detection in CCTV TrafficΒ Cams
* GoalβββTo detect vehicles in cctv traffic cameras
* ApplicationβββDetecting vehicles in cctv traffic cams forms a crucial part in security surveillance applications
* Detailsβββ113K images with 200K+ annotations on 5+ types of vehicles
* How to utilize the dataset and build a custom detector using MmdetβββRetinanet pipeline
B) WIDER Person Detection Dataset
* GoalβββTo detect people in cctv and natural scene images and videos
* ApplicationβββCCTV based people detection forms the core of security and surveillance applications
* Detailsβββ10K+ images with 20K+ annotations on detecting pedestrians
* How to utilize the dataset and build a custom detector using Cornernet-Lite pipeline
C) Protective GearβββHelmet and Vest Detection
* GoalβββTo detect helmet and vests on people
* ApplicationβββThis forms an integral part in security compliance monitoring
* Detailsβββ1.5K+ images with 2K+ annotations on detecting people, helmets, and vests
* How to utilize the dataset and build a custom detector using MmdetβββCascadeΒ RPN
D) Anomaly Detection inΒ Videos
* GoalβββTo classify videos as per actions being carried out in videos
* ApplicationβββDetecting anomalies in real time helps in stopping crime
* Detailsβββ1K+ videos corresponding to 10 anomaly classes.
* How to utilize the dataset and build a custom classifier using mmaction-tsn50 pipeline
Medical ImagingΒ Datasets
A) Ultrasound Brachial Plexus (BP) Nerve Segmentation Dataset
* GoalβββTo segment certain nerve types in ultrasound images
* ApplicationβββThis helps in improving pain management through the use of indwelling catheters that block or mitigate pain at the source.
* Detailsβββ11K+ images with associated instance masks for detecting nerves
* How to utilize the dataset and build a customΒ detector
B) PanNuke Cancer Instance Segmentation inΒ Cells
* GoalβββTo segment different cell types in the slide image
* ApplicationβββAuto-analyzing presence of cancerous and dead cells in terabytes of data
* Detailsβββ3K+ images with associated instance masks for detecting different cell types
* How to utilize the dataset and build a customΒ detector
Satellite ImagingΒ Datasets
A) Road Segmentation in Satellite Imagery
* GoalβββTo segment road lines in satellite imagery
* ApplicationβββHelps in urban planning and monitoring roadways
* Detailsβββ1K+ images with associated instance masks for detecting different road regions
* How to utilize the dataset and build a customΒ detector
B) Traversable region segmentation in Synthetically generated lunarΒ imagery
* GoalβββTo segment out rocks and find traversable region in lunar imagery
* ApplicationβββEssential element in autonomous roversβ path planning
* Detailsβββ10K+ images with associated instance masks for detecting different rocks and flat ground
* How to utilize the dataset and build a customΒ detector
C) Cars and Swimming Pools Detection in Satellite Imagery
* GoalβββTo detect vehicles and pools in satellite imagery
* ApplicationβββThis forms a crucial part in property tax estimation
* Detailsβββ3.5K+ images with 5K+ annotations labels on cars and pools
* How to utilize the dataset and build a custom detector using cornernet-lite pipeline
D) Roads and Residential area segmentation in AerialΒ Imagery
* GoalβββTo segment road and residential areas in satellite imagery
* ApplicationβββThis forms a crucial part in property tax estimation
* Detailsβββ100 very high resolution images with segmentation masks
* How to utilize the dataset and build a customΒ detector
* Another similar road segmentation dataset and associated trainingΒ code
E) Water Body Segmentation in satellite imagery
* GoalβββTo segment water bodies in satellite imagery
* ApplicationβββVery important to understand how water bodies change and evolve over time
* Detailsβββ100 very high resolution images with segmentation masks
* How to utilize the dataset and build a customΒ detector
* Another such dataset is DeepGlobe Land Cover Classification and itβs associated usage guidelines
Wildlife RelatedΒ Datasets
A) Tiger Detection Dataset (Subsampled from OpenImages)
* GoalβββTo detect tigers in natural and drone images
* ApplicationβββTo monitor endangered species
* Detailsβββ2K+ images with 4k+ annotations.
* How to utilize the dataset and build a custom detector using cornernet-lite pipeline
* One more such dataset could be Monkey detection dataset and itβs associated tutorial
B) Zebras and Giraffes Detection Dataset
* GoalβββTo detect zebra and giraffe species in natural and drone images
* ApplicationβββTo monitor endangered species
* Detailsβββ5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet-d3 pipeline
C) Caltech Cameratrap Dataset
* GoalβββTo detect animals in trap camera types images
* ApplicationβββTo monitor endangered species
* Detailsβββ10K+ images with 8k+ annotations.
* How to utilize the dataset and build a custom detector using retinanet pipeline
* One more such cameratrap dataset and associated trainingΒ code
D) Elephant Detection Dataset (Subsampled from COCOΒ dataset)
* GoalβββTo detect elephant species in natural and drone images
* ApplicationβββTo monitor endangered species
* Detailsβββ5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using mmdet-maskrcnn
Underwater Datasets
A) Detecting Sea Turtles in theΒ wild
* GoalβββTo detect sea turtles in underwater images
* ApplicationβββTo monitor endangered species
* Detailsβββ5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet
* A similar dataset to monitor fish species underwater and associated utilization code
B) Underwater trash detection Dataset
* GoalβββTo detect marine trash
* ApplicationβββTo monitor and control marine waste issue
* Detailsβββ2K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet
* A more complex pixel based trash segmentation dataset and associated codes
C) SUIM underwater object detection dataset
* GoalβββTo segment underwater objects
* ApplicationβββPath planning for autonomous underwater vehicles, track divers and monitor marine species
* Detailsβββ1.5K+ images with 1.5k+ annotation masks.
* How to utilize the dataset and build a customΒ detector
D) Brackish underwater fish recognition dataset
* GoalβββTo detect marine species in underwater imagery.
* ApplicationβββTo monitor marine species
* Detailsβββ89 videos to detect fish, crab, shrimp, jellyfish, starfish
* How to utilize the dataset and build a custom detector using mmdetβββfaster rcnnΒ pipeline
Text Analysis relatedΒ datasets
A) Document Layout Detection Dataset
* GoalβββTo detect document layout for further analysis
* ApplicationβββEssential to segment images into different parts so that certain rule based nlp and text recognition can further be applied.
* Detailsβββ5K+ images with 10k+ annotations with labels such as paragraphs, images, headers.
* How to utilize the dataset and build a custom detector usingΒ mx-rcnn
* A very similar dataset exists for graphical components detection in documents named IIIT-AR-13K, hereβs how to utilize the dataset and train a model onΒ it
B) Total-Text Dataset
* GoalβββTo localize text in natural scenes
* ApplicationβββEssential base component to recognize using OCR
* Detailsβββ1.5K+ images with 5K+ polygonal annotations
* How to utilize the dataset and build a custom detector using Text-Snake pipeline
C) YY-Mnist Simple OCRΒ Dataset
* GoalβββTo localize number and classify them in white backgroud images
* ApplicationβββEssential base component to recognize using OCR
* Detailsβββ1K images with 2K+ annotations over 10 classes
* How to utilize the dataset and build a custom detector using Retinanet pipeline
Other Datasets
A) TACO Trash Detection Dataset
* GoalβββTo localize and segment all kinds of garbage in images
* ApplicationβββCritical component in autonomous bots trying to tackle trash problem in public places
* Detailsβββ10K images with 15K+ annotations over 20+ different classes trash objects
* How to utilize the dataset and build a custom detector using Retinanet pipeline
B) Indoor Scene General Object Detection Dataset
* GoalβββTo localize and detect indoor objects in images
* ApplicationβββAutotag images in real-estate and rental websites with amenities
* Detailsβββ3K+ images with 5K+ annotations over 10+ different classes indoor objects such as electronic-appliances, bed, curtains, chairs, etc
* How to utilize the dataset and build a custom detector using Retinanet pipeline
C) EgoHands Hand Segmentation Dataset
* GoalβββTo segment hands in natural scenes
* ApplicationβββFirst step towards understanding gestures, with applications in human computer interaction, sign language recognition
* Detailsβββ4.8K+ images with corresponding hand masks.
* How to utilize the dataset and build a custom detector using Retinanet pipeline
D) UCF Action recognition dataset
* GoalβββTo classify videos as per actions being carried out in videos
* ApplicationβββTagging videos is important in storing and retrieving large number of videos
* Detailsβββ1K+ videos corresponding to 101 action type classes.
* How to utilize the dataset and build a custom classifier using mmaction-tsn50 pipeline
E) Oil TanksΒ Datasets
* GoalβββTo detect tanks in satellite imagery
* ApplicationβββTo keep track of oil tanks
* Detailsβββ10K+ images with 10K+ annotations.
* How to utilize the dataset and build a custom classifier using retinanet pipeline
Other action recognition datasets
A) STAIR Action Recognition dataset and how to train a model onΒ it
B) A2D Action Recognition dataset and how to train a model onΒ it
C) KTH Action Recognition dataset and how to train a model onΒ it
APPENDIX
For more details on the tutorials visit our GithubΒ page
Tutorial Credits to all the opensource contributors at the Monk Object Detection Library
50+ Object Detection Datasets from different industry domains was originally published in Towards AIβββMultidisciplinary Science Journal on Medium, where people are continuing the conversation by highlighting and responding to this story.
Published via Towards AI