Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take our 85+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Publication

50+ Object Detection Datasets from different industry domains
Computer Vision

50+ Object Detection Datasets from different industry domains

Last Updated on October 10, 2020 by Editorial Team

Author(s): Abhishek Annamraju

Computer Vision

A list of object detection and image segmentation datasets (With colab notebooks for training and inference) to explore and experiment with different algorithms on!

Free to use Image.Β Credits

Computer Vision is such a fast-paced field that everyday loads of new techniques and algorithms are presented in different conferences and journals. When it comes to object detection, theoretically you learn about multitudes of algorithms like Faster-rcnn, Mask-rcnn, Yolo, SSD, Retinenet, Cascaded-rcnn, Peleenet, EfficientDet, CornerNet…. This list is never-ending!

It is always beneficial to consolidate your learning experience by applying it on different datasets!!!!

This way you tend to understand the algorithms better, plus you get an intuition over which algorithms work on what kind of datasets.

Our opensource team at Monk Computer Vision Org compiled a list of object detection, image segmentation and action recognition datasets and created short tutorials over each of them for you to utilize these datasets and try out different object detection algorithms

Mentioned below is a shortlist of object detection datasets, brief details on the same, and steps to utilize them. The datasets are from the following domains

β˜… Agriculture
β˜… Advance Driver Assistance and Self Driving Car Systems
β˜… Fashion, Retail, and Marketing
β˜… Wildlife
β˜… Sports
β˜… Satellite Imaging
β˜… Medical Imaging
β˜… Security and Surveillance
β˜… Underwater Imaging

….. and much more!!!!!

The complete list at one place is available with associated usage instructions and training codes onΒ github

Agriculture-related datasets

A) Winegrape Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect grape clusters in vineyards
* Applicationβ€Šβ€”β€ŠTo monitor growth and analyze yield
* Detailsβ€Šβ€”β€Š300 images with 4400 bounding boxes over 5 classes of grapes
* How to utilize the dataset and build a custom detector using YoloV3Β pipeline

B) Global Wheat Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect wheat crop from in fields
* Applicationβ€Šβ€”β€ŠTo monitor growth and analyze yield
* Details β€”3430 images with 100K+ annotations
* How to utilize the dataset and build a custom detector using EfficientDet-D4 pipeline

Advance Driver Assistance and Self Driving Car Systems relatedΒ Datasets

A) LISA Traffic Sign Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect and classify traffic signs in dash cam images
* Applicationβ€Šβ€”β€ŠTraffic sign recognition acts as a rule setter for autonomous driving
* Detailsβ€Šβ€”β€Š7855 annotations on 6610 frames over 47 US sign types
* How to utilize the dataset and build a custom detector using EfficientDet-D3 pipeline

* This repository has one more dataset
β€Šβ€”β€Š
LISA Vehicle Detection Data

B) Object Detection in Low Lighting Conditions

Demo

* Goalβ€Šβ€”β€ŠTo detect on-road objects in low-lighting conditionsβ€Šβ€”β€Šfog, dark, haze, rains, etc
* Applicationβ€Šβ€”β€ŠThis is a crucial component in self-driving vehicles as it pertains to a safer vehicle if it’s capable of detecting objects in adverse conditions
* Details β€”15K+ annotations on 7500 frames over 12 different object types
* How to utilize the dataset and build a custom detector using EfficientDet-D3 pipeline

C) LARA Traffic Lights Detection Dataset

* Goalβ€Šβ€”β€ŠTo detect traffic lights and classify them as red, green, and yellow
* Applicationβ€Šβ€”β€ŠThis does rule-setting for adas and self-driving car systems at road network junctions
* Detailsβ€Šβ€”β€Š11K frames with 20K+ annotations over three classes of traffic lights
* How to utilize the dataset and build a custom detector using Mmdet-Faster-Rcnn-fpn50 pipeline

D) Person Detection using InfraredΒ Images

Demo

* Goalβ€Šβ€”β€ŠTo detect people in infrared imagery
* Applicationβ€Šβ€”β€ŠAutonomous vehicles are equipped with infrared cams to detect objects in adverse conditions
* Detailsβ€Šβ€”β€Š30 video sequences with 1K+ annotations
* How to utilize the dataset and build a custom detector using Mx-RcnnΒ pipeline

E) Pothole Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect potholes from on-road imagery
* Applicationβ€Šβ€”β€ŠDetecting road terrain and potholes results in smooth driving.
* Detailsβ€Šβ€”β€Š700 images with 3K+ annotations on potholes
* How to utilize the dataset and build a custom detector using M-RcnnΒ pipeline

F) Nexet Vehicle Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect vehicles on-road imagery
* Applicationβ€Šβ€”β€ŠDetecting vehicles is a prime component in autonomous driving
* Detailsβ€Šβ€”β€Š7000 images with 15K+ annotations on 6 types of vehicles
* How to utilize the dataset and build a custom detector using Tensorflow Object Detection API

G) BDD100K AdasΒ Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect on-road objects
* Applicationβ€Šβ€”β€ŠDetecting vehicles, traffic signs, and people is a prime component in autonomous driving
* Details β€”100K images with 250K+ annotations on 10 types of objects
* How to utilize the dataset and build a custom detector using Tensorflow Object Detection API

H) Linkopings Traffic SignsΒ Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect traffic signs in images
* Applicationβ€Šβ€”β€ŠDetecting traffic signs is the first step towards understanding traffic rules
* Details β€”3K images with 5K+ annotations on 40+ types of traffic signs
* How to utilize the dataset and build a custom detector using Mmdetβ€Šβ€”β€ŠCascade MaskΒ Rcnn

Fashion, Retail, and Marketing relatedΒ Dataset

A) Billboard Detection (Subsampling OpenImages Dataset)Β Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect billboards in images
* Applicationβ€Šβ€”β€ŠDetecting billboards forms a crucial part in auto-analyzing marketing campaigns across the city
* Detailsβ€Šβ€”β€Š2K images with 5K+ annotations on billboards
* How to utilize the dataset and build a custom detector using Retinanet

B) DeepFashion2 Fashion element Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect fashion products, clothing, and accessories in images
* Applicationβ€Šβ€”β€ŠFashion detection has huge applications from data sorting to recommendation engines
* Detailsβ€Šβ€”β€Š490K images with around 100s of annotation objects classes
* How to utilize the dataset and build a custom detector using CornetNet-Lite Pipeline

* Another Fashion related dataset is Taobao Commodity Dataset

C) Qmul-OpenLogo Logo Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect different logos in natural images
* Applicationβ€Šβ€”β€ŠAnalyzing frequency of logo appearance in videos and natural scenes is crucial in marketing
* Detailsβ€Šβ€”β€Š16K training images with logos from all kinds of brandsβ€Šβ€”β€Šfood, vehicles, restaurant-chains, delivery services, airlines, etc
* How to utilize the dataset and build a custom detector using mx-rcnnΒ pipeline

Sports-Related Datasets

A) Football Detection Dataset (Subsampling from OpenImages Dataset)

Demo

* Goalβ€Šβ€”β€ŠTo detect football across frames in videos
* Applicationβ€Šβ€”β€ŠDetecting football positions is crucial in auto-analysing situations such as offsides, etc
* Detailsβ€Šβ€”β€ŠAround 3K training images.
* How to utilize the dataset and build a custom detector using yolo-v3Β pipeline

B) Playing Card Type Detection

Demo

* Goalβ€Šβ€”β€ŠTo detect playing card in natural images and classify the card type
* Applicationβ€Šβ€”β€ŠPossible application is in analyzing winning odds in different card games
* Detailsβ€Šβ€”β€Š500+ images over 52 card class types
* How to utilize the dataset and build a custom detector using mx-rcnnΒ pipeline

C) Soccer Player Detection in ThermalΒ Imagery

Demo

* Goalβ€Šβ€”β€ŠTo localize and track players using thermal imagery
* Applicationβ€Šβ€”β€ŠTracking players in the game is a crucial part in generating analytics
* Details β€”3K+ images over 5K+ annotations.
* How to utilize the dataset and build a custom detector using mmdet faster-rcnn pipeline

Security and Surveillance RelatedΒ Datasets

A) MIO-TCD Vehicle Detection in CCTV TrafficΒ Cams

Demo

* Goalβ€Šβ€”β€ŠTo detect vehicles in cctv traffic cameras
* Applicationβ€Šβ€”β€ŠDetecting vehicles in cctv traffic cams forms a crucial part in security surveillance applications
* Detailsβ€Šβ€”β€Š113K images with 200K+ annotations on 5+ types of vehicles
* How to utilize the dataset and build a custom detector using Mmdetβ€Šβ€”β€ŠRetinanet pipeline

B) WIDER Person Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect people in cctv and natural scene images and videos
* Applicationβ€Šβ€”β€ŠCCTV based people detection forms the core of security and surveillance applications
* Detailsβ€Šβ€”β€Š10K+ images with 20K+ annotations on detecting pedestrians
* How to utilize the dataset and build a custom detector using Cornernet-Lite pipeline

C) Protective Gearβ€Šβ€”β€ŠHelmet and Vest Detection

Demo

* Goalβ€Šβ€”β€ŠTo detect helmet and vests on people
* Applicationβ€Šβ€”β€ŠThis forms an integral part in security compliance monitoring
* Detailsβ€Šβ€”β€Š1.5K+ images with 2K+ annotations on detecting people, helmets, and vests
* How to utilize the dataset and build a custom detector using Mmdetβ€Šβ€”β€ŠCascadeΒ RPN

D) Anomaly Detection inΒ Videos

Sample Visualization

* Goalβ€Šβ€”β€ŠTo classify videos as per actions being carried out in videos
* Applicationβ€Šβ€”β€ŠDetecting anomalies in real time helps in stopping crime
* Detailsβ€Šβ€”β€Š1K+ videos corresponding to 10 anomaly classes.
* How to utilize the dataset and build a custom classifier using mmaction-tsn50 pipeline

Medical ImagingΒ Datasets

A) Ultrasound Brachial Plexus (BP) Nerve Segmentation Dataset

Demo

* Goalβ€Šβ€”β€ŠTo segment certain nerve types in ultrasound images
* Applicationβ€Šβ€”β€ŠThis helps in improving pain management through the use of indwelling catheters that block or mitigate pain at the source.
* Detailsβ€Šβ€”β€Š11K+ images with associated instance masks for detecting nerves
* How to utilize the dataset and build a customΒ detector

B) PanNuke Cancer Instance Segmentation inΒ Cells

Demo

* Goalβ€Šβ€”β€ŠTo segment different cell types in the slide image
* Applicationβ€Šβ€”β€ŠAuto-analyzing presence of cancerous and dead cells in terabytes of data
* Detailsβ€Šβ€”β€Š3K+ images with associated instance masks for detecting different cell types
* How to utilize the dataset and build a customΒ detector

Satellite ImagingΒ Datasets

A) Road Segmentation in Satellite Imagery

Demo

* Goalβ€Šβ€”β€ŠTo segment road lines in satellite imagery
* Applicationβ€Šβ€”β€ŠHelps in urban planning and monitoring roadways
* Detailsβ€Šβ€”β€Š1K+ images with associated instance masks for detecting different road regions
* How to utilize the dataset and build a customΒ detector

B) Traversable region segmentation in Synthetically generated lunarΒ imagery

Demo

* Goalβ€Šβ€”β€ŠTo segment out rocks and find traversable region in lunar imagery
* Applicationβ€Šβ€”β€ŠEssential element in autonomous rovers’ path planning
* Detailsβ€Šβ€”β€Š10K+ images with associated instance masks for detecting different rocks and flat ground
* How to utilize the dataset and build a customΒ detector

C) Cars and Swimming Pools Detection in Satellite Imagery

Demo

* Goalβ€Šβ€”β€ŠTo detect vehicles and pools in satellite imagery
* Applicationβ€Šβ€”β€ŠThis forms a crucial part in property tax estimation
* Detailsβ€Šβ€”β€Š3.5K+ images with 5K+ annotations labels on cars and pools
* How to utilize the dataset and build a custom detector using cornernet-lite pipeline

D) Roads and Residential area segmentation in AerialΒ Imagery

Demo

* Goalβ€Šβ€”β€ŠTo segment road and residential areas in satellite imagery
* Applicationβ€Šβ€”β€ŠThis forms a crucial part in property tax estimation
* Detailsβ€Šβ€”β€Š100 very high resolution images with segmentation masks
* How to utilize the dataset and build a customΒ detector

* Another similar road segmentation dataset and associated trainingΒ code

E) Water Body Segmentation in satellite imagery

Demo

* Goalβ€Šβ€”β€ŠTo segment water bodies in satellite imagery
* Applicationβ€Šβ€”β€ŠVery important to understand how water bodies change and evolve over time
* Detailsβ€Šβ€”β€Š100 very high resolution images with segmentation masks
* How to utilize the dataset and build a customΒ detector

* Another such dataset is DeepGlobe Land Cover Classification and it’s associated usage guidelines

Wildlife RelatedΒ Datasets

A) Tiger Detection Dataset (Subsampled from OpenImages)

Demo

* Goalβ€Šβ€”β€ŠTo detect tigers in natural and drone images
* Applicationβ€Šβ€”β€ŠTo monitor endangered species
* Detailsβ€Šβ€”β€Š2K+ images with 4k+ annotations.
* How to utilize the dataset and build a custom detector using cornernet-lite pipeline

* One more such dataset could be Monkey detection dataset and it’s associated tutorial

B) Zebras and Giraffes Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect zebra and giraffe species in natural and drone images
* Applicationβ€Šβ€”β€ŠTo monitor endangered species
* Detailsβ€Šβ€”β€Š5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet-d3 pipeline

C) Caltech Cameratrap Dataset

Image Credits

* Goalβ€Šβ€”β€ŠTo detect animals in trap camera types images
* Applicationβ€Šβ€”β€ŠTo monitor endangered species
* Detailsβ€Šβ€”β€Š10K+ images with 8k+ annotations.
* How to utilize the dataset and build a custom detector using retinanet pipeline

* One more such cameratrap dataset and associated trainingΒ code

D) Elephant Detection Dataset (Subsampled from COCOΒ dataset)

Demo

* Goalβ€Šβ€”β€ŠTo detect elephant species in natural and drone images
* Applicationβ€Šβ€”β€ŠTo monitor endangered species
* Detailsβ€Šβ€”β€Š5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using mmdet-maskrcnn

Underwater Datasets

A) Detecting Sea Turtles in theΒ wild

Demo

* Goalβ€Šβ€”β€ŠTo detect sea turtles in underwater images
* Applicationβ€Šβ€”β€ŠTo monitor endangered species
* Detailsβ€Šβ€”β€Š5K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet

* A similar dataset to monitor fish species underwater and associated utilization code

B) Underwater trash detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect marine trash
* Applicationβ€Šβ€”β€ŠTo monitor and control marine waste issue
* Detailsβ€Šβ€”β€Š2K+ images with 5k+ annotations.
* How to utilize the dataset and build a custom detector using efficientdet

* A more complex pixel based trash segmentation dataset and associated codes

C) SUIM underwater object detection dataset

Image Credits

* Goalβ€Šβ€”β€ŠTo segment underwater objects
* Applicationβ€Šβ€”β€ŠPath planning for autonomous underwater vehicles, track divers and monitor marine species
* Detailsβ€Šβ€”β€Š1.5K+ images with 1.5k+ annotation masks.
* How to utilize the dataset and build a customΒ detector

D) Brackish underwater fish recognition dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect marine species in underwater imagery.
* Applicationβ€Šβ€”β€ŠTo monitor marine species
* Detailsβ€Šβ€”β€Š89 videos to detect fish, crab, shrimp, jellyfish, starfish
* How to utilize the dataset and build a custom detector using mmdetβ€Šβ€”β€Šfaster rcnnΒ pipeline

Text Analysis relatedΒ datasets

A) Document Layout Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo detect document layout for further analysis
* Applicationβ€Šβ€”β€ŠEssential to segment images into different parts so that certain rule based nlp and text recognition can further be applied.
* Detailsβ€Šβ€”β€Š5K+ images with 10k+ annotations with labels such as paragraphs, images, headers.
* How to utilize the dataset and build a custom detector usingΒ mx-rcnn

* A very similar dataset exists for graphical components detection in documents named IIIT-AR-13K, here’s how to utilize the dataset and train a model onΒ it

B) Total-Text Dataset

Demo

* Goalβ€Šβ€”β€ŠTo localize text in natural scenes
* Applicationβ€Šβ€”β€ŠEssential base component to recognize using OCR
* Detailsβ€Šβ€”β€Š1.5K+ images with 5K+ polygonal annotations
* How to utilize the dataset and build a custom detector using Text-Snake pipeline

C) YY-Mnist Simple OCRΒ Dataset

Demo

* Goalβ€Šβ€”β€ŠTo localize number and classify them in white backgroud images
* Applicationβ€Šβ€”β€ŠEssential base component to recognize using OCR
* Detailsβ€Šβ€”β€Š1K images with 2K+ annotations over 10 classes
* How to utilize the dataset and build a custom detector using Retinanet pipeline

Other Datasets

A) TACO Trash Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo localize and segment all kinds of garbage in images
* Applicationβ€Šβ€”β€ŠCritical component in autonomous bots trying to tackle trash problem in public places
* Detailsβ€Šβ€”β€Š10K images with 15K+ annotations over 20+ different classes trash objects
* How to utilize the dataset and build a custom detector using Retinanet pipeline

B) Indoor Scene General Object Detection Dataset

Demo

* Goalβ€Šβ€”β€ŠTo localize and detect indoor objects in images
* Applicationβ€Šβ€”β€ŠAutotag images in real-estate and rental websites with amenities
* Detailsβ€Šβ€”β€Š3K+ images with 5K+ annotations over 10+ different classes indoor objects such as electronic-appliances, bed, curtains, chairs, etc
* How to utilize the dataset and build a custom detector using Retinanet pipeline

C) EgoHands Hand Segmentation Dataset

Demo (ImageΒ Credits)

* Goalβ€Šβ€”β€ŠTo segment hands in natural scenes
* Applicationβ€Šβ€”β€ŠFirst step towards understanding gestures, with applications in human computer interaction, sign language recognition
* Detailsβ€Šβ€”β€Š4.8K+ images with corresponding hand masks.
* How to utilize the dataset and build a custom detector using Retinanet pipeline

D) UCF Action recognition dataset

Demo

* Goalβ€Šβ€”β€ŠTo classify videos as per actions being carried out in videos
* Applicationβ€Šβ€”β€ŠTagging videos is important in storing and retrieving large number of videos
* Detailsβ€Šβ€”β€Š1K+ videos corresponding to 101 action type classes.
* How to utilize the dataset and build a custom classifier using mmaction-tsn50 pipeline

E) Oil TanksΒ Datasets

Demo

* Goalβ€Šβ€”β€ŠTo detect tanks in satellite imagery
* Applicationβ€Šβ€”β€ŠTo keep track of oil tanks
* Detailsβ€Šβ€”β€Š10K+ images with 10K+ annotations.
* How to utilize the dataset and build a custom classifier using retinanet pipeline

Other action recognition datasets

A) STAIR Action Recognition dataset and how to train a model onΒ it

B) A2D Action Recognition dataset and how to train a model onΒ it

C) KTH Action Recognition dataset and how to train a model onΒ it

APPENDIX

For more details on the tutorials visit our GithubΒ page

Tutorial Credits to all the opensource contributors at the Monk Object Detection Library


50+ Object Detection Datasets from different industry domains was originally published in Towards AIβ€Šβ€”β€ŠMultidisciplinary Science Journal on Medium, where people are continuing the conversation by highlighting and responding to this story.

Published via Towards AI

Feedback ↓