Car and Pool Detector Using Monk AI
Last Updated on January 7, 2023 by Editorial Team
Author(s): Omkar Tupe
Computer Vision
About the project
This project focuses on detecting cars and pools in satellite images using CornerNet [1]. Performing object detection by coding everything from scratch can be difficult and tedious for someone not well acquainted with the field. With Monk AI [2], various computer vision tasks such as object detection can be accomplished with very few lines of code. This project can help you gain a better understanding of the MonkAI toolkit. Through this blog, I will share some insights about MonkAI and how it can be used to simplify object detection and build other computer vision applications.
Tutorial available on GitHub.
Features of Monk AI
- A low-code programming environment.
- A common syntax for accessing PyTorch, MXNet, Keras, TensorFlow, and other frameworks.
- For competition and hackathon participants: the hassle-free setup makes prototyping faster and easier.
The Dataset
For this project, satellite images are used to train the model to detect cars and pools. Annotations are stored in VOC format. The dataset has 3748 training images and 2703 test images and is available on Kaggle.
CornerNet
CornerNet is an approach to object detection that detects an object's bounding box as a pair of keypoints, the top-left corner and the bottom-right corner, using a single convolutional neural network. By detecting objects as paired keypoints, it eliminates the need to design the set of anchor boxes commonly used in earlier single-stage detectors.
Table of contents
1. Installation instructions
2. Use an already trained model
3. Train a custom detector
   - Annotation conversion (VOC to COCO via Monk format)
   - Training
4. Inference
1. Installation instructions
Here we use Google Colab for training because it provides a CUDA GPU, but a local machine or a Kaggle notebook works as well. First, we set up the MonkAI toolkit and its dependencies on Colab.
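The setup looks roughly like the following. This is a minimal sketch: the pipeline folder name (assumed here to be 6_cornernet_lite) and the requirements file path are based on the Monk_Object_Detection repository layout and should be verified against the current repo.

```python
# Minimal Colab setup sketch (folder names are assumptions -- check the repo layout)
! git clone https://github.com/Tessellate-Imaging/Monk_Object_Detection.git

# Install the CornerNet-Lite pipeline dependencies one by one
! cd Monk_Object_Detection/6_cornernet_lite/installation && cat requirements.txt | xargs -n 1 -L 1 pip install
```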
2. Use an already trained model
Before training anything, Monk lets us run a pre-trained model to see what the detection results look like and to demonstrate the application.
Downloading the pre-trained model.
Unzip the folder.
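A sketch of the download-and-unzip step is below. The archive URL is a placeholder for the link shared in the original tutorial notebook.

```python
# Fetch and unzip the shared pre-trained model (the URL below is a placeholder)
! wget -O obj_satellite_car_pool_trained.zip "<pretrained-model-archive-url>"
! unzip -qq obj_satellite_car_pool_trained.zip
```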
The obj_satellite_car_pool_trained folder contains the pre-trained model file and some test images.
Setting up a detector
From this folder, we use the weight file obj_satellite_car_pool_trained/CornerNet_Saccade_final-1000.pkl.
From the same unzipped folder, we pick a few images for inference.
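A minimal inference sketch follows. It is based on Monk's infer_detector wrapper for the CornerNet-Lite pipeline, but the exact argument names (base, model_path, vis_thresh, output_img) and the test image filename are assumptions to be checked against the tutorial notebook.

```python
import sys
sys.path.append("Monk_Object_Detection/6_cornernet_lite/lib/")

from infer_detector import Infer  # Monk's CornerNet-Lite inference wrapper

gtf = Infer()
class_list = ["car", "pool"]

# Load the pre-trained weights (argument names are assumptions)
gtf.Model(class_list,
          base="CornerNet_Saccade",
          model_path="obj_satellite_car_pool_trained/CornerNet_Saccade_final-1000.pkl")

# Run detection on one of the bundled test images (hypothetical filename)
boxes = gtf.Predict("obj_satellite_car_pool_trained/test1.jpg",
                    vis_thresh=0.3,
                    output_img="output.jpg")
```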
Inference-1
Inference-2
3. Train a custom detector
We are using a dataset hosted on Kaggle, so we need to set up the Kaggle API on Colab.
Please follow the steps below to download and use Kaggle data within Google Colab [4]:
- Go to your Kaggle account page, scroll to the API section, and click Expire API Token to remove previous tokens.
- Click Create New API Token. This downloads a kaggle.json file to your machine.
- Go to your Google Colab project and run the following commands:
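These are the standard Kaggle-on-Colab commands described in reference [4]:

```python
# Upload the kaggle.json you just downloaded via the Colab file picker
from google.colab import files
files.upload()

# Install the Kaggle CLI and move the token where it expects to find it
! pip install -q kaggle
! mkdir -p ~/.kaggle
! cp kaggle.json ~/.kaggle/
! chmod 600 ~/.kaggle/kaggle.json
```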
Time to download your dataset
Go to the dataset you want to download on Kaggle and copy the API command that Kaggle provides. It should look like the following:
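For the swimming-pool-and-car dataset used in this project, the download plus a typical unzip step look like this (the target folder name "dataset" is my own choice):

```python
# Download the dataset with the Kaggle API and unzip it into a working folder
! kaggle datasets download -d kbhartiya83/swimming-pool-and-car-detection
! unzip -qq swimming-pool-and-car-detection.zip -d dataset
```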
To train a model using CornerNet-Saccade, the annotations must be in COCO format, but our annotations are in VOC format. Hence, we convert VOC to COCO via the intermediate Monk format.
VOC format (dataset directory structure)
Monk format (intermediate annotations)
COCO format (desired annotations)
Annotation Conversion
You can find the detailed code for this on GitHub.
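As an illustration of the first step (VOC XML to the Monk CSV format, one row per image with space-separated "x1 y1 x2 y2 class" labels), a compressed sketch follows. The directory names are assumptions about the unzipped Kaggle dataset, and the Monk-to-COCO step itself is handled by the converter code linked above.

```python
import os
import glob
import xml.etree.ElementTree as ET
import pandas as pd

# Assumed location of the VOC XML files inside the unzipped dataset -- adjust to the real layout
ANNOT_DIR = "dataset/training_data/labels"

rows = []
for xml_path in glob.glob(os.path.join(ANNOT_DIR, "*.xml")):
    root = ET.parse(xml_path).getroot()
    filename = root.findtext("filename")
    labels = []
    for obj in root.findall("object"):
        name = obj.findtext("name")                      # "car" or "pool"
        box = obj.find("bndbox")
        x1, y1 = box.findtext("xmin"), box.findtext("ymin")
        x2, y2 = box.findtext("xmax"), box.findtext("ymax")
        labels.append(" ".join([x1, y1, x2, y2, name]))
    rows.append([filename, " ".join(labels)])

# Monk format: one row per image, all boxes in a single space-separated column
# (column names "ID"/"Label" follow Monk's convention as I recall it)
pd.DataFrame(rows, columns=["ID", "Label"]).to_csv("train_labels.csv", index=False)
```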
Training
Using Monk AI we can build the training pipeline, choose a model, and set hyperparameters in a few steps:
1. Import the dependencies.
2. Set the detector path.
3. Set the dataset and annotation paths.
4. Select the model (here we use CornerNet_Saccade).
5. Set the hyperparameters (learning rate 0.00025, 10000 total iterations).
6. Complete the setup for training: annotations are loaded into memory, the index is created, and the model is loaded.
7. Start training (a sketch of these steps follows this list).
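The whole sequence looks roughly like the following. The method names follow Monk's train_detector wrapper for the CornerNet-Lite pipeline, but the exact signatures and the directory names (root_dir, coco_dir, and so on) are assumptions that should be checked against the notebook on GitHub.

```python
import sys
sys.path.append("Monk_Object_Detection/6_cornernet_lite/lib/")

from train_detector import Detector  # Monk's CornerNet-Lite training wrapper

gtf = Detector()

# Dataset converted to COCO layout (directory names are assumptions)
root_dir = "dataset"
coco_dir = "coco_data"
img_dir = "./"
set_dir = "images"
gtf.Train_Dataset(root_dir, coco_dir, img_dir, set_dir)

# Model selection and hyperparameters used in this project (exact signatures may differ)
gtf.Model("CornerNet_Saccade")
gtf.Hyper_Params(lr=0.00025, total_iterations=10000)

gtf.Setup()   # loads annotations into memory, builds the index, loads the model
gtf.Train()   # weights are written to cache/nnet/CornerNet_Saccade/
```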
In the training logs we can see all the hyperparameter settings. The total number of trainable parameters is 116,849,063. It is good practice to shuffle the data at every iteration, so the dataset's indices are shuffled.
Intermediate and final weight files are stored in cache/nnet/CornerNet_Saccade/.
4. Inference
Inference is similar to the pre-trained case, except that we now use our own trained model, so the model path is different.
1. Set the detector path.
2. Define the classes.
3. Set the trained model path.
4. Provide some test images for testing (see the sketch after this list).
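A hedged sketch of inference with our own weights is below; the final weight filename and the test image name are assumptions.

```python
import sys
sys.path.append("Monk_Object_Detection/6_cornernet_lite/lib/")

from infer_detector import Infer

gtf = Infer()
class_list = ["car", "pool"]

# Point to the weights produced by our training run (filename is an assumption)
gtf.Model(class_list,
          base="CornerNet_Saccade",
          model_path="cache/nnet/CornerNet_Saccade/CornerNet_Saccade_final.pkl")

# Threshold of 0.3 was chosen by trial and error (see the discussion below)
boxes = gtf.Predict("test_image_1.jpg", vis_thresh=0.3, output_img="output.jpg")
```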
Test image-1
Test image-2
After multiple rounds of trial and error, the detection threshold was set to 0.3.
For thresholds below 0.3, multiple overlapping boxes appear around the same object, while for thresholds above 0.3 the detector struggles to detect cars.
Since these are satellite images, a car covers far fewer pixels than a pool, so more features are available for pools.
Conclusion
To conclude, our task was done in very few lines of code. We have demonstrated only one pipeline in this article, but Monk AI has a total of seven such pipelines, from GluonCV to YOLOv3. All in all, Monk AI is a great library that makes working with such computer vision tasks pretty easy.
All the code shown in this article resides in this Colab notebook.
You can find many more examples of detection and segmentation in the application model zoo.
Thanks for reading! I hope you find this article informative and useful. Do share your feedback in the comments section!
References
1. CornerNet: https://arxiv.org/abs/1808.01244
2. Monk AI: https://github.com/Tessellate-Imaging/Monk_Object_Detection
3. Kaggle dataset: https://www.kaggle.com/kbhartiya83/swimming-pool-and-car-detection
4. Downloading a Kaggle dataset on Google Colab: https://www.kaggle.com/general/74235