Image Classification with Neural Network

Last Updated on July 26, 2023 by Editorial Team

Author(s): Mugunthan

Originally published on Towards AI.

Deep Learning

A feedforward neural network is an artificial neural network wherein connections between the nodes do not form a cycle. As such, it is different from its descendant: recurrent neural networks.

The feedforward neural network was the first and simplest type of artificial neural network devised. In this network, the information moves in only one direction, forward, from the input nodes, through the hidden nodes (if any), and to the output nodes. There are no cycles or loops in the network.

— reference from Wikipedia

A convolutional neural network (CNN) is a deep learning model that takes images as input, learns the objects and features in them, and uses those features to tell images apart. In earlier methods, filters were hand-engineered and then applied to images. With a CNN, the filters are learned from the images, given enough training.

Why CNN over feedforward neural network?

An image is nothing but a matrix, so why not flatten it into a 1D array and feed it into a feedforward network?

Let me explain with an example. Consider an image with spatial dimensions (64 x 64 x 3), which flattens to a vector of dimension (12288 x 1).

Now even if the first layer of the model contains only 10 neurons, it needs 10*12288 = 122880 learnable weights (plus 10 biases) for that single layer. The parameter count grows with the image size and with every neuron added, and the amount of computation grows with it.
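The arithmetic above can be checked with a few lines of Python. This is only a sketch: the helper name `dense_param_count` is made up for illustration, and it assumes a plain fully connected layer with one weight per input per neuron and one optional bias per neuron.

```python
def dense_param_count(n_inputs: int, n_neurons: int, use_bias: bool = True) -> int:
    """Learnable parameters in one fully connected layer (hypothetical helper)."""
    weights = n_inputs * n_neurons          # one weight per input, per neuron
    biases = n_neurons if use_bias else 0   # one bias per neuron
    return weights + biases


# The 64 x 64 x 3 image from the example, flattened into a vector.
height, width, channels = 64, 64, 3
flattened_size = height * width * channels

weights_only = dense_param_count(flattened_size, 10, use_bias=False)
with_biases = dense_param_count(flattened_size, 10)

print(flattened_size)  # 12288
print(weights_only)    # 122880
print(with_biases)     # 122890
```

Doubling the image side to 128 x 128 x 3 would quadruple the weight count for the same 10 neurons, which is the scaling problem the article is pointing at.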

In the case of a CNN, the image is fed in as is, i.e. (64 x 64 x 3). A convolution layer is a set of filters that are convolved over the input volume.
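To make "a filter convolved over the input volume" concrete, here is a minimal pure-Python sketch of one filter sliding over a small image. It is not how a real framework implements the operation: it assumes stride 1 and no padding, uses nested lists instead of tensors, and, like most deep learning libraries, does not flip the kernel (strictly speaking this is cross-correlation). The function name `convolve_volume` and the toy data are made up for illustration.

```python
def convolve_volume(image, kernel):
    """Slide one filter over an H x W x C image (stride 1, no padding).

    image  : nested lists with shape [H][W][C]
    kernel : nested lists with shape [F][F][C]
    Returns a 2D feature map with shape [H - F + 1][W - F + 1].
    """
    h, w, c = len(image), len(image[0]), len(image[0][0])
    f = len(kernel)
    feature_map = []
    for i in range(h - f + 1):          # top-left row of the current patch
        row = []
        for j in range(w - f + 1):      # top-left column of the current patch
            total = 0.0
            # Multiply the patch by the filter element-wise and sum everything,
            # across all channels, into a single output value.
            for di in range(f):
                for dj in range(f):
                    for ch in range(c):
                        total += image[i + di][j + dj][ch] * kernel[di][dj][ch]
            row.append(total)
        feature_map.append(row)
    return feature_map


# Toy 4 x 4 x 3 image: every channel of pixel (i, j) holds the value i + j.
image = [[[float(i + j)] * 3 for j in range(4)] for i in range(4)]
# One 2 x 2 x 3 filter of ones, so each output is the sum of a 2 x 2 x 3 patch.
kernel = [[[1.0] * 3 for _ in range(2)] for _ in range(2)]

feature_map = convolve_volume(image, kernel)
print(len(feature_map), len(feature_map[0]))  # 3 3
print(feature_map[0][0])                      # 12.0
```

Each filter produces one 2D feature map; a layer with several filters stacks their maps along the depth axis of its output.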

These filters, or kernel matrices, are the learnable parameters, so they are updated on each training iteration over the dataset. So how does this reduce the number of learnable parameters? Isn't it still the same number?

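The answer is no! The weights of a filter are shared: the same small filter is applied at every spatial position of the input, so its weights are learned once and reused across the whole image. The parameter count therefore depends on the filter size and the number of filters, not on the image size, and it is far smaller than in the fully connected case.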
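A rough parameter count shows the difference. The layer shape used here (10 filters of size 3 x 3) is an assumption chosen only to mirror the 10-neuron example, and `conv_param_count` is a hypothetical helper, not a library function.

```python
def conv_param_count(filter_size: int, in_channels: int, n_filters: int,
                     use_bias: bool = True) -> int:
    """Learnable parameters in one convolution layer (hypothetical helper)."""
    weights_per_filter = filter_size * filter_size * in_channels
    bias_per_filter = 1 if use_bias else 0
    return n_filters * (weights_per_filter + bias_per_filter)


# Assumed layer: 10 filters of size 3 x 3 over a 3-channel (RGB) input.
conv_params = conv_param_count(filter_size=3, in_channels=3, n_filters=10)

# Fully connected layer from the earlier example: 12288 inputs, 10 neurons.
dense_params = 64 * 64 * 3 * 10 + 10

print(conv_params)   # 280
print(dense_params)  # 122890
```

Note that the image height and width never appear in `conv_param_count`: the same 280 parameters would serve a 64 x 64 image or a 512 x 512 one.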

So how can we compute the dimensions of the output?

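The output size along each spatial dimension is (W - F + 2P) / S + 1, where W is the dimension of the input image, F is the filter dimension, P is the padding, and S is the stride.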

The stride S controls the size of the step by which the filter is moving. The padding P controls the size of the output by adding zeros (or other values) to the border of the input.
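With the symbols W, F, P, and S defined above, the commonly used relation for the output size along one spatial dimension is (W - F + 2P) / S + 1. The sketch below wraps it in a small function. The name `conv_output_size` is made up for illustration, and rounding down when the division is not exact is an assumption that matches how most frameworks behave.

```python
def conv_output_size(w: int, f: int, p: int = 0, s: int = 1) -> int:
    """Output size along one spatial dimension: (W - F + 2P) / S + 1."""
    if s < 1:
        raise ValueError("stride must be at least 1")
    span = w - f + 2 * p
    if span < 0:
        raise ValueError("filter is larger than the padded input")
    # Floor division: positions where the filter would overhang are dropped.
    return span // s + 1


# 64-pixel input with a 3 x 3 filter under a few padding/stride settings.
print(conv_output_size(64, 3, p=0, s=1))  # 62
print(conv_output_size(64, 3, p=1, s=1))  # 64
print(conv_output_size(64, 3, p=1, s=2))  # 32
```

With P = 1 and S = 1 a 3 x 3 filter preserves the input size, while a stride of 2 roughly halves it.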

Convolutional neural networks can easily have millions of learnable parameters, depending on how many layers and filters they use. This is one of the reasons CNNs are hard to run on mobile devices, which can lack the computational power.

Transfer learning helps here: it lets you start from a pre-trained model and adapt it to your task instead of training the entire model from scratch.

TensorFlow has an Object Detection API and a model zoo to help with detection and classification problems.

Published via Towards AI

models/research/object_detection at master · tensorflow/models

Creating accurate machine learning models capable of localizing and identifying multiple objects in a single image…

models/tf2_detection_zoo.md at master · tensorflow/models

We provide a collection of detection models pre-trained on the COCO 2017 dataset. These models can be useful for…
