Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take our 85+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Publication

Statistics 101- Part 2- Probability Distributions, Types, and Applications
Latest

Statistics 101- Part 2- Probability Distributions, Types, and Applications

Last Updated on September 25, 2022 by Editorial Team

Author(s): Kumar kaushal

Originally published on Towards AI the World’s Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses.

Definition of the probability distribution, different types of distributions, their explanation, and applications

Photo by Naser Tamimi onΒ Unsplash

This article is in continuation of Statistics 101-Part 1 article. Here, we will discuss probability distributions, their types, and applications. Probability distributions are building blocks of business decisions based on statistical analysis, machine learning, and others. Hence, understanding the concepts related to probability distribution isΒ vital.

Contents

  • Probability distribution definition and important concepts
  • Types of probability distributions
  • Choosing the right distribution
  • Applications

Probability distribution definition and important concepts

The probability distribution is related to frequency distribution. When actually conducting an experiment and observing frequencies of all outcomes, it is called frequency distribution. Whereas, when we list probabilities of all possible outcomes that could take place if an experiment was done, it is called probability distribution.

let us suppose that we have some number of defects and their probability of happening recorded as shown in the belowΒ table:

Image by theΒ author

The probability distribution of the above example would be as shownΒ here:

Image byΒ author

Another important concept to know is Random Variables. Random variables are variables that are assigned different values basis a random experiment. It changes from occurrence to occurrence with no pattern that can be predicted.

Types of probability distributions

We can broadly classify probability distributions into two categories- discrete probability distribution and continuous probability distribution.

Discrete probability distribution

If we are considering discrete variables in the probability distribution, then it would be a discrete probability distribution, and the values taken are only a few possible values. For example, the probability of the outcome of a number in a dice would be a discrete probability because outcomes can only have six possible values. A function defining a discrete probability distribution is called a Probability Mass FunctionΒ (PMF).

A Binomial Distribution is an example of a discrete probability distribution for a discrete random variable. It is defined as shownΒ below:

Image byΒ author

where p=probabilty of success, q=1-p=probability of failure, r = number of success required, and n= number of trials. Conditions for using a Binomial distribution are mentioned below:

  • Each trial should have only two outcomes ( example- yes or no, pass orΒ fail)
  • The probability of the outcome of a trial remains constant over time. For example, the probability of a head in a fair coin’s toss will be constant, i.e.,Β 0.5
  • The trials are independent of each other, i.e., the outcome of a trial will not depend on the outcome of anotherΒ trial.

Another well-known discrete probability distribution is the Poisson distribution. This distribution is defined as per the below equation:

Image byΒ Author

where lambda= mean number of occurrences perΒ interval

If we are interested in knowing the probability of x occurrences, the below conditions should be met in the case of Poisson distribution:

  • The probability of exactly one occurrence in one interval is a very small number and isΒ constant
  • The probability of two or more occurrences in one interval can be assigned a zeroΒ value
  • The number of occurrences per interval is independent ofΒ time
  • The number of occurrences per interval is independent of the number of occurrences in other time intervals.

Another example of the discrete probability distribution is the discrete uniform distribution.

Continuous probability distribution

Variables considered under continuous probability distribution can take any value within a range of values. For example, the probability distribution of heights of a region’s population would be a continuous probability distribution, as heights a person can take anyΒ value.

A function defining a continuous probability distribution is called a Probability Density Function (PDF). A well know continuous probability distribution is a Gaussian distribution, also referred to as a Normal distribution.

Image byΒ author
Representation of Normal Distribution ( Image byΒ author)

The normal distribution should have the following characteristics:

  • The distribution should be unimodal i.e., it must have only a singleΒ peak
  • The mean for the curve should beΒ centered
  • Mean should be equal to mode andΒ median
  • The two tails of the distribution should extend indefinitely and never touch the horizontal axis

An exponential distribution is defined as per the below equation:

Image byΒ author

where lambda is called a rate parameter

An example of Exponential distribution (image byΒ author)

In the first article of this series, we have seen the case of t-distribution, also known as Student’s distribution. This distribution is used in cases where the sample size is less than 30, and the population standard deviation is not known. Like a normal distribution, this distribution, too, is symmetrical but is a bit flatter than a normal distribution. There are different t-distributions for every possible sample size, depending on the degrees of freedom. As the sample size increases, it becomes close to a normal distribution.

Examples of other continuous probability distributions are log-normal, continuous uniform, Chi-Square distribution, andΒ others.

Choosing the right distribution

It is critical that the right distribution is selected for the application. We must be able to categorize if the distribution to be used is discrete or continuous and, subsequently, if condition(s) for a specific distribution is being met. For example, in order to be able to use a Binomial distribution, the conditions listed in the section on β€œtypes of probability distributions- Discrete probability distribution” must be met. Similarly, if we want to use a normal distribution, the conditions for such distribution must beΒ met.

Applications

Probability distributions are widely used in various applications. We have to understand the use case and the prevailing conditions to apply a specific distribution and arrive at the probability of an event of interest.

If a business manager is attempting to estimate the probability of x number of defects from a pool of n components, this may be a use case of Binomial distribution.

If a clinic’s manager wants to calculate the number of patient arrivals in a day, Poisson distribution may be used for thisΒ case.

The income distribution of a country can be represented as a normal distribution, provided meeting the required conditions.

We can use Chi-Square distribution in scenarios where we want to test the independence of two parameters, such as marital status and education level, or gender andΒ salary.

Hence, there are numerous examples to show that probability distributions can be applied for specificΒ usages.

Follow me (kumarkaushal.bit) for more interesting topics related to Data Science and Statistics.


Statistics 101- Part 2- Probability Distributions, Types, and Applications was originally published in Towards AI on Medium, where people are continuing the conversation by highlighting and responding to this story.

Join thousands of data leaders on the AI newsletter. It’s free, we don’t spam, and we never share your email address. Keep up to date with the latest work in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.

Published via Towards AI

Feedback ↓