Different Probability Distributions Part 2
Last Updated on December 7, 2021 by Editorial Team
Author(s): Priyansh Tripathi
Originally published on Towards AI the World’s Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses.
Now we will see the Continuous variable distributions whereas in part 1 we saw the discrete distributions. In continuous distributions the point probability is equal to “0” and some of the probabilities over the entire range will be “1”. Different types of Continuous Distributions are:
1. Uniform Distribution
2. Normal Distribution
3. Exponential Distribution
4. Chi-squared Distribution
5. Gamma distribution
6. Student T-Distribution
8. Log-Normal Distribution
In a uniform distribution or rectangular distribution, data is uniformly distributed over a given interval where X takes a value between 2 specified values (a, b).
a<X<b or X€(a, b) where X~U(a, b)
PDF=f(x)=1/(b-a) when a<X<b or 0 otherwise.
The total area under the curve would be 1.
CDF=f(x)=x-a/(b-a) when a<X<b and 1 when x>=b
Mean=(b + a)/2
Here the data is distributed in a symmetrical manner or gaussian manner in a bell-shaped curve equally divided into 2 by mean. The total area under the curve is 1. The change in mean will change the graph to shift left or right. The graph ends never touches the baseline (asymptotes).
To solve the question we have to convert this distribution to a Standard Normal Distribution. It has all the properties of Normal Distribution, where μ is 0 and σ² is 1.
Here ƶ is a standard normal variant which is equal to (x-μ)/σ.
ƶ ~ SND(0, 1)
Whenever we change the σ the graph will become wider or thinner. We will calculate the z-score which means how far we are from the mean and we will get the probability value from the z-score table.
The exponential distribution is used to model the time until some specific independent event occurs at a constant average rate.
X is the waiting time or time taken by the event to occur.
λ is the rate at which events occur.
When we square the standard normal variant we will get the ?². It is widely used in hypothesis testing and in finding confidence intervals which we will see in further articles. Here we have degrees of freedom which is “k” we can find it by subtracting 1 from the number of features available(n).
?² €(0,∞) and ?~N(μ,σ²). As we increase the k our graph will be more normalized.
For CDF we need to have the knowledge of Gamma Distribution because the Chi-squared Distribution is a special case of Gamma Distribution.
This Distribution is mostly used for modeling the waiting time until an event occurs. Gamma, Exponential, Poisson distribution is the same aspects of the Poisson process. Gamma function and Gamma Distribution are 2 different concepts.
Here we have 2 parameters a shape parameter α and a scale parameter β. When we divide 1 by β we will have a rate parameter.
A small sample is taken from the population(which is normally distributed) to get the estimates about the population and we also don’t know the standard deviation of the population. It is used in assessing the statistical significance, constructing confidence intervals, and in linear regression analysis.
Variance=N/(N-2) where N>2 and N/(N-1) where N≥1.
F-Distribution is frequently used in the analysis of variance.
Whenever we have a skewed curve and to get a Gaussian curve we take its log so we get a normal curve then this distribution is called Log-Normal Distribution.
We have discussed both types of distribution from a data science perspective the knowledge about distribution one should have will be enough for the further topics after reading this article. Check the other articles too for better understanding.
Different Probability Distributions Part 2 was originally published in Towards AI on Medium, where people are continuing the conversation by highlighting and responding to this story.
Join thousands of data leaders on the AI newsletter. It’s free, we don’t spam, and we never share your email address. Keep up to date with the latest work in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.
Published via Towards AI
where is part 1 ?