


Minimizing the Mean Square Error: Frequentist Approach

Last Updated on June 4, 2024 by Editorial Team

Author(s): Varun Nakra

Originally published on Towards AI.

Statistical Inference and Mean Square Error

In a utopian world, we would have access to unlimited data, i.e., the entire population, so there would be no need for 'inference': we would know with certainty whatever we are interested in. In all practical settings, however, we don't have access to the 'population' but only to observations that comprise a sample. This creates the need to make 'inferences' about the population using the sample. Statistical inference is the process of drawing conclusions about one or more parameters (population characteristics), and point estimation is the selection of a single number, based on sample data, that represents a sensible value for those parameters. This gives rise to two concepts — the true value of the parameter, the 'ground truth', which is hidden from us because we don't have access to the population, and the point estimate of the parameter.

The Frequentist considers the true value of the parameter θ as fixed but unknown. The Bayesian considers the true value of the parameter θ as a realization of a random variable Θ; the random variable itself is never observed by us, only this single realized value.

Again, under ideal settings, we could find an estimator that is exactly equal to the true value of the parameter, always! However, since the estimate of the true value of the parameter is a function of the sample, it is itself a random variable: different samples would result in different values of the estimate. For some samples, the true parameter will be overestimated, and for others underestimated. This leads us to the idea of an 'error' in our estimation.

Now, there are multiple ways, such as the squared error, the absolute error, etc., in which the error of estimation mentioned in 1 can be quantified. However, we will stick to the squared error in this article and explore other 'error functions' in related articles in the future.

It makes sense to β€˜average’ the squared error defined in 2 to measure the performance of the estimator on an β€˜average’. This gives rise to the concept of β€˜Mean Square Error’, which is defined as follows

Minimizing Mean Square Error: Frequentist approach

As mentioned above, under the frequentist approach, we will consider the true parameter as fixed but unknown and attempt to minimize the mean square error.

Consider a simple example of estimating the population mean using the sample mean for n observations x1, . . . , xn. The sample mean is the estimator of the population mean and is defined as follows
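The definition appeared as an image in the original; a standard reconstruction, assuming the observations are i.i.d. with mean θ and variance σ², is:

```latex
\bar{x} \;=\; \frac{1}{n}\sum_{i=1}^{n} x_i,
\qquad
\mathbb{E}[\bar{x}] = \theta,
\qquad
\operatorname{Var}(\bar{x}) = \frac{\sigma^2}{n}
```

Since E[x̄] = θ, the sample mean is unbiased, so its mean square error equals its variance, σ²/n.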

Instead of using 4 directly, we use a modified version of the sample mean, scaled by a constant k. The objective is to solve for the optimal value of the constant for which the mean square error is minimized.
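The intermediate equations (through the result referenced as 11 below) were images in the original; a reconstruction of the derivation, under the same i.i.d. assumptions (mean θ, variance σ²), is:

```latex
\hat{\theta}_k = k\,\bar{x},
\qquad
\mathbb{E}[\hat{\theta}_k] = k\theta,
\qquad
\operatorname{Var}(\hat{\theta}_k) = \frac{k^2\sigma^2}{n}
```

so that, by the bias–variance decomposition,

```latex
\mathrm{MSE}(k) \;=\; \frac{k^2\sigma^2}{n} \;+\; (k-1)^2\theta^2 .
```

Setting the derivative with respect to k to zero,

```latex
\frac{d\,\mathrm{MSE}}{dk} \;=\; \frac{2k\sigma^2}{n} + 2(k-1)\theta^2 \;=\; 0
\quad\Longrightarrow\quad
k^{\ast} \;=\; \frac{\theta^2}{\theta^2 + \sigma^2/n}.
```

Note that 0 < k* < 1, so the optimal estimator shrinks the sample mean toward zero, and k* → 1 as n → ∞.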

As evident from 11, the optimal value of k depends on the unknown parameter θ. In other words, the minimum mean square error estimator is not realizable, because it depends on the unknown θ through the bias term. Therefore, finding the minimum mean square error estimator becomes an unrealizable task for this simple example, and also for most practical cases. It is for this very reason that we constrain the bias to zero and find the estimator that minimizes the variance, leading to the concept of the Minimum Variance Unbiased Estimator (MVUE). Unfortunately, however, even the MVUE does not always exist. Are there alternative ways of finding estimators with minimum variance and minimum bias? Stay tuned for the answer in the sequel article 'Minimizing the Mean Square Error: Bayesian approach'.
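A quick simulation makes the trade-off concrete. The sketch below (illustrative values: θ = 2, σ = 3, n = 10, all chosen by me, not from the article) compares the empirical MSE of the plain sample mean with that of the shrunken estimator k*·x̄, where k* uses the true θ — an 'oracle' we would not have in practice, which is exactly why the estimator is not realizable:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative (assumed) values: true mean, std dev, sample size, Monte Carlo trials
theta, sigma, n, trials = 2.0, 3.0, 10, 100_000

# Oracle shrinkage factor k* = theta^2 / (theta^2 + sigma^2 / n) -- uses the unknown theta
k_star = theta**2 / (theta**2 + sigma**2 / n)

# Draw `trials` independent samples of size n and compute each sample mean
samples = rng.normal(theta, sigma, size=(trials, n))
xbar = samples.mean(axis=1)

mse_mean = np.mean((xbar - theta) ** 2)             # plain sample mean; theory: sigma^2/n = 0.9
mse_shrunk = np.mean((k_star * xbar - theta) ** 2)  # shrunken estimator; theory: k* * sigma^2/n

print(f"k*           = {k_star:.4f}")
print(f"MSE(xbar)    = {mse_mean:.4f}")
print(f"MSE(k* xbar) = {mse_shrunk:.4f}")
```

The shrunken estimator trades a little bias for a larger reduction in variance, so its MSE comes out below σ²/n; but since computing k* requires knowing θ, this gain is unattainable, which motivates restricting attention to unbiased estimators as described above.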


