Open Access Open Access  Restricted Access Subscription or Fee Access

High-Dimensional Data and the Bias Variance Tradeoff in Model Selection

Eligo Workineh Menna

Abstract



The exploration of function estimation from a specific dataset was primarily theoretical until the 1990s, despite the development of statistical learning theory in the late 1960s. This shift allowed statistical learning theory to be applied in practical multidimensional function estimation methods alongside theoretical evaluations. Statistical learning represents a contemporary approach to machine learning that employs mathematical models, algorithms, and statistical techniques to detect patterns and make predictions from large datasets. The extensive use of statistical learning in fields ranging from genetics and biology to economics and social datasets is clear from a thorough examination of the literature. This methodology has recently been used in a number of disciplines, including computer science, biology, finance, and social science. Model selection is one of the main obstacles to solving each of these studies' problems. Because the influence of bias-variance tradeoff in model selection plays a vital role especially in analyzing high-dimensional data. This comprehensive scientific review aims to investigate the role of the bias-variance tradeoff in model selection for high-dimensional datasets. We examine the concepts of bias and variance, their interrelationship, and their effects on model performance regarding prediction accuracy and generalization. Additionally, we highlight numerous strategies and techniques used to balance the bias-variance tradeoff during model selection. This review provides insights into the importance of understanding and managing the bias-variance tradeoff for effective model selection in high-dimensional data analysis.

Keywords


Biasvariance trade-off, Model selection, Regularization, Cross-validation, Ensemble Methods, Statistical Inference

Full Text:

PDF


Disclaimer/Regarding indexing issue:

We have provided the online access of all issues and papers to the indexing agencies (as given on journal web site). It’s depend on indexing agencies when, how and what manner they can index or not. Hence, we like to inform that on the basis of earlier indexing, we can’t predict the today or future indexing policy of third party (i.e. indexing agencies) as they have right to discontinue any journal at any time without prior information to the journal. So, please neither sends any question nor expects any answer from us on the behalf of third party i.e. indexing agencies.Hence, we will not issue any certificate or letter for indexing issue. Our role is just to provide the online access to them. So we do properly this and one can visit indexing agencies website to get the authentic information. Also: DOI is paid service which provided by a third party. We never mentioned that we go for this for our any journal. However, journal have no objection if author go directly for this paid DOI service.