Overview of Regression Models and How to Determine the Best Model for Data

Latha, N Madhavi and Geetha, K. and Damodharan, S (2024) Overview of Regression Models and How to Determine the Best Model for Data. Journal of Scientific Research and Reports, 30 (10). pp. 250-266. ISSN 2320-0227

[thumbnail of Damodharan30102024JSRR123452.pdf] Text
Damodharan30102024JSRR123452.pdf - Published Version

Download (442kB)

Abstract

The selection of the appropriate regression model for your data is an essential stage that has a significant impact on the precision and interpretability of your analytical process. The purpose of regression models is to investigate the connections that exist between a dependent variable and one or more independent variables from a statistical perspective. The process of selection starts with gaining an awareness of the many kinds of regression models that are accessible. These models include linear, polynomial, and logistic regression, among others. Each of these models is suitable for a different kind of data and a different kind of connection. In order to reduce the number of possibilities, it is helpful to do an analysis of the features of your data, which may include the existence of outliers, multicollinearity, and distribution. Furthermore, each m0odel is accompanied by a set of unique assumptions, such as the linearity and normalcy characteristics that are necessary for linear regression. In order to get findings that can be relied upon, it is essential to verify that these assumptions are correct; if they are not, different models such as generalized linear models would be required. In order to prevent overfitting, which occurs when the model captures noise rather than the actual data structure, it is essential to strike a balance between the complexities of the model. When it comes to making this judgment, methods such as cross-validation might be of assistance. In conclusion, it is important to take into consideration the trade-off between interpretability and predictive strength. Models that are simpler, such as linear regression, are simpler to explain, but models that are more complicated might produce better forecasts. You will be able to pick the regression model that is the most suitable for your data if you give careful consideration to these aspects, which will result in insights that are both robust and relevant. Selecting regression types depends on data characteristics: linear for trends, logistic for probabilities, and polynomial for complex curves. Proper pre-processing ensures accurate model outcomes.

Item Type: Article
Subjects: OA Library Press > Multidisciplinary
Depositing User: Unnamed user with email support@oalibrarypress.com
Date Deposited: 27 Sep 2024 08:14
Last Modified: 27 Sep 2024 08:14
URI: http://archive.submissionwrite.com/id/eprint/1582

Actions (login required)

View Item
View Item