ndCurveMaster

The "Statistics" window

The Statistics window presents statistical analysis for each model from the model equation window.

the Statistics window

Statistical parameters presented in this window are calculated as follows:

The "VIF" column in the regression analysis table presents the variance inflation factor (VIF):

VIF

The formula for calculating VIF is as follows:
VIFi = 1 / (1 - R Squarei)

where:

The VIF (Variance Inflation Factor) is commonly used for detection of multicollinearity (more details can be found here). When the VIF value exceeds 5, it indicates a high degree of collinearity. In such cases, predictors with VIF values surpassing the threshold of 5 are marked in red.

The "SA %" column presents the results of the sensitivity analysis:

VIF

Sensitivity analysis is a technique used in data analysis and machine learning to assess the impact of individual input variables on forecast outcomes. It allows for the identification of input variables that are critical to prediction accuracy, as well as those that can be omitted without quality loss, especially when other significance measures, such as t-value, cannot be used. This situation occurs when the distribution of model residuals is not normal, making inference difficult.

The ndCurveMaster program determines the importance of a given variable based on the ratio of the estimation error (RMSE) for the model with the omitted variable (RMSEo) to the estimation error for the model with all input variables (RMSEf), as follows:

SA = (RMSEo/RMSEf - 1) * 100

where:

A higher value of this coefficient indicates a greater impact of the given variable on the prediction outcome. The removed variable causing the largest increase in error achieves the highest RMSER value and can be considered the most significant. Based on SA values, a ranking of variables by importance can be developed, and non-significant variables can be eliminated.

Unacceptable values of statistics such as t-value, VIF or SA are displayed in red color. In such cases, the "Recommendation" column shows the message "suggested removal," which means that the predictor is insignificant.

The highest t-value values and the lowest p-values are displayed in blue color. This indicates that the predictor is the most significant among the others.

Unacceptable RRMSE values are marked in red while acceptable values are shown in green

Blocked predictors are displayed in italics.

If all equations in the model are significant, the "Auto Reduce" option is not available.

Back to Contents