Introduction
The time series of climate variables depends on chaos and heterogeneity, which is the reason for the emergence of the problem of non-linearity and uncertainty. Time series is defined as a set of observations with a certain time sequence in which each observation depends on its predecessors, which generates a driving force used in forecasting the future of the studied phenomenon (1). The non-linearity of minimum temperature data makes forecasting a complex process and may also not lead to accurate forecasts. Some researchers have proposed to use the autoregressive model and integral moving averages (ARIMA), which is a traditional statistical method for forecasting single-variable time series data. It is possible to obtain the appropriate ARIMA model for the minimum temperature by following the methodology proposed by Box-Jenkins (2). The inaccuracy of the ARIMA models ' forecast is caused by the fact that it is a linear time series model (3) whereas the temperature data is nonlinear data. To forecast time series, past and current values are used to forecast future values (4-6).
The state space (SS) was used based on the ARIMA model, which is called the ARIMA-SS hybrid model, to deal with random uncertainty caused by the non-linearity of minimum temperature data . The forecasting methods represented by ARIMA model and ARIMA-SS hybrid model (7) were used to forecast the minimum temperature of the cities of Mosul and Baghdad, since the data of the minimum temperature of Mosul and Baghdad are heterogeneous, therefore, the time stratified (TS) was used[8] to align the data in two seasons (the hot season and the cold season) to obtain more homogeneous data for forecasting the minimum temperatures for the daily data for the period (1/11/2011-31/10/2022). The autocorrelation function (ACF) and the partial autocorrelation function (PACF) were used to determine the ARIMA model in addition to plotting the time series to diagnose the stationarity of the time series and determine the rank of the model .From watching the results obtained after applying the double seasonal ARIMA model and the ARIMA-SS model. The results showed the superiority of the ARIMA-SS hybrid model over the doubled seasonal ARIMA model because it was more accurate in forecasting small temperatures based on the Mean Squared Error (MSE) scale (8). The autocorrelation function (ACF) and the partial autocorrelation function (PACF) were used to determine the ARIMA model in addition to plotting the time series to diagnose the stationarity of the time series and determine the rank of the model .
From watching the results obtained after applying the double seasonal ARIMA model and the ARIMA-SS model. The results showed the superiority of the ARIMA-SS hybrid model over the doubled seasonal ARIMA model because it was more accurate in forecasting minimum temperatures based on the Mean Squared Error (MSE) scale. Minimum-temperature data is classified within the interval variables and not within the relative variables Ratio variables, because zero in them is not absolute and does not represent the absence of minimum temperature, therefore parametric tests can be used provided that the data distribution is normal or the sample size is large enough (minimum 30), according to the central limit theory.
The researchers (9) presented a non-Gaussian homogeneity methodology for analyzing time series. This method is based on the modeling of a non-Gaussian state space and is especially relevant for time series that cannot be satisfactorily analyzed by conventional time series models. The researchers (7) used the SS model to forecast the wind speed, which is non-linear, which leads to uncertainty. To obtain the best initial parameters of the SS model, AR will be used to create the SS model structure to handle random uncertainty and improve forecasting. The researchers (10) using state space models as a modeling framework for analyzing environmental time series data. It is commonly used to model population dynamics including population metabolic dynamics. it has a long history in fishery stock assessment and has recently been used as a means of analyzing sparse Biodiversity data moreover, it has been a preferred approach in the movement environment for more than a decade and has been increasingly used with biological registration.
Materials and methods
Data and framework of this study
In this study, the forecasting methods represented by the ARIMA model and the ARIMA-SS hybrid model were used to forecast the minimum temperature of Mosul and Baghdad, since the data of the minimum temperature of Mosul and Baghdad are heterogeneous, and the data were aligned in two seasons (hot season and cold season) to obtain more homogeneous data to forecast the minimum temperatures for daily data for the period (1/11/2011-31/10/2022). The framework includes the following paragraphs.
- Identify the appropriate ARIMA model using the Box-Jenkins methodology.
- Build the ARIMA-SS hybrid model to use in improving forecasts.
- Compare the methods used to determine which model will provide the best forecasts
Autoregressive integrated moving average (ARIMA) model (11, 12)
The ARIMA (p,d,q) model is one of the most prominent nonstationary time series, it can be expressed as as follows:
, Are the parameters of the autoregression and moving averege at step k, respectively, and reflect the effect of changing the time series variable ( ) and the random variable ( ) at step k, respectively, p indicates the rank of the usual self-regression model, q stands for the rank of the usual moving circles, d represents the number of usual differences necessary to achieve stationarity, B is the back-shift operator, is the White Noise with zero arithmetic mean and variance , that is also:
The model AR (p) and the model MA (q) are special cases of ARIMA (p,d,q). AR (p) can be expressed in the form (ARIMA (p,0,0), and MA (q) can be expressed in the form ARIMA (0, 0,q). Seasonal time series data can be generalized and written as a multiplicative seasonal ARIMA model or ARIMA (p,d,q) (P,D,Q)s (4, 13). It includes seasonal and non-seasonal parameters and their differences in general as follows.
where , , and .
and are the seasonal parameters of the autoregressive and the moving averages at step k, respectively, P indicate the rank of the seasonal autoregressive model, Q stands for the rank of the seasonal moving media, D represents the number of seasonal differences necessary to achieve stationarity, s represents the seasonal period.
We can build the ARIMA model following the Box-Jenkins methodology (14), which is an iterative methodology that contains four consecutive steps: recognition, estimation, diagnostic examination and forecasting, which is the most famous and important application of time series. Based on time series observations, we can forecast future observations. Accurate forecasting plays an important role in various applicable areas based on time intervals. Recognition is the first step in the Box-Jenkins methodology and involves stationaries of mean and variance of time series data. Stationarity is a fundamental concept in time series, which indicates that the common probability is not affected by the temporal displacement of all observations backward or forward by an amount of K time interval. The first step in analyzing time series is to plot the time series data and verify the stationarity of the series. After performing the first step of the Box-Jenkins methodology and determining the time series model, the second step that follows is to estimate the parameters of the time series model using the Maximum Likelihood Estimation (MLE) method, and then the third step, which is to verify the validity of the estimated model, the most prominent diagnostic tests is to test the significance of the parameters and ACF of the residuals series. The parameters must be significant using the t - test. The remainder series are independent random variables with similar distributions . The estimated time series model after passing the previous three steps is ready to forecast the time series .After plotting the time series data and achieving stationarity the mean and variance as mentioned earlier, determining the p and q ranks is the most important procedure for determining the model. The use of ACF and PACF is useful for determining the appropriate time series model and the P and Q ranks as shown in Table 1.
Table 1. ACF and PACF patterns according to time series models
|
Model
|
ACF
|
PACF
|
|
AR (p)
|
Gradually decays
|
Suddenly breaks after lag p
|
|
MA (q)
|
Suddenly breaks after lag q
|
Gradually decays
|
|
ARIMA (p,q)
|
Gradually decays but goes to zero after lag q
|
Gradually decays but goes to zero after lag p
|
State Space (SS) model
State space models are commonly used in the modeling framework for analyzing time series data because they directly represent the automatic temporal correlation. The SS model is a kind of pyramid model (15, 16). The use of the ARIMA model as a linear statistical model for modeling nonlinear time series data will lead to the presence of random uncertainty (17), which reduces the accuracy of the forecast. To address random uncertainty, the SS method is used for more accurate forecasting due to its good performance in forecasting time series (18-20).
The SS method can be presented as a statistical method for estimating and forecasting unmeasured state space equations. The SS method is to combine observations with modern forecasts values by using weights that reduce biases and errors. The state equation of State (SE) and the observation equation of State (OE) form an equation system that can be called the linear model of the state space and can be written as:
is a state vector of dimension m, is a finite input vector, is a vector of output observations, and are independent white noise vectors, A, B and C are constant matrices. Based on the ARIMA model in equation(6), the SE equation, which is equation(7), and the OE equation, which is equation(8), can be written in the state space, respectively, in the following form (21)
g is the number of boundaries of the delayed series of the variable , j is the number of boundaries of the delayed remainder series , is the state vector with Dimension r , is the vector of the delayed residuals series , is the rounded vectors of the current residuals.
, ,
Matrix A is the state transition matrix with Dimension ( )
The Matrix B is the commutative matrix of dimension ( )
The matrix C is the transition matrix of the observations by Dimension ( ), are the observed cyclic vectors representing the SS output series, are the cyclic vectors of the current remainders, and are the parameter values of the delayed series of the variable and the delayed series of residuals respectively in the ARIMA model equation. Equation(9) and equation(10) are complex in application and for simplification they can be reformulated as follows (4, 22).
Is the state vector in dimension m, A the state transition matrix in dimension (m x m), C the observation transition matrix in dimension (1 x m), the observed cyclic vectors representing the SS output Series, m is the number of the boundaries of the delayed series of the variable and the sum of the series of delayed residuals of the variable on the right-hand side of the ARIMA data model equation after simplifying it and keeping only on the left-hand side.
Is all the parameter values of the delayed series of the variable and all the delayed remainder series of the variable on the right side of the ARIMA data model equation after simplifying it and keeping only on the left side. The Matrix A, the row vector C and the variables on the right-hand side of the ARIMA data model equation will determine the input variables and the SS structure used in this research. The time stratified method (TS) was used, which is an analytical method that aligns the data chronologically according to seasonal influences, which are clearly manifested as influences on the behavior of the time series and the behavior of forecasting results. Time Stratified can be applied to different time series in the event that they include recurring seasonal time trends with the same context and influence and works to reach more homogeneous data than the aggregate data and thus obtain more accurate results (23, 24). The mean squared error (MSE) is one of the measures used to express the forecasting accuracy of the data and the mean Squared Error (MSE) equation(8) can be written as follows:
; , n: number of observations, real observation at time t, the forecasting value for the observation y at the time t, is the residuals series at the time t.
Results and conclusions
Data and Framework used in this study
In this study, the forecasting methods represented by the ARIMA model and the ARIMA-SS hybrid model were used to forecast the minimum temperature of Mosul and Baghdad. The minimum temperature data of Mosul and Baghdad are heterogeneous, therefore, the TS method was used to obtain more homogeneous data for forecasting minimum temperatures for daily data for the period (1/11/2011-31/10/2022) (23, 24), which is by aligning the data for the hot season and the cold season separately, as the data for the hot season for consecutive months (may - October) for the years within the period (2012-2022). And the data of the cold season for consecutive months (November– April) for the years within the period (2011-2022). 1656 obseravtion were used for training for the warm season for both Mosul and Baghdad. 1629 observations were used for training for the cold season for both Mosul and Baghdad. 368 Test observations were used for the warm season for both Mosul and Baghdad and for the cold season 362 Test observations for both Mosul and Baghdad were used to forecast the minimum temperature. These data were modeled using the double seasonal ARIMA model. Based on the above, the seasonality of the data and its models will be assumed on the basis that each seasonal cycle will consist of six months, i.e. (s=6). The working steps will be as follows:
- Build an ARIMA model based on the Box-Jenkins methodology.
- Take the right-hand side of the ARIMA models as input to the SS model.
- Initialize each of the following variables:
- The target variable , which is the left end of the ARIMA model.
- The variable is the right-hand end of the ARIMA model except for the current residuals series .
- The variable is the current residuals series.
ARIMA Model
The Box-Jenkins methodology in which the first step is to identify the model, one of the basic conditions of which is that the data is stationary. To detect the degree of (stationarity) of the data is done by plotting time series data, and it is also possible to use the ACF and PACF functions . To achieve stationarity, the usual and seasonal sequential differences are taken on the assumption that the variation is stationary, since the minimum temperature is a natural phenomenon that is not subject to ambient interference. One regular difference and one seasonal difference were taken, respectively, and stationarity was tested at each stage, and it turned out that the data was completely stationary after taking the two differences above. The determination of the model is also through the plotting of the functions ACF and PACF. Figure 1 shows ACF and PACF of the city of Mosul the hot season after stationarity, i.e. when (D=1,d=1, s=6).
Figure 1. ACF and PACF for the city of Mosul is the hot season when (D=1, d=1, s=6)
It can be seen from Figure 1 that ACF shows a break after the usual Lag 2 and after the seasonal Lag 6, while PACF shows a decaying pattern in the normal and seasonal phases, so the best ARIMA model for the city of the hot season connector is ARIMA (0,1,2)(0,1,1)6. Table 2. It shows the parameter values of the ARIMA model for the city of Mosul hot season and significant indicators..
Table 2. Parameter values for the ARIMA model
|
Type
|
Parameter
|
Calc. t
|
p-value
|
|
|
0.4861
|
19.83
|
0.000
|
|
|
0.1675
|
6.82
|
0.000
|
|
|
0.9817
|
14664.08
|
0.000
|
From Table 2. It turns out that all the values of the estimated parameters are significant (p-value is less than the level of significance 0.05), which are the parameters belonging to the ARIMA (0,1,2)(0,1,1)6. model. The ACF function of the remainders was also tested and it turned out that they match the conditions of a good model. Thus, the ARIMA (0,1,2)(0,1,1)6. will have successfully passed the diagnostic tests. The ARIMA (0,1,2)(0,1,1)6. model of the minimum temperature of the city of Mosul for the hot season can be expressed by the following formula:
Let is the stationary series after taking difference procedures i.e. , and
After multiplying the brackets and making mathematical simplifications, we get the following:
After compensating for the values of the parameters found in table2. In equation(18), the equation of the ARIMA model becomes as follows:
Table 3. The MSE scale for Mosul city, the hot season for both training and testing periods for ARIMA model
|
Training Period
|
Testing Period
|
|
4.4404
|
4.1050
|
The Box-Jenkins methodology in which the first step is to identify the model, one of the basic conditions of which is that the data be stationary. To detect the degree of stationarity of the data, time series data is plotted, and it is also possible to use the ACF and PACF functions . To achieve stationarity, the usual and seasonal sequential differences are taken on the assumption that the variation is stationary, since the minimum temperature is a natural phenomenon that is not subject to ambient interference. One regular difference and one seasonal difference were taken, respectively, and stationarity was tested at each stage, and it turned out that the data was completely stationary after taking the two differences above. The determination of the model is also through the plotting of the functions ACF and PACF. Figure 2. ACF and PACF for Mosul show the cold season after stationarity, i.e. when (D=1,d=1, s=6)
Figure2. ACF and PACF for the city of Mosul, Cold Season (D=1,d=1,s=6)
Figure 2 shows that ACF shows a break after the usual Lag 3 and after seasonal Lag 18, while PACF shows a break after the usual Lag 1 but shows decay in the seasonal phase, so the best ARIMA model for Mosul in the cold season is ARIMA (1,1,2)(0,1,3)6. . Table 4. Shows the parameter values of the ARIMA (1,1,2)(0,1,3)6. model for the city of Mosul cold season and significant indicators.
Table 4. Parameter Values for the ARIMA model
|
Type
|
Parameter
|
Calc. t
|
p-value
|
|
|
0.6127
|
16.72
|
0.000
|
|
|
0.8332
|
21.19
|
0.000
|
|
|
0.0695
|
2.15
|
0.031
|
|
|
0.7818
|
951.10
|
0.000
|
|
|
0.1094
|
4.45
|
0.000
|
|
|
0.0922
|
3.76
|
0.000
|
From Table 4 it is clear that all the values of the estimated parameters are significant (p - value is less than the level of significance 0.05), which are the parameters belonging to the ARIMA (1,1,2)(0,1,3)6. model . The ACF function of the remainders was also tested and it turned out that they match the conditions of a good model. Thus, the A ARIMA (1,1,2)(0,1,3)6. model has successfully passed the diagnostic tests. The ARIMA (1,1,2)(0,1,3)6. model of the minimum temperature of Mosul in the cold season can be expressed by the following formula:
After multiplying the brackets and making mathematical simplifications, we get the following:
After compensating for the values of the parameters found in table 3 in equation(21), the ARIMA (1,1,2)(0,1,3)6 model becomes as follows:
Table 5. Below are the MSE values of the ARIMA (1,1,2)(0,1,3)6 model's forecasts for the minimum temperature of Mosul in the cold season for the training and testing periods.
Table 5. The MSE scale for Mosul city is the cold season for both training and testing periods
|
Testing Period
|
Training Period
|
|
4.1610
|
4.0958
|
The determination of the model is also through the drawing of the functions ACF and PACF. Figure 3. ACF and PACF for the city of Baghdad show the hot season after stationarity.
Figure 3. ACF and PACF for the city of Baghdad, hot season (D=1,d=1, s=6).
It is evidenced by Figure 3. The ACF shows a break after the usual Lag 1 and after the seasonal Lag 6, while the PACF shows a decay in the usual and seasonal phases, so the best ARIMA model for Baghdad in the hot season is ARIMA(0,1,1)(0,1,1)6. Table 6. It presents the parameter values of the ARIMA(0,1,1)(0,1,1)6 model for the city of Baghdad, the hot season and significance indicators.
Table 6. Parameter Values for the ARIMA model
|
Type
|
Parameter
|
Calc. t
|
p-value
|
|
|
0.8638
|
70.58
|
0.000
|
|
|
0.9854
|
23100.81
|
0.000
|
From Table 6. It turns out that all the values of the estimated parameters are significant (p - value is less than the level of 0.05), which are the parameters belonging to the ARIMA(0,1,1)(0,1,1)6 model. The ACF function of the remainders was also tested and it turned out that they match the conditions of a good model. Thus, the ARIMA(0,1,1)(0,1,1)6 model has successfully passed the diagnostic tests. The ARIMA(0,1,1)(0,1,1)6 model of the minimum temperature of the city of Baghdad in the hot season can be expressed by the following formula:
After multiplying the brackets and making mathematical simplifications, we get the following:
After compensating for the values of the parameters found in table 6 in equation(24), the ARIMA(0,1,1)(0,1,1)6 model becomes as follows:
Table 7. The following shows the MSE values for ARIMA(0,1,1)(0,1,1)6 model forecasts for the minimum temperature of the city of Baghdad hot season for the training and testing periods.
Table 7. The MSE scale for Baghdad city is the hot season for training and testing periods for ARIMA
|
Training Period
|
Testing Period
|
|
32.5704
|
20.6798
|
The determination of the model is also through the drawing of the functions ACF and PACF. Figure 4. ACF and PACF for the city of Baghdad show the cold season after stationarity.
Figure 4. ACF and PACF for the city of Baghdad cold season (D=1,d=1, s=6)
It is evidenced by Figure 4. The ACF shows a break after the usual Lag 2 and after the seasonal Lag 6, while the PACF shows a break after the usual Lag 1 but shows a decay in the seasonal phase, so the best ARIMA model for Baghdad city in the cold season is ARIMA(0,1,2)(0,1,1)6. Table 8 . Displays parameter values of ARIMA(0,1,2)(0,1,1)6 model for Baghdad city of cold season and significant indicators.
Table 8. Parameter Values for the ARIMA model
|
Type
|
Parameter
|
Calc. t
|
p-value
|
|
|
0.4464
|
7.95
|
0.000
|
|
|
0.7571
|
12.94
|
0.000
|
|
|
0.0986
|
2.32
|
0.020
|
|
|
0.9806
|
7094.52
|
0.000
|
From Table 8. It turns out that all the values of the estimated parameters are significant (p - value is less than the level of significance 0.05), which are the parameters belonging to the ARIMA(1,1,2)(0,1,1)6. model . The ACF function of the remainders was also tested and it turned out that they match the conditions of a good model. Thus, the ARIMA(1,1,2)(0,1,1)6. model has successfully passed the diagnostic tests. The ARIMA(1,1,2)(0,1,1)6. model of the minimum temperature of the city of Baghdad cold season can be expressed by the following formula:
After multiplying the brackets and making mathematical simplifications, we get the following:
After compensating for the values of the parameters found in table 7 in equation(27), the ARIMA model becomes as follows:
Table 9 below shows the MSE values of the ARIMA model temperature forecasts for the city of Baghdad cold season for the training and test periods.
Table 9. The MSE scale for Baghdad city is the cold season for training and testing periods for ARIMA
|
Training Period
|
Testing Period
|
|
9.3753
|
11.8174
|
When forecasting using the ARIMA model for the training and testing periods, respectively, for the city of Mosul during the hot season, the MSE was equal to 4.4404 and 4.1050, respectively, as shown in Table 3. While for the city of Mosul during the cold season, the MSE was equal to 4.0958 and 4.1610, respectively, as in Table 5. For the city of Baghdad, the hot season MSE was 32.5704 and 20.6798 respectively in Table 7. For the city of Baghdad, the cold season MSE was 9.3753 and 11.8174, respectively, as shown in Table 9.
ARIMA-SS hybrid approach
The input of the SS model is the variables on the right-hand side of the equation of the doubled seasonal ARIMA model, excluding the variable after keeping on the left-hand side. When taking the variables on the right side of the equation of the doubled seasonal ARIMA model regardless of the parameters and signals, the hybrid method is called ARIMA-SS without parameters, and when taking the variables on the right side of the equation of the doubled seasonal ARIMA model with parameters and signals (i.e. multiplying variables by parameters and signals), the hybrid method is called ARIMA-SS with parameters. The framework for completing the process of forecasting minimum temperatures using the ARIMA-SS hybrid model in the Matlab program is summarized as follows.
- Modeling of minimum temperature time series data using the ARIMA model.
- The input variables of the ARIMA-SS hybrid model are the same as the variables and their parameters at the right end of the final estimated ARIMA models, which include parameters, self-regression variables, moving averages, as well as the remainder variable.
- Initialize the input variables and the target variable , which is the variable (the original series), the remainder and the two matrices referred to in equations 11 and 12 and insert them into (Matlab).
- Build the ARIMA-SS hybrid model using the data configured in the previous step.
- Calculation of the MSE error criterion.
When using the ARIMA-SS hybrid method to forecast the temperatures of Mosul in the hot season, equation(19) was the variables that determined the structure of the SS model. Table 10 below shows the MSE values of the ARIMA-SS hybrid model temperature forecasts for Mosul city during the hot season for the training and testing period .
Table 10. The MSE scale for Mosul city is the hot season for training and testing periods for ARIMA-SS model.
|
|
Training
|
Testing
|
|
Without Parameters
|
1.5199
|
1.2575
|
|
With Parameters
|
1.4478
|
1.1744
|
The MSE value in Table 10 when using the ARIMA-SS hybrid method for forecasting the training period without parameters and with parameters, respectively, was 1.5199 and 1.4478, respectively, which is smaller than the MSE for the training period when using the ARIMA model for forecasting. For the test period without parameters and with parameters respectively, the MSE value was 1.2575 and 1.1744, respectively, which is smaller than the MSE for the test period when using the ARIMA model. Thus, it is clear that the use of the ARIMA-SS hybrid method for forecasting dealt with uncertainty and had more accuracy in forecasting than the ARIMA model, when using which the forecast results were not good, because the data in the hot season are more scattered and less homogeneous. Figure 5 and Figure 6 below show the extent of correspondence and harmony between the original time series variable and the forecast series for the training and testing periods respectively for the hot season of Mosul using the ARIMA -SS hybrid model with and without parameters.
Figure 5. The Fitting figure between the original time series and the forecast series for the training and testing periods respectively for the hot season of Mosul using the ARIMA-SS hybrid model without parameters
.
Figure 6. The Fitting figure is formed between the original time series and the forecasting series for the training and testing periods respectively for the hot season of the conductor using the ARIMA-SS hybrid model with the parameters
It can be seen from Figure 5 and Figure 6 that the ARIMA-SS hybrid model performed well in forecasting the minimum temperatures of Mosul in the hot season due to the convergence of the original time series with the forecasting series of the ARIMA -SS hybrid model without parameters and with parameters for both training and testing periods. When using the ARIMA-SS hybrid method to forecast the temperatures of Mosul in the cold season, equation(22) was the variables that determined the structure of the SS model .Table 11 below shows the MSE values of the ARIMA-SS hybrid model temperature forecasts for Mosul cold season for the training and test periods.
Table 11. The MSE scale for Mosul city is the cold season for both training and testing periods for ARIMA-SS
|
Training Period
|
Testing period
|
|
Without Parameters
|
6.1452
|
Without Parameters
|
3.0313
|
|
With Parameters
|
6.1426
|
With Parameters
|
2.8891
|
The MSE in Table 11 when using the ARIMA-SS hybrid method for forecasting the training period without parameters and with parameters, respectively, was 6.1452 and 6.1426, respectively, which is greater than the MSE for the training period when using the ARIMA model for forecasting. For the test period without parameters and with parameters respectively, the MSE values were 3.0313 and 2.8891, respectively, which is smaller than the MSE for the test period when using the ARIMA model. Thus, it becomes clear that the use of the ARIMA-SS hybrid method for forecasting in the training period had less accuracy in forecasting than the ARIMA model, which forecast results when using it were good, because the data for the city of Mosul for the cold season are less scattered and more homogeneous, while in the test period the forecast results when using the ARIMA-SS hybrid method for forecasting were more accurate than the ARIMA model. Figure 7 and figure 8 below show the extent of correspondence and harmony between the original time series variable and the forecast series for the training and testing periods respectively for the cold season of Mosul using the ARIMA-SS hybrid model with and without parameters.
Figure 7. The Fitting figure between the original time series and the forecasting series of the training and testing periods respectively for the cold season of Mosul using the ARIMA-SS hybrid model without parameters.
Figure 8. The Fitting figure is formed between the original time series and the forecasting series for the training and testing periods respectively for the cold season of Mosul using the ARIMA-SS hybrid model with parameters.
It is shown in Figure 7 and Figure 8 that the ARIMA-SS hybrid model performed well in forecasting the minimum temperatures of Mosul in the cold season due to the convergence of the original time series with the forecasting series of the ARIMA-SS hybrid model without parameters and with parameters for both training and testing periods.
When using the ARIMA-SS hybrid method to forecast the temperatures of the city of Baghdad in the hot season, equation(25) was the variables that determined the structure of the SS model. Table 12 below shows the MSE values for the ARIMA-SS hybrid model temperature forecasts for Baghdad during the hot season for the training and testing periods
Table 12. The MSE scale for Baghdad city is the hot season for both training and testing periods for ARIMA-SS
|
Training Period
|
Testing period
|
|
Without Parameters
|
1.5712
|
Without Parameters
|
6.7694
|
|
With Parameters
|
1.5875
|
With Parameters
|
6.7697
|
The MSE value in Table 12 when using the ARIMA-SS hybrid method for forecasting the training period without parameters and with parameters, respectively, was 1.5712 and 1.5875, respectively, which is smaller than the MSE for the training period when using the ARIMA model for forecasting. For the test period without parameters and with parameters respectively, the MSE value was 6.7694 and 6.7697, respectively, which is smaller than the MSE for the test period when using the ARIMA model for forecasting. Thus, it is clear that the use of the ARIMA-SS hybrid method for forecasting dealt with uncertainty and had more accuracy in forecasting than the ARIMA model, when using which the forecast results were not good, because the data in the hot season are more scattered and less homogeneous. Figure 9 and Figure 10 below show the extent of correspondence and harmony between the original time series variable and the forecast series for the training and testing periods respectively for the hot season of Baghdad using the ARIMA-SS hybrid model with and without parameters.
Figure 9. The Fitting figure between the original time series and the forecasting series for the training and testing periods respectively for the Baghdad hot season using the ARIMA-SS hybrid model without parameters.
Figure 10. The Fitting figure between the original time series and the forecasting series for the training and testing periods respectively for the Baghdad hot season using the ARIMA-SS hybrid model with parameters.
It is shown in Fig. 9 and fig. 10 that the ARIMA-SS hybrid model performed well in forecasting the minimum temperatures of Baghdad during the hot season due to the convergence of the original time series with the forecasting series of the ARIMA-SS hybrid model without parameters and with parameters for both training and testing periods.
When using the ARIMA-SS hybrid method to forecast the temperatures of the city of Baghdad in the cold season, equation(28) was the variables that determined the structure of the SS model .Table 13 below shows the MSE values of the ARIMA-SS hybrid model temperature forecasts for the city of Baghdad cold season for the training and test periods.
Table 13. MSE scale for Baghdad city is the cold season for both training and testing for ARIMA-SS
|
Training Period
|
Testing period
|
|
Without Parameters
|
20.9981
|
Without Parameters
|
6.4420
|
|
With Parameters
|
21.0414
|
With Parameters
|
6.4414
|
The MSE value in Table 13 when using the ARIMA-SS hybrid method for forecasting the training period without parameters and with parameters, respectively, was 20.9981 and 21.0414, respectively, which is greater than the MSE for the training period when using the ARIMA model for forecasting. For the test period without parameters and with parameters respectively, the MSE values were 6.4420 and 6.4414, respectively, which is smaller than the MSE for the test period when using the ARIMA model. Thus, it becomes clear that the use of the ARIMA-SS hybrid method for forecasting in the training period had a lower accuracy in forecasting than the ARIMA model, which forecast results when using it were good, because the data for the city of Baghdad for the cold season are less scattered and more homogeneous, while in the test period the forecast results when using the ARIMA-SS hybrid method for forecasting were more accurate than the ARIMA model. Figure 11 and Figure 12 below show the extent of congruence and harmony between the original time series variable and the forecast series for the training and testing periods respectively for the cold season of the city of Baghdad using the ARIMA-SS hybrid model with and without parameters.
Figure 11. The Fitting figure between the original time series and the forecasting series for the training and testing periods respectively for the cold season of Baghdad using the ARIMA-SS hybrid model without parameters.
Figure 12. The Fitting figure between the original time series and the forecasting series for the training and testing periods respectively for the cold season of Baghdad using the ARIMA-SS hybrid model with parameters.
It is shown in Figure 11 and Figure 12 that the ARIMA-SS hybrid model performed well in forecasting the minimum temperatures of the city of Baghdad in the cold season due to the convergence of the original time series with the forecasting series of the ARIMA-SS hybrid model without parameters and with parameters for both training and testing periods.
Conclusions
The ARIMA-SS hybrid method was used to improve the accuracy of minimum-temperature forecasting. Two sets of minimum-temperature data were used for the cities of Mosul and Baghdad. The results showed that ARIMA and ARIMA-SS hybrid models were effective. However, the MSE forecast results indicated that the ARIMA-SS hybrid model was the most effective tool for improving the accuracy of minimum-temperature forecasts, especially after aligning the data using Time Stratified (TS) to eliminate the problem of heterogeneity. But in the cold season for the cities of Mosul and Baghdad, for the training period, the ARIMA model had more accuracy in forecasting than the ARIMA-SS hybrid model, because the data in the cold season were less scattered and more homogeneous. Also, the use of parameters or not in the configuration of the input structure of the hybrid model did not have a significant difference in the efficiency and accuracy of forecasts equally. The advantages of the hybrid model were to deal with the problem of uncertainty that is caused by the non-linearity of the data. Therefore, it is possible to conclude that the use of the ARIMA-SS hybrid model will result in better forecasting accuracy for the minimum temperature compared to the forecasting performance of the traditional ARIMA model.