C:\Users\VanOp\AppData\Local\Temp\ipykernel_15916\1212899292.py:14: DeprecationWarning: Importing display from IPython.core.display is deprecated since IPython 7.14, please import from IPython display
  from IPython.core.display import display, HTML, SVG

Autosaving every 500 seconds

'%.4f'

432000 l/h 10368000 l/d

3615 Lupa: 3.46 %, estimated drainage area: 8.3333

520 Lupa: 24 %

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4199 entries, 2009-01-01 to 2020-06-30
Freq: D
Data columns (total 6 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  4199 non-null   float64
 1   Flow_Rate_Lupa  4199 non-null   float64
 2   doy             4199 non-null   int64  
 3   Month           4199 non-null   int64  
 4   Year            4199 non-null   int64  
 5   ET01            3834 non-null   float64
dtypes: float64(3), int64(3)
memory usage: 229.6 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4480 entries, 2009-01-01 to 2021-04-07
Freq: D
Data columns (total 1 columns):
 #   Column   Non-Null Count  Dtype  
---  ------   --------------  -----  
 0   Portata  4362 non-null   float64
dtypes: float64(1)
memory usage: 199.0 KB

Date
2009-01-01     NaN
2009-01-02     NaN
2009-01-03     NaN
2009-01-04     NaN
2009-01-05     NaN
              ... 
2020-06-26   -0.16
2020-06-27   -0.10
2020-06-28   -0.11
2020-06-29   -0.03
2020-06-30   -0.25
Freq: D, Name: Diff, Length: 4199, dtype: float64

<reliability.Fitters.Fit_Everything at 0x1c6fd416610>

10.720848160507124

<reliability.Fitters.Fit_Everything at 0x1c6fd5300a0>

2021-05-08 14:56:12,771 [14176] WARNING  py.warnings: c:\program files\python38\lib\site-packages\statsmodels\tsa\statespace\sarimax.py:966: UserWarning: Non-stationary starting autoregressive parameters found. Using zeros as starting parameters.
  warn('Non-stationary starting autoregressive parameters'

2021-05-08 14:56:12,771 [14176] WARNING  py.warnings: c:\program files\python38\lib\site-packages\statsmodels\tsa\statespace\sarimax.py:978: UserWarning: Non-invertible starting MA parameters found. Using zeros as starting parameters.
  warn('Non-invertible starting MA parameters found.'

========================================
Cross-validating your time series models
========================================
Like scikit-learn, ``pmdarima`` provides several different strategies for
cross-validating your time series models. The interface was designed to behave
as similarly as possible to that of scikit to make its usage as simple as
possible.

pmdarima version: 1.8.2
Model 1 CV scores: [200.0, 36.5652808620072, 200.00000000000003, 121.85661559538535, 93.81143636894583, 200.00000000000003, 104.01604418272268, 112.21085170028057]
Model 2 CV scores: [128.42870452935162, 29.244095481046624, 200.00000000000003, 8.157473882452118, 200.0, 200.0, 125.97853712573462, 143.72172542399971]
Lowest average SMAPE: 129.4413170553231 (model2)
Best model:  ARIMA(1,0,1)(1,0,0)[12] intercept

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4199 entries, 2009-01-01 to 2020-06-30
Freq: D
Data columns (total 8 columns):
 #   Column           Non-Null Count  Dtype  
---  ------           --------------  -----  
 0   Rainfall_Terni   4199 non-null   float64
 1   Flow_Rate_Lupa   4199 non-null   float64
 2   doy              4199 non-null   int64  
 3   Month            4199 non-null   int64  
 4   Year             4199 non-null   int64  
 5   ET01             3834 non-null   float64
 6   Flow_log         4199 non-null   float64
 7   Flow_log_pct_ch  4198 non-null   float64
dtypes: float64(5), int64(3)
memory usage: 455.2 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 12 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  3834 non-null   float64
 1   Flow_Rate_Lupa  3834 non-null   float64
 2   doy             3834 non-null   float64
 3   Month           3834 non-null   float64
 4   Year            3834 non-null   float64
 5   ET01            3834 non-null   float64
 6   Infilt_         3834 non-null   float64
 7   Infiltsum       3834 non-null   float64
 8   Rainfall_Ter    3834 non-null   float64
 9   Flow_Rate_Lup   3834 non-null   float64
 10  Infilt_m3       3834 non-null   float64
 11  Week            3834 non-null   int64  
dtypes: float64(11), int64(1)
memory usage: 549.4 KB

132

(553, 1)

(574, 11)

Performing stepwise search to minimize aic
 ARIMA(0,1,0)(0,0,0)[52] intercept   : AIC=675.953, Time=0.05 sec
 ARIMA(1,1,0)(1,0,0)[52] intercept   : AIC=294.686, Time=8.21 sec
 ARIMA(0,1,1)(0,0,1)[52] intercept   : AIC=395.662, Time=12.33 sec
 ARIMA(0,1,0)(0,0,0)[52]             : AIC=674.029, Time=0.09 sec
 ARIMA(1,1,0)(0,0,0)[52] intercept   : AIC=293.282, Time=0.23 sec
 ARIMA(1,1,0)(0,0,1)[52] intercept   : AIC=294.800, Time=2.60 sec
 ARIMA(1,1,0)(1,0,1)[52] intercept   : AIC=inf, Time=10.42 sec
 ARIMA(2,1,0)(0,0,0)[52] intercept   : AIC=288.957, Time=0.33 sec
 ARIMA(2,1,0)(1,0,0)[52] intercept   : AIC=290.314, Time=10.99 sec
 ARIMA(2,1,0)(0,0,1)[52] intercept   : AIC=290.448, Time=5.03 sec
 ARIMA(2,1,0)(1,0,1)[52] intercept   : AIC=inf, Time=14.38 sec
 ARIMA(3,1,0)(0,0,0)[52] intercept   : AIC=290.318, Time=0.48 sec
 ARIMA(2,1,1)(0,0,0)[52] intercept   : AIC=290.666, Time=0.51 sec
 ARIMA(1,1,1)(0,0,0)[52] intercept   : AIC=288.685, Time=0.24 sec
 ARIMA(1,1,1)(1,0,0)[52] intercept   : AIC=290.114, Time=9.57 sec
 ARIMA(1,1,1)(0,0,1)[52] intercept   : AIC=290.229, Time=11.08 sec
 ARIMA(1,1,1)(1,0,1)[52] intercept   : AIC=inf, Time=13.75 sec
 ARIMA(0,1,1)(0,0,0)[52] intercept   : AIC=396.920, Time=0.27 sec
 ARIMA(1,1,2)(0,0,0)[52] intercept   : AIC=290.689, Time=0.55 sec
 ARIMA(0,1,2)(0,0,0)[52] intercept   : AIC=315.906, Time=0.40 sec
 ARIMA(2,1,2)(0,0,0)[52] intercept   : AIC=290.356, Time=0.60 sec
 ARIMA(1,1,1)(0,0,0)[52]             : AIC=286.675, Time=0.32 sec
 ARIMA(1,1,1)(1,0,0)[52]             : AIC=288.129, Time=9.73 sec
 ARIMA(1,1,1)(0,0,1)[52]             : AIC=288.255, Time=7.62 sec
 ARIMA(1,1,1)(1,0,1)[52]             : AIC=287.903, Time=4.73 sec
 ARIMA(0,1,1)(0,0,0)[52]             : AIC=394.958, Time=0.16 sec
 ARIMA(1,1,0)(0,0,0)[52]             : AIC=291.286, Time=0.11 sec
 ARIMA(2,1,1)(0,0,0)[52]             : AIC=289.349, Time=0.44 sec
 ARIMA(1,1,2)(0,0,0)[52]             : AIC=288.688, Time=0.46 sec
 ARIMA(0,1,2)(0,0,0)[52]             : AIC=313.925, Time=0.22 sec
 ARIMA(2,1,0)(0,0,0)[52]             : AIC=286.943, Time=0.29 sec
 ARIMA(2,1,2)(0,0,0)[52]             : AIC=288.343, Time=0.57 sec

Best model:  ARIMA(1,1,1)(0,0,0)[52]          
Total fit time: 126.794 seconds

count    3834.00
mean        2.70
std         1.26
min         0.36
25%         1.64
50%         2.43
75%         3.61
max         6.28
Name: ET01, dtype: float64

2.8819995032290113

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 138 entries, 0 to 137
Data columns (total 3 columns):
 #   Column                 Non-Null Count  Dtype  
---  ------                 --------------  -----  
 0   (Year, )               138 non-null    int64  
 1   (Month, )              138 non-null    int64  
 2   (Rainfall_Terni, sum)  138 non-null    float64
dtypes: float64(1), int64(2)
memory usage: 3.4 KB

2009    1095.65
2010     913.90
2011     620.51
2012    1112.51
2013    1069.83
2014    1039.81
2015     804.93
2016     667.06
2017     820.98
2018     823.33
2019     948.42
2020        NaN
dtype: float64

6       39.56
7       69.01
8      176.11
9      255.53
10     377.24
        ...  
133    657.82
134    712.82
135    765.02
136    880.22
137    948.42
Length: 132, dtype: float64

DatetimeIndex(['2009-06-01', '2009-07-01', '2009-08-01', '2009-09-01',
               '2009-10-01', '2009-11-01', '2009-12-01', '2010-01-01',
               '2010-02-01', '2010-03-01',
               ...
               '2019-08-01', '2019-09-01', '2019-10-01', '2019-11-01',
               '2019-12-01', '2020-01-01', '2020-02-01', '2020-03-01',
               '2020-04-01', '2020-05-01'],
              dtype='datetime64[ns]', length=132, freq='MS')

2009-06-01     39.56
2009-07-01     69.01
2009-08-01    176.11
2009-09-01    255.53
2009-10-01    377.24
               ...  
2020-01-01    657.82
2020-02-01    712.82
2020-03-01    765.02
2020-04-01    880.22
2020-05-01    948.42
Freq: MS, Length: 132, dtype: float64

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 130 entries, 2009-09-01 to 2020-06-01
Freq: MS
Data columns (total 2 columns):
 #   Column            Non-Null Count  Dtype  
---  ------            --------------  -----  
 0   Monthly rainfall  130 non-null    float64
 1   Flow_Rate_Lupa    103 non-null    float64
dtypes: float64(2)
memory usage: 7.1 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 365 entries, 2019-01-01 to 2019-12-31
Freq: D
Data columns (total 1 columns):
 #   Column         Non-Null Count  Dtype  
---  ------         --------------  -----  
 0   Rainfall_Anca  365 non-null    float64
dtypes: float64(1)
memory usage: 5.7 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 1060 entries, 2009-01-01 to 2022-05-25
Data columns (total 5 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  915 non-null    float64
 1   Flow_Rate_Lupa  1060 non-null   float64
 2   doy             366 non-null    float64
 3   Month           366 non-null    float64
 4   Year            366 non-null    float64
dtypes: float64(5)
memory usage: 49.7 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4018 entries, 2010-01-01 to NaT
Data columns (total 15 columns):
 #   Column            Non-Null Count  Dtype         
---  ------            --------------  -----         
 0   Rainfall_Terni    3834 non-null   float64       
 1   Flow_Rate_Lupa    3834 non-null   float64       
 2   doy               3834 non-null   float64       
 3   Month             3834 non-null   float64       
 4   Year              3834 non-null   float64       
 5   ET01              3834 non-null   float64       
 6   Infilt_           3834 non-null   float64       
 7   Infiltsum         3834 non-null   float64       
 8   Rainfall_Ter      3834 non-null   float64       
 9   Flow_Rate_Lup     3834 non-null   float64       
 10  Infilt_m3         3834 non-null   float64       
 11  Week              3834 non-null   float64       
 12  Date_excel        3834 non-null   datetime64[ns]
 13  log_Flow          3834 non-null   float64       
 14  Lupa_Mean99_2011  4018 non-null   float64       
dtypes: datetime64[ns](1), float64(14)
memory usage: 631.3 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 138 entries, 2009-01-01 to 2020-06-01
Data columns (total 2 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Rainfall_Terni_scale_12                   127 non-null    float64
 1   Rainfall_Terni_scale_12_calculated_index  127 non-null    float64
dtypes: float64(2)
memory usage: 3.2 KB

DatetimeIndex(['2010-01-01', '2010-01-02', '2010-01-03', '2010-01-04',
               '2010-01-05', '2010-01-06', '2010-01-07', '2010-01-08',
               '2010-01-09', '2010-01-10',
               ...
               '2020-06-20', '2020-06-21', '2020-06-22', '2020-06-23',
               '2020-06-24', '2020-06-25', '2020-06-26', '2020-06-27',
               '2020-06-28', '2020-06-29'],
              dtype='datetime64[ns]', length=3833, freq='D')

2010-01-01    1.07
2010-01-02    1.07
2010-01-03    1.07
2010-01-04    1.07
2010-01-05    1.07
              ... 
2020-06-25    0.12
2020-06-26    0.12
2020-06-27    0.12
2020-06-28    0.12
2020-06-29    0.12
Freq: D, Name: Rainfall_Terni_scale_12_calculated_index, Length: 3833, dtype: float64

pandas.core.series.Series

count    4199.00
mean        0.48
std         0.50
min         0.00
25%         0.00
50%         0.00
75%         1.00
max         1.00
Name: Dormant, dtype: float64

108.85714285714283 21.77142857142857 5.442857142857142

70 77 91 94

C:\Users\Kurt\AppData\Local\Temp\ipykernel_5440\2688920214.py:1: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Lupa_excel2["Infilt2"] =Lupa_excel2["Rainfall_Terni"]-Lupa_excel2["runoffdepth2"]

5.443 5.44   S: 108.9

C:\Users\Kurt\AppData\Local\Temp\ipykernel_5440\1908158279.py:1: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Lupa_excel2["Infilt2sum"] =  Lupa_excel2["Infilt2"].cumsum()

CN1: 51 in meters

CN3: 2703   in meters

                            OLS Regression Results                            
==============================================================================
Dep. Variable:                      y   R-squared:                       0.991
Model:                            OLS   Adj. R-squared:                  0.987
Method:                 Least Squares   F-statistic:                     232.5
Date:                Thu, 26 May 2022   Prob (F-statistic):            0.00427
Time:                        11:31:57   Log-Likelihood:                 10.440
No. Observations:                   4   AIC:                            -16.88
Df Residuals:                       2   BIC:                            -18.11
Df Model:                           1                                         
Covariance Type:            nonrobust                                         
==============================================================================
                 coef    std err          t      P>|t|      [0.025      0.975]
------------------------------------------------------------------------------
const          0.9742      0.033     29.169      0.001       0.831       1.118
x1            -0.0206      0.001    -15.247      0.004      -0.026      -0.015
==============================================================================
Omnibus:                          nan   Durbin-Watson:                   2.423
Prob(Omnibus):                    nan   Jarque-Bera (JB):                0.879
Skew:                          -1.092   Prob(JB):                        0.644
Kurtosis:                       2.289   Cond. No.                         65.6
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.

c:\program files\python38\lib\site-packages\statsmodels\stats\stattools.py:74: ValueWarning: omni_normtest is not valid with less than 8 observations; 4 samples were given.
  warn("omni_normtest is not valid with less than 8 observations; %i "

                            OLS Regression Results                            
==============================================================================
Dep. Variable:                      y   R-squared:                       0.976
Model:                            OLS   Adj. R-squared:                  0.952
Method:                 Least Squares   F-statistic:                     40.74
Date:                Thu, 26 May 2022   Prob (F-statistic):             0.0989
Time:                        11:32:02   Log-Likelihood:                 11.468
No. Observations:                   3   AIC:                            -18.94
Df Residuals:                       1   BIC:                            -20.74
Df Model:                           1                                         
Covariance Type:            nonrobust                                         
==============================================================================
                 coef    std err          t      P>|t|      [0.025      0.975]
------------------------------------------------------------------------------
const          0.2344      0.010     23.681      0.027       0.109       0.360
x1            -0.0007      0.000     -6.383      0.099      -0.002       0.001
==============================================================================
Omnibus:                          nan   Durbin-Watson:                   2.561
Prob(Omnibus):                    nan   Jarque-Bera (JB):                0.284
Skew:                           0.076   Prob(JB):                        0.868
Kurtosis:                       1.500   Cond. No.                         160.
==============================================================================

Notes:
[1] Standard Errors assume that the covariance matrix of the errors is correctly specified.

c:\program files\python38\lib\site-packages\statsmodels\stats\stattools.py:74: ValueWarning: omni_normtest is not valid with less than 8 observations; 3 samples were given.
  warn("omni_normtest is not valid with less than 8 observations; %i "

16.94329275755012

<AxesSubplot:xlabel='Count', ylabel='Infiltrate'>

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'Flow_Rate_Lup', 'Infilt_m3',
       'Week', 'Date_excel', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate'],
      dtype='object')

array([2010., 2011., 2012., 2013., 2014., 2015., 2016., 2017., 2018.,
       2019., 2020.])

True

DatetimeIndex(['2010-01-01', '2010-01-02', '2010-01-03', '2010-01-04',
               '2010-01-05', '2010-01-06', '2010-01-07', '2010-01-08',
               '2010-01-09', '2010-01-10',
               ...
               '2020-06-22', '2020-06-23', '2020-06-24', '2020-06-25',
               '2020-06-26', '2020-06-27', '2020-06-28', '2020-06-29',
               '2020-06-30',        'NaT'],
              dtype='datetime64[ns]', name='Date', length=3835, freq=None)

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5', 'Flow_Rate_Lup',
       'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff'],
      dtype='object')

c:\program files\python38\lib\site-packages\openpyxl\worksheet\_reader.py:312: UserWarning: Unknown extension is not supported and will be removed
  warn(msg)
c:\program files\python38\lib\site-packages\openpyxl\worksheet\_reader.py:312: UserWarning: Conditional Formatting extension is not supported and will be removed
  warn(msg)

<AxesSubplot:>

c:\program files\python38\lib\site-packages\openpyxl\worksheet\_reader.py:312: UserWarning: Sparkline Group extension is not supported and will be removed
  warn(msg)

<AxesSubplot:xlabel='Date_time', ylabel='DroughtIndex'>

Date
2010-01-01     82.24
2010-01-02     88.90
2010-01-03     93.56
2010-01-04     96.63
2010-01-05     98.65
2010-01-06    102.15
2010-01-07    106.57
2010-01-08    110.57
2010-01-09    117.00
2010-01-10    124.15
2010-01-11    130.30
2010-01-12    135.60
2010-01-13    140.13
2010-01-14    143.60
2010-01-15    146.82
2010-01-16    149.64
2010-01-17    152.13
2010-01-18    153.59
2010-01-19    154.92
2010-01-20    155.98
2010-01-21    156.60
2010-01-22    157.40
2010-01-23    157.56
2010-01-24    157.79
2010-01-25    158.08
2010-01-26    158.23
2010-01-27    158.19
2010-01-28    158.41
2010-01-29    158.52
2010-01-30    158.42
2010-01-31    159.86
Freq: D, Name: Flow_Rate_Lupa, dtype: float64

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4199 entries, 2009-01-01 to 2020-06-30
Freq: D
Data columns (total 6 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  4199 non-null   float64
 1   Flow_Rate_Lupa  4150 non-null   float64
 2   doy             4199 non-null   int64  
 3   Month           4199 non-null   int64  
 4   Year            4199 non-null   int64  
 5   Rainfall_5      4199 non-null   float64
dtypes: float64(3), int64(3)
memory usage: 358.7 KB

<AxesSubplot:ylabel='Date'>

Date
2010-01-01    574.24
2010-02-01    571.83
2010-03-01    570.16
2010-04-01    543.40
2010-05-01    505.95
               ...  
2019-08-01    308.72
2019-09-01    320.39
2019-10-01    255.04
2019-11-01    504.22
2019-12-01    510.36
Freq: MS, Name: sum_5, Length: 120, dtype: float64

Date
2010-01-01        NaN
2010-02-01        NaN
2010-03-01        NaN
2010-04-01        NaN
2010-05-01    1026.95
               ...   
2019-08-01     551.19
2019-09-01     540.67
2019-10-01     521.01
2019-11-01     474.82
2019-12-01     445.18
Freq: MS, Name: sum_5, Length: 120, dtype: float64

10220.83738413908

pandas.core.frame.DataFrame

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 114 entries, 2012-01-02 to 2012-09-01
Data columns (total 5 columns):
 #   Column              Non-Null Count  Dtype  
---  ------              --------------  -----  
 0   Lupa flowrate 2012  114 non-null    float64
 1   _doy                114 non-null    float64
 2   doy                 114 non-null    object 
 3   dayrest             114 non-null    object 
 4   delta               114 non-null    object 
dtypes: float64(2), object(3)
memory usage: 5.3+ KB

DT
2012-01-02    59.50
2012-01-03    59.19
2012-01-05    58.87
2012-01-07    58.56
2012-01-09    58.25
              ...  
2012-08-09    31.63
2012-08-16    31.32
2012-08-23    30.69
2012-08-31    30.38
2012-09-01    30.06
Name: Lupa flowrate 2012, Length: 63, dtype: float64

Index(['01-01', '01-02', '01-03', '01-04', '01-05', '01-06', '01-07', '01-08',
       '01-09', '01-10',
       ...
       '12-22', '12-23', '12-24', '12-25', '12-26', '12-27', '12-28', '12-29',
       '12-30', '12-31'],
      dtype='object', name='DT', length=365)

DatetimeIndex(['2009-01-01', '2009-01-02', '2009-01-03', '2009-01-04',
               '2009-01-05', '2009-01-06', '2009-01-07', '2009-01-08',
               '2009-01-09', '2009-01-10',
               ...
               '2009-12-22', '2009-12-23', '2009-12-24', '2009-12-25',
               '2009-12-26', '2009-12-27', '2009-12-28', '2009-12-29',
               '2009-12-30', '2009-12-31'],
              dtype='datetime64[ns]', length=365, freq='D')

Timestamp('2010-01-01 00:00:00+0200', tz='Europe/Helsinki')

DatetimeIndex(['2010-01-01', '2010-01-02', '2010-01-03', '2010-01-04',
               '2010-01-05', '2010-01-06', '2010-01-07', '2010-01-08',
               '2010-01-09', '2010-01-10',
               ...
               '2010-12-22', '2010-12-23', '2010-12-24', '2010-12-25',
               '2010-12-26', '2010-12-27', '2010-12-28', '2010-12-29',
               '2010-12-30', '2010-12-31'],
              dtype='datetime64[ns]', length=365, freq=None)

DatetimeIndex(['2008-01-01', '2008-01-02', '2008-01-03', '2008-01-04',
               '2008-01-05', '2008-01-06', '2008-01-07', '2008-01-08',
               '2008-01-09', '2008-01-10',
               ...
               '2008-12-22', '2008-12-23', '2008-12-24', '2008-12-25',
               '2008-12-26', '2008-12-27', '2008-12-28', '2008-12-29',
               '2008-12-30', '2008-12-31'],
              dtype='datetime64[ns]', length=365, freq=None)

DatetimeIndex(['2007-01-01', '2007-01-02', '2007-01-03', '2007-01-04',
               '2007-01-05', '2007-01-06', '2007-01-07', '2007-01-08',
               '2007-01-09', '2007-01-10',
               ...
               '2007-12-22', '2007-12-23', '2007-12-24', '2007-12-25',
               '2007-12-26', '2007-12-27', '2007-12-28', '2007-12-29',
               '2007-12-30', '2007-12-31'],
              dtype='datetime64[ns]', length=365, freq=None)

DatetimeIndex(['2006-01-01', '2006-01-02', '2006-01-03', '2006-01-04',
               '2006-01-05', '2006-01-06', '2006-01-07', '2006-01-08',
               '2006-01-09', '2006-01-10',
               ...
               '2006-12-22', '2006-12-23', '2006-12-24', '2006-12-25',
               '2006-12-26', '2006-12-27', '2006-12-28', '2006-12-29',
               '2006-12-30', '2006-12-31'],
              dtype='datetime64[ns]', length=365, freq=None)

doy
1      96.53
2      97.29
3      97.80
4      98.27
5      98.63
       ...  
362    96.80
363    97.32
364    97.69
365    97.84
366    89.25
Name: Flow_Rate_Lupa, Length: 366, dtype: float64

2010-01-01    117.81
2010-01-02    117.81
2010-01-03    120.38
2010-01-04    118.86
2010-01-05    121.07
               ...  
2010-12-27    108.77
2010-12-28    110.44
2010-12-29    111.37
2010-12-30    111.83
2010-12-31    113.40
Name: Lupa_Mean99_2011, Length: 365, dtype: float64

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 14 columns):
 #   Column          Non-Null Count  Dtype         
---  ------          --------------  -----         
 0   Rainfall_Terni  3834 non-null   float64       
 1   Flow_Rate_Lupa  3834 non-null   float64       
 2   doy             3834 non-null   int64         
 3   Month           3834 non-null   int64         
 4   Year            3834 non-null   int64         
 5   ET01            3834 non-null   float64       
 6   Infilt_         3834 non-null   float64       
 7   Infiltsum       3834 non-null   float64       
 8   Rainfall_Ter    3834 non-null   float64       
 9   Flow_Rate_Lup   3834 non-null   float64       
 10  Infilt_m3       3834 non-null   float64       
 11  Week            3834 non-null   int64         
 12  Date_excel      3834 non-null   datetime64[ns]
 13  log_Flow        3834 non-null   float64       
dtypes: datetime64[ns](1), float64(9), int64(4)
memory usage: 449.3 KB

array([115.61])

<class 'pandas.core.frame.DataFrame'>
Index: 367 entries, 2010-01-01 00:00:00 to 2011-01-01
Data columns (total 2 columns):
 #   Column            Non-Null Count  Dtype  
---  ------            --------------  -----  
 0   Lupa_Mean99_2011  366 non-null    float64
 1   DayofYear         366 non-null    float64
dtypes: float64(2)
memory usage: 16.7+ KB

array([  1,   2,   3, ..., 364, 365, 366], dtype=int16)

115.61

array([ 0.29,  1.04,  1.91, ..., -0.73, -0.6 , -0.39])

array([5.29, 5.86, 5.7 , ..., 1.19, 0.64, 0.02])

Distributions sorted by goodness of fit:
----------------------------------------
  Distribution  chi_square  p_value
2        gamma        8.46     0.73
3      lognorm       19.11     0.62
1         beta       53.38     0.30
0        expon      139.87     0.02

RV : 
 <scipy.stats._distn_infrastructure.rv_frozen object at 0x000001969FDE9820> a:  0.3

Random Variates : 
 [4.59e+01 1.01e+00 2.13e+01 1.09e+01 1.17e+01 9.87e+10 4.07e+00 8.59e+00
 3.80e+01 1.76e+02]

Probability Distribution : 
 [0.   0.   0.   0.   0.   0.01 0.01 0.01 0.02 0.02]

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4199 entries, 2009-01-01 to 2020-06-30
Freq: D
Data columns (total 14 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  4199 non-null   float64
 1   Flow_Rate_Lupa  4199 non-null   float64
 2   doy             4199 non-null   float64
 3   Month           4199 non-null   float64
 4   Year            4199 non-null   float64
 5   PET             4199 non-null   float64
 6   PETs            4199 non-null   float64
 7   Infilt_         4199 non-null   float64
 8   Infiltsum       4199 non-null   float64
 9   Infilt_35       4165 non-null   float64
 10  Flow_35         4165 non-null   float64
 11  Net_35          4165 non-null   float64
 12  Flow_Rate_Lup   4199 non-null   float64
 13  Infilt_m3       4199 non-null   float64
dtypes: float64(14)
memory usage: 492.1 KB

1875000

Flow rates: Minimum: 1 Average: 179.12121933793762 Maximum: 366 St.d.: 105.3640542156648 Variation: 11101.58392076155

Flow rates: Minimum: 1 Average: 179.12121933793762 Maximum: 366 St.d.: 105.3640542156648 Variation: 11101.58392076155

Date
2008-07-01    -4.59
2009-07-01    30.37
2010-07-01    -3.97
2011-07-01   -59.88
2012-07-01    33.56
2013-07-01    40.50
2014-07-01   -34.86
2015-07-01   -26.62
2016-07-01   -17.67
2017-07-01    18.65
2018-07-01    13.60
2019-07-01    10.91
Freq: AS-JUL, Name: Rainfall_Terni, dtype: float64

Date
2009-01-01    4227.85
2009-02-01    4421.84
2009-03-01    5569.66
2009-04-01    5390.40
2009-05-01    5221.52
               ...   
2020-02-01    3126.24
2020-03-01    3193.85
2020-04-01    2938.61
2020-05-01    2737.95
2020-06-01    2324.96
Freq: MS, Name: Flow_Rate_Lupa, Length: 138, dtype: float64

Date
2008-07-01    133.54
2009-07-01    211.41
2010-07-01    342.54
2011-07-01   -313.96
2012-07-01     50.03
2013-07-01    297.24
2014-07-01    -16.11
2015-07-01   -169.00
2016-07-01   -230.55
2017-07-01    -90.53
2018-07-01    -76.68
2019-07-01   -137.93
Freq: AS-JUL, Name: Flow_Rate_Lupa, dtype: float64

Date
2009-01-01    2.55e+09
2010-01-01    2.88e+09
2011-01-01    1.74e+09
2012-01-01    2.36e+09
2013-01-01    2.60e+09
2014-01-01    2.68e+09
2015-01-01    1.83e+09
2016-01-01    2.19e+09
2017-01-01    2.19e+09
2018-01-01    3.05e+09
2019-01-01    3.10e+09
2020-01-01    7.95e+08
Freq: AS-JAN, Name: Rainfall_Ter, dtype: float64

Date
2009-03-01    5569.66
2009-04-01    5390.40
2009-05-01    5221.52
2009-06-01    4398.44
2009-07-01    3942.35
               ...   
2020-02-01    3126.24
2020-03-01    3193.85
2020-04-01    2938.61
2020-05-01    2737.95
2020-06-01    2324.96
Freq: MS, Name: Flow_Rate_Lupa, Length: 136, dtype: float64

3599.4780072463764

Rainfall_Terni       2.56
Flow_Rate_Lupa     118.30
doy                179.12
Month                6.39
Year              2014.26
PET                  3.46
PETs                 3.46
Infilt_             -0.90
Infiltsum        -1958.12
dtype: float64

                                     SARIMAX Results                                      
==========================================================================================
Dep. Variable:                     Flow_Rate_Lupa   No. Observations:                  136
Model:             SARIMAX(2, 0, 0)x(2, 0, 0, 12)   Log Likelihood               -1069.244
Date:                            Thu, 29 Apr 2021   AIC                           2148.488
Time:                                    11:45:45   BIC                           2163.051
Sample:                                03-01-2009   HQIC                          2154.406
                                     - 06-01-2020                                         
Covariance Type:                              opg                                         
==============================================================================
                 coef    std err          z      P>|z|      [0.025      0.975]
------------------------------------------------------------------------------
ar.L1          1.3917      0.082     17.045      0.000       1.232       1.552
ar.L2         -0.4404      0.089     -4.957      0.000      -0.615      -0.266
ar.S.L12       0.1810      0.079      2.281      0.023       0.025       0.337
ar.S.L24       0.3126      0.071      4.397      0.000       0.173       0.452
sigma2       3.76e+05   3.39e+04     11.106      0.000     3.1e+05    4.42e+05
===================================================================================
Ljung-Box (L1) (Q):                   0.14   Jarque-Bera (JB):               104.56
Prob(Q):                              0.71   Prob(JB):                         0.00
Heteroskedasticity (H):               0.69   Skew:                             1.51
Prob(H) (two-sided):                  0.22   Kurtosis:                         6.05
===================================================================================

Warnings:
[1] Covariance matrix calculated using the outer product of gradients (complex-step).

                                      SARIMAX Results                                      
===========================================================================================
Dep. Variable:                      Flow_Rate_Lupa   No. Observations:                  136
Model:             SARIMAX(2, 0, 2)x(3, 0, [], 12)   Log Likelihood               -1066.606
Date:                             Sun, 09 May 2021   AIC                           2149.212
Time:                                     20:01:18   BIC                           2172.513
Sample:                                 03-01-2009   HQIC                          2158.681
                                      - 06-01-2020                                         
Covariance Type:                               opg                                         
==============================================================================
                 coef    std err          z      P>|z|      [0.025      0.975]
------------------------------------------------------------------------------
ar.L1          1.5230      0.387      3.934      0.000       0.764       2.282
ar.L2         -0.5655      0.359     -1.575      0.115      -1.269       0.138
ma.L1         -0.1845      0.442     -0.417      0.677      -1.051       0.682
ma.L2         -0.0755      0.180     -0.418      0.676      -0.429       0.278
ar.S.L12       0.1287      0.080      1.604      0.109      -0.029       0.286
ar.S.L24       0.2671      0.080      3.341      0.001       0.110       0.424
ar.S.L36       0.2436      0.090      2.717      0.007       0.068       0.419
sigma2      3.558e+05   3.21e+04     11.066      0.000    2.93e+05    4.19e+05
===================================================================================
Ljung-Box (L1) (Q):                   0.13   Jarque-Bera (JB):               133.35
Prob(Q):                              0.72   Prob(JB):                         0.00
Heteroskedasticity (H):               0.67   Skew:                             1.58
Prob(H) (two-sided):                  0.18   Kurtosis:                         6.68
===================================================================================

Warnings:
[1] Covariance matrix calculated using the outer product of gradients (complex-step).

                                      SARIMAX Results                                      
===========================================================================================
Dep. Variable:                      Flow_Rate_Lupa   No. Observations:                  136
Model:             SARIMAX(2, 0, 1)x(2, 0, [], 12)   Log Likelihood               -1063.494
Date:                             Sun, 09 May 2021   AIC                           2140.988
Time:                                     19:55:59   BIC                           2161.377
Sample:                                 03-01-2009   HQIC                          2149.274
                                      - 06-01-2020                                         
Covariance Type:                               opg                                         
==============================================================================
                 coef    std err          z      P>|z|      [0.025      0.975]
------------------------------------------------------------------------------
intercept    238.1905    101.611      2.344      0.019      39.036     437.345
ar.L1          1.5994      0.102     15.711      0.000       1.400       1.799
ar.L2         -0.6991      0.101     -6.952      0.000      -0.896      -0.502
ma.L1         -0.2916      0.149     -1.958      0.050      -0.584       0.000
ar.S.L12       0.0996      0.078      1.269      0.204      -0.054       0.253
ar.S.L24       0.2335      0.075      3.112      0.002       0.086       0.381
sigma2      3.275e+05   3.14e+04     10.428      0.000    2.66e+05    3.89e+05
===================================================================================
Ljung-Box (L1) (Q):                   0.01   Jarque-Bera (JB):               158.66
Prob(Q):                              0.93   Prob(JB):                         0.00
Heteroskedasticity (H):               0.66   Skew:                             1.69
Prob(H) (two-sided):                  0.16   Kurtosis:                         7.08
===================================================================================

Warnings:
[1] Covariance matrix calculated using the outer product of gradients (complex-step).

c:\program files\python38\lib\site-packages\statsmodels\base\model.py:566: ConvergenceWarning: Maximum Likelihood optimization failed to converge. Check mle_retvals
  warnings.warn("Maximum Likelihood optimization failed to "

<AxesSubplot:title={'center':'Lupa Flow month  Seasonal Decomposition'}, ylabel='data'>

Date
2009-03-01    5569.66
2009-04-01    5390.40
2009-05-01    5221.52
2009-06-01    4398.44
2009-07-01    3942.35
               ...   
2020-02-01    2866.54
2020-03-01    2883.22
2020-04-01    2612.11
2020-05-01    2515.14
2020-06-01    2255.91
Freq: MS, Name: Flow_Rate_Lupa, Length: 136, dtype: float64

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
Input In [130], in <cell line: 1>()
----> 1 import pmdarima as pm
      2 from pmdarima import datasets
      3 from pmdarima import preprocessing

ModuleNotFoundError: No module named 'pmdarima'

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3833 entries, 0 to 3832
Data columns (total 2 columns):
 #   Column    Non-Null Count  Dtype         
---  ------    --------------  -----         
 0   date      3833 non-null   datetime64[ns]
 1   log_Flow  3833 non-null   float64       
dtypes: datetime64[ns](1), float64(1)
memory usage: 60.0 KB

3153600000

62750000.0000

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'PET',
       'PETs', 'Infilt_', 'Infiltsum'],
      dtype='object')

(16071.0, 16801.0)

(0.14761866387549288, 6.876277686622363e-22)

(0.12698045530353386, 1.4653364854225727e-16)

(0.12046154798275299, 4.798577191927823e-15)

(0.09353058745832321, 1.26120129759726e-09)

(0.06692408804886261, 1.4233019265743439e-05)

0.2446

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'Flow_Rate_Lup', 'Infilt_m3',
       'Week', 'Date_excel', 'log_Flow', 'Lupa_Mean99_2011', 'runoffdepth2',
       'Infilt2', 'Infilt2sum'],
      dtype='object')

0.00041666666666666664

500 13.536385332763828
510 13.442144860859681
520 13.341805465927875
530 13.234493421774832
540 13.119328625195655
550 12.995629451108991
560 12.863424169453957
570 12.724638856714765
580 12.585580639577637
590 12.460927412187173
600 12.375739498223215
610 12.355020214397117
620 12.401437027745347
630 12.492393780014359
640 12.601518264962296
650 12.71220917942946
660 12.816961283303373
670 12.913276728366789
680 13.000885588136734
690 13.080376016920832

(127,) (127,)

xcorr Flow_Rate_Lup-Rainfall_Ter 1983 0.24187466748320438

xcorr Flow_Rate_Lup-Rainfall_Ter 83 0.10069751592926483

5.432876712328767

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5', 'Flow_Rate_Lup',
       'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg', 'Add'],
      dtype='object')

xcorr Flow_log-P5 3807 0.771417121202296

25

2021-05-11 12:58:00,694 [15656] WARNING  py.warnings: c:\program files\python38\lib\site-packages\pandas\core\indexing.py:670: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  iloc._setitem_with_indexer(indexer, value)

Date
2017-04-03 00:00:00    4.36
2017-04-04 00:00:00    4.36
2017-04-05 00:00:00    4.35
Name: Flow_log, dtype: float64

Date
2009-06-04 00:00:00    5.05
2009-06-05 00:00:00    5.05
2009-06-06 00:00:00    5.05
Name: Flow_log, dtype: float64

2021-05-11 12:58:27,624 [15656] WARNING  py.warnings: c:\program files\python38\lib\site-packages\pandas\core\indexing.py:670: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  iloc._setitem_with_indexer(indexer, value)

254 192

<class 'pandas.core.frame.DataFrame'>
Index: 254 entries, 2017-04-03 00:00:00 to Flow_logdelta
Data columns (total 26 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  252 non-null    float64
 1   Flow_Rate_Lupa  252 non-null    float64
 2   doy             252 non-null    float64
 3   Month           252 non-null    float64
 4   Year            252 non-null    float64
 5   Diff            252 non-null    float64
 6   pct_ch          252 non-null    float64
 7   Flow_7          252 non-null    float64
 8   Flow_3          252 non-null    float64
 9   Flow_12         252 non-null    float64
 10  Rainfall_Ter    252 non-null    float64
 11  Flow_Rate_Mad   252 non-null    float64
 12  Rainfall_m3_7   252 non-null    float64
 13  Rainfall_m3_10  252 non-null    float64
 14  Rainfall_m3_14  252 non-null    float64
 15  Rainfall_m3_17  252 non-null    float64
 16  Rainfall_m3_20  252 non-null    float64
 17  Rainfall_m3_22  252 non-null    float64
 18  Rainfall_m3_25  252 non-null    float64
 19  Rainfall_m3_30  252 non-null    float64
 20  Rainfall_m3_35  252 non-null    float64
 21  Flow_Rate_Lup   252 non-null    float64
 22  Flow_m3_7       252 non-null    float64
 23  R_F_cumdif      252 non-null    float64
 24  Flow_log        253 non-null    float64
 25  Flow_logdelta   252 non-null    float64
dtypes: float64(26)
memory usage: 63.6+ KB

array([[  0],
       [  1],
       [  2],
       ...,
       [251],
       [252],
       [253]])

2021-03-18 15:34:49,390 [9280] WARNING  py.warnings:109: [JupyterRequire] <ipython-input-59-6843ccf2dc65>:1: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Freefall2017A["alphac"]= Freefall2017A.Flow_logdelta /Freefall2017A.timedelta

2021-03-18 15:34:49,390 [9280] WARNING  py.warnings:109: [JupyterRequire] <ipython-input-59-6843ccf2dc65>:2: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Freefall2009B["alphac"]= Freefall2009B.Flow_logdelta /Freefall2009B.timedelta

(0.0, 0.01)

0.004574706739698717

Date
2009-08-01    4.69e-03
2009-08-02    4.71e-03
2009-08-03    4.72e-03
2009-08-04    4.70e-03
2009-08-05    4.69e-03
2009-08-06    4.67e-03
2009-08-07    4.66e-03
2009-08-08    4.65e-03
2009-08-09    4.65e-03
2009-08-10    4.69e-03
2009-08-11    4.70e-03
2009-08-12    4.71e-03
2009-08-13    4.70e-03
2009-08-14    4.71e-03
2009-08-15    4.72e-03
2009-08-16    4.73e-03
2009-08-17    4.76e-03
2009-08-18    4.80e-03
2009-08-19    4.81e-03
2009-08-20    4.81e-03
2009-08-21    4.81e-03
2009-08-22    4.83e-03
2009-08-23    4.85e-03
2009-08-24    4.89e-03
2009-08-25    4.90e-03
2009-08-26    4.90e-03
2009-08-28    4.90e-03
2009-08-29    4.91e-03
2009-08-30    4.92e-03
2009-08-31    4.92e-03
Name: alphac, dtype: float64

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4199 entries, 2009-01-01 to 2020-06-30
Data columns (total 5 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  4199 non-null   float64
 1   Flow_Rate_Lupa  3817 non-null   float64
 2   doy             4199 non-null   int64  
 3   Month           4199 non-null   int64  
 4   Year            4199 non-null   int64  
dtypes: float64(2), int64(3)
memory usage: 325.9 KB

2017-07-15 00:00:00 2017-08-31 00:00:00
0.18500000000000227 41.165 [0.16 0.14 0.16 ... 0.2  0.21 0.19] [50.05 49.89 49.73 ... 41.55 41.38 41.16]

<AxesSubplot:>

2016-07-15 00:00:00 2016-08-31 00:00:00
0.6649999999999991 104.10499999999999 [0.61 0.62 0.72 ... 0.64 0.68 0.66] [137.86 137.2  136.53 ... 105.41 104.76 104.1 ]

<AxesSubplot:>

2015-07-15 00:00:00 2015-08-31 00:00:00
0.375 75.975 [0.58 0.52 0.56 ... 0.64 0.56 0.38] [99.91 99.27 98.77 ... 76.98 76.41 75.97]

<AxesSubplot:>

2014-07-15 00:00:00 2014-08-31 00:00:00
0.9149999999999991 123.845 [0.98 1.12 1.19 ... 0.77 0.89 0.91] [161.62 160.53 159.45 ... 125.53 124.7  123.84]

<AxesSubplot:>

10.66

time(UTC+1)
00:00    7.79
01:00    7.79
02:00    7.79
03:00    7.79
04:00    7.79
05:00    7.79
06:00    7.79
07:00    7.79
08:00    7.79
09:00    7.79
10:00    7.79
11:00    7.79
12:00    7.79
13:00    7.79
14:00    7.79
15:00    7.79
16:00    7.79
17:00    7.79
18:00    7.79
19:00    7.79
20:00    7.79
21:00    7.79
22:00    7.79
23:00    7.79
Name: MJ_m2d, dtype: float64

XGBRegressor(base_score=None, booster='gbtree', colsample_bylevel=None,
             colsample_bynode=None, colsample_bytree=None,
             enable_categorical=False, gamma=None, gpu_id=None,
             importance_type=None, interaction_constraints=None,
             learning_rate=0.001, max_delta_step=None, max_depth=9,
             min_child_weight=1, missing=nan, monotone_constraints=None,
             n_estimators=5000, n_jobs=3, num_parallel_tree=None,
             predictor=None, random_state=42, reg_alpha=None, reg_lambda=None,
             scale_pos_weight=None, subsample=None, tree_method=None,
             validate_parameters=None, verbosity=None)

XGBRegressor(base_score=None, booster='gbtree', colsample_bylevel=None,
             colsample_bynode=None, colsample_bytree=None,
             enable_categorical=False, gamma=None, gpu_id=None,
             importance_type=None, interaction_constraints=None,
             learning_rate=0.001, max_delta_step=None, max_depth=9,
             min_child_weight=1, missing=nan, monotone_constraints=None,
             n_estimators=5000, n_jobs=3, num_parallel_tree=None,
             predictor=None, random_state=42, reg_alpha=None, reg_lambda=None,
             scale_pos_weight=None, subsample=None, tree_method=None,
             validate_parameters=None, verbosity=None)

C:\Users\VanOp\.conda\envs\rioxarray_env\lib\site-packages\xgboost\data.py:262: FutureWarning: pandas.Int64Index is deprecated and will be removed from pandas in a future version. Use pandas.Index with the appropriate dtype instead.
  elif isinstance(data.columns, (pd.Int64Index, pd.RangeIndex)):

array([0., 0., 0., ..., 0., 0., 0.], dtype=float32)

Index(['Rainfall_Terni', 'doy', 'Month', 'Year', 'ET01', 'Infilt_',
       'Infiltsum', 'Infilt_m3', 'P5', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg',
       'Rainfall_720'],
      dtype='object')

R2 score on test data is 99.93% with mean error of 0.69

C:\Users\VanOp\.conda\envs\rioxarray_env\lib\site-packages\xgboost\data.py:262: FutureWarning: pandas.Int64Index is deprecated and will be removed from pandas in a future version. Use pandas.Index with the appropriate dtype instead.
  elif isinstance(data.columns, (pd.Int64Index, pd.RangeIndex)):

<AxesSubplot:xlabel='Date'>

4.090551181102362

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 11 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  3834 non-null   float64
 1   Flow_Rate_Lupa  3834 non-null   float64
 2   doy             3834 non-null   int64  
 3   Month           3834 non-null   int64  
 4   Year            3834 non-null   int64  
 5   ET01            3834 non-null   float64
 6   Week            3834 non-null   int64  
 7   Moist           3834 non-null   int32  
 8   RainyDay5       3834 non-null   float64
 9   RainyDay35      3834 non-null   float64
 10  RainyDay365     3834 non-null   float64
dtypes: float64(6), int32(1), int64(4)
memory usage: 344.5 KB

Date
2010-01-01       2.89
2010-01-02       2.89
2010-01-03       2.89
2010-01-04       2.89
2010-01-05       2.89
               ...   
2010-12-27       2.89
2010-12-28       2.89
2010-12-29       2.89
2010-12-30       2.89
2010-12-31    1730.40
Name: Rainfall_365, Length: 365, dtype: float64

1095

<class 'pandas.core.frame.DataFrame'>
Index: 3833 entries, 2010-01-01 00:00:00 to 2020-06-29 00:00:00
Data columns (total 39 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Rainfall_Terni                            3833 non-null   float64
 1   Flow_Rate_Lupa                            3833 non-null   float64
 2   doy                                       3833 non-null   float64
 3   Month                                     3833 non-null   float64
 4   Year                                      3833 non-null   float64
 5   ET01                                      3833 non-null   float64
 6   Infilt_                                   3833 non-null   float64
 7   Infiltsum                                 3833 non-null   float64
 8   Rainfall_Ter                              3833 non-null   float64
 9   Flow_Rate_Lup                             3833 non-null   float64
 10  Infilt_m3                                 3833 non-null   float64
 11  P5                                        3833 non-null   float64
 12  Week                                      3833 non-null   float64
 13  log_Flow                                  3833 non-null   float64
 14  Lupa_Mean99_2011                          3833 non-null   float64
 15  Rainfall_Terni_minET                      3833 non-null   float64
 16  Infiltrate                                3833 non-null   float64
 17  log_Flow_10d                              3833 non-null   float64
 18  log_Flow_20d                              3833 non-null   float64
 19  α10                                       3833 non-null   float64
 20  α20                                       3833 non-null   float64
 21  log_Flow_10d_dif                          3833 non-null   float64
 22  log_Flow_20d_dif                          3833 non-null   float64
 23  α10_30                                    3833 non-null   float64
 24  Infilt_7YR                                3833 non-null   float64
 25  Infilt_2YR                                3833 non-null   float64
 26  α1                                        3833 non-null   float64
 27  α1_negatives                              3833 non-null   float64
 28  ro                                        3833 non-null   float64
 29  Infilt_M6                                 3833 non-null   float64
 30  Infilt_M6_diff                            3833 non-null   float64
 31  Rainfall_Terni_scale_12_calculated_index  3833 non-null   float64
 32  SMroot                                    3833 non-null   float64
 33  Neradebit                                 3833 non-null   float64
 34  smian                                     3833 non-null   float64
 35  DroughtIndex                              3833 non-null   float64
 36  Deficit                                   3833 non-null   float64
 37  PET_hg                                    3833 non-null   float64
 38  Rainfall_720                              3833 non-null   float64
dtypes: float64(39)
memory usage: 1.3+ MB

Index(['Date_excel', 'Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month',
       'Year', 'ET01', 'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5',
       'Flow_Rate_Lup', 'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index'],
      dtype='object')

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5', 'Flow_Rate_Lup',
       'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'DroughtIndex',
       'DI_12', 'DI_12_s'],
      dtype='object')

(3833, 19) (3833,)

Index(['Year', 'Infiltsum', 'Rainfall_Ter', 'Week', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'α10', 'Infilt_2YR', 'ro',
       'Infilt_M6', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg',
       'Rainfall_720'],
      dtype='object')

(3449, 19) (3449,)

'1.6.1'

array([0.07, 0.04, 0.05, 0.03, 0.04, 0.09, 0.09, 0.05, 0.04, 0.09, 0.09,
       0.05, 0.04, 0.04, 0.05, 0.03, 0.03, 0.04, 0.04], dtype=float32)

Index(['Year', 'Infiltsum', 'Rainfall_Ter', 'Week', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'α10', 'Infilt_2YR', 'ro',
       'Infilt_M6', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg',
       'Rainfall_720'],
      dtype='object')

R2 score on test data is -255.51% with mean error of 0.28

Mean Absolute Percentage Error (MAPE): 3.06
Accuracy: 96.94

(3834, 11)

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 11 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  3834 non-null   float64
 1   Flow_Rate_Lupa  3834 non-null   float64
 2   doy             3834 non-null   int64  
 3   Month           3834 non-null   int64  
 4   Year            3834 non-null   int64  
 5   ET01            3834 non-null   float64
 6   Week            3834 non-null   int64  
 7   Moist           3834 non-null   int32  
 8   RainyDay5       3834 non-null   float64
 9   RainyDay35      3834 non-null   float64
 10  RainyDay365     3834 non-null   float64
dtypes: float64(6), int32(1), int64(4)
memory usage: 504.5 KB

Index(['Date_excel', 'Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month',
       'Year', 'ET01', 'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5',
       'Flow_Rate_Lup', 'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index'],
      dtype='object')

(3833,) (3833, 20)

Date_excel
2020-06-23    74.88
2020-06-24    74.58
2020-06-25    74.29
2020-06-26    73.93
2020-06-27    73.60
2020-06-28    73.14
2020-06-29    72.88
Name: Flow_Rate_Lupa, dtype: float64

(3449, 20) (3449,) (384, 20) (384,)

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4353 entries, 2010-01-01 to 2021-12-01
Data columns (total 18 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Flow_Rate_Lupa                            3833 non-null   float64
 1   doy                                       3833 non-null   float64
 2   Month                                     3833 non-null   float64
 3   Year                                      3833 non-null   float64
 4   Infiltsum                                 3833 non-null   float64
 5   P5                                        3833 non-null   float64
 6   Infilt_m3                                 3833 non-null   float64
 7   Week                                      3833 non-null   float64
 8   Lupa_Mean99_2011                          3833 non-null   float64
 9   Rainfall_Terni_minET                      3833 non-null   float64
 10  Infiltrate                                3833 non-null   float64
 11  α20                                       3833 non-null   float64
 12  Infilt_2YR                                3833 non-null   float64
 13  α1_negatives                              3833 non-null   float64
 14  Rainfall_Terni_scale_12_calculated_index  3833 non-null   float64
 15  DroughtIndex                              4353 non-null   float64
 16  DI_12                                     4353 non-null   float64
 17  DI_12_s                                   4353 non-null   float64
dtypes: float64(18)
memory usage: 646.1 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4162 entries, 2010-01-01 to NaT
Data columns (total 25 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Rainfall_Terni                            3833 non-null   float64
 1   Flow_Rate_Lupa                            3833 non-null   float64
 2   doy                                       3833 non-null   float64
 3   Month                                     3833 non-null   float64
 4   Year                                      3833 non-null   float64
 5   Infiltsum                                 3833 non-null   float64
 6   P5                                        3833 non-null   float64
 7   Week                                      3833 non-null   float64
 8   Lupa_Mean99_2011                          3833 non-null   float64
 9   Rainfall_Terni_minET                      3833 non-null   float64
 10  Infiltrate                                3833 non-null   float64
 11  α10                                       3833 non-null   float64
 12  α20                                       3833 non-null   float64
 13  α10_30                                    3804 non-null   float64
 14  Infilt_2YR                                3833 non-null   float64
 15  α1_negatives                              3833 non-null   float64
 16  ro                                        3833 non-null   float64
 17  Infilt_M6                                 3833 non-null   float64
 18  Rainfall_Terni_scale_12_calculated_index  3833 non-null   float64
 19  SMroot                                    3833 non-null   float64
 20  Neradebit                                 3833 non-null   float64
 21  smian                                     4008 non-null   float64
 22  DroughtIndex                              4139 non-null   float64
 23  Deficit                                   3988 non-null   float64
 24  PET_hg                                    4162 non-null   float64
dtypes: float64(25)
memory usage: 845.4 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3804 entries, 2010-01-16 to 2020-06-15
Data columns (total 25 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Rainfall_Terni                            3804 non-null   float64
 1   Flow_Rate_Lupa                            3804 non-null   float64
 2   doy                                       3804 non-null   float64
 3   Month                                     3804 non-null   float64
 4   Year                                      3804 non-null   float64
 5   Infiltsum                                 3804 non-null   float64
 6   P5                                        3804 non-null   float64
 7   Week                                      3804 non-null   float64
 8   Lupa_Mean99_2011                          3804 non-null   float64
 9   Rainfall_Terni_minET                      3804 non-null   float64
 10  Infiltrate                                3804 non-null   float64
 11  α10                                       3804 non-null   float64
 12  α20                                       3804 non-null   float64
 13  α10_30                                    3804 non-null   float64
 14  Infilt_2YR                                3804 non-null   float64
 15  α1_negatives                              3804 non-null   float64
 16  ro                                        3804 non-null   float64
 17  Infilt_M6                                 3804 non-null   float64
 18  Rainfall_Terni_scale_12_calculated_index  3804 non-null   float64
 19  SMroot                                    3804 non-null   float64
 20  Neradebit                                 3804 non-null   float64
 21  smian                                     3804 non-null   float64
 22  DroughtIndex                              3804 non-null   float64
 23  Deficit                                   3804 non-null   float64
 24  PET_hg                                    3804 non-null   float64
dtypes: float64(25)
memory usage: 772.7 KB

Index(['Rainfall_Terni', 'doy', 'Month', 'Year', 'Infiltsum', 'P5', 'Week',
       'Lupa_Mean99_2011', 'Rainfall_Terni_minET', 'Infiltrate', 'α10', 'α20',
       'α10_30', 'Infilt_2YR', 'α1_negatives', 'ro', 'Infilt_M6',
       'Rainfall_Terni_scale_12_calculated_index', 'SMroot', 'Neradebit',
       'smian', 'DroughtIndex', 'Deficit', 'PET_hg'],
      dtype='object')

{'importances_mean': array([ 0.  ,  0.12,  0.01, ...,  0.01,  0.06, -0.  ]),
 'importances_std': array([0.  , 0.01, 0.  , ..., 0.  , 0.05, 0.  ]),
 'importances': array([[ 0.  ,  0.  ,  0.  , ...,  0.  ,  0.  ,  0.  ],
        [ 0.13,  0.12,  0.14, ...,  0.11,  0.14,  0.14],
        [ 0.01,  0.01,  0.01, ...,  0.01,  0.01,  0.01],
        ...,
        [ 0.01,  0.01,  0.01, ...,  0.01,  0.02,  0.02],
        [ 0.1 ,  0.01,  0.12, ...,  0.14, -0.01,  0.05],
        [ 0.  ,  0.  ,  0.  , ..., -0.  ,  0.  , -0.  ]])}

sklearn.utils.Bunch

numpy.ndarray

24

               0             1
1   2.098973e-04  5.214583e-05
2   1.231481e-01  1.486333e-02
3   6.497371e-03  7.457938e-04
4   1.332268e-16  2.035072e-16
5   1.209906e-01  7.968725e-03
6   6.930338e-02  1.196907e-02
7   2.149503e-02  3.599439e-03
8   3.416554e-01  3.196317e-02
9   1.002299e-04  2.733726e-05
10  9.394863e-05  3.494130e-05
11  7.612652e-03  5.188572e-03
12  5.532388e-07  1.140896e-06
13  9.559144e-07  1.816442e-06
14 -2.860976e-03  8.823191e-04
15  1.246019e-03  7.126765e-04
16  3.163906e-05  2.237637e-05
17  1.936885e-04  4.878547e-05
18  1.430110e-01  2.861212e-02
19  6.415342e-02  1.120721e-01
20  4.187867e-02  8.493755e-03
21  1.910810e-01  6.077271e-02
22  1.422894e-02  4.694185e-03
23  6.058065e-02  4.547789e-02
24 -5.630858e-04  2.365746e-03

Index(['Rainfall_Terni', 'doy', 'Month', 'Year', 'Infiltsum', 'P5', 'Week',
       'Lupa_Mean99_2011', 'Rainfall_Terni_minET', 'Infiltrate', 'α10', 'α20',
       'α10_30', 'Infilt_2YR', 'α1_negatives', 'ro', 'Infilt_M6',
       'Rainfall_Terni_scale_12_calculated_index', 'SMroot', 'Neradebit',
       'smian', 'DroughtIndex', 'Deficit', 'PET_hg'],
      dtype='object')

numpy.ndarray

numpy.ndarray

array([[-0.39, -1.55, -1.56, ...,  0.43, -0.5 , -1.12],
       [-0.04, -1.54, -1.56, ...,  0.43, -0.5 , -1.46],
       [-0.39, -1.53, -1.56, ...,  0.43, -0.5 , -1.08],
       [-0.39, -1.52, -1.56, ...,  0.43, -0.5 , -1.03],
       [-0.39, -1.51, -1.56, ...,  0.43, -0.5 , -1.06]])

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    0.2s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:    1.3s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:    3.2s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:    5.8s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:    9.0s
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:   13.1s
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:   17.8s
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:   23.3s
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:   29.6s
[Parallel(n_jobs=3)]: Done 4994 tasks      | elapsed:   36.6s
[Parallel(n_jobs=3)]: Done 6044 tasks      | elapsed:   44.3s
[Parallel(n_jobs=3)]: Done 6300 out of 6300 | elapsed:   46.2s finished

RandomForestRegressor(max_features=24, min_samples_split=4, n_estimators=6300,
                      n_jobs=3, random_state=1100, verbose=1)

Feature ranking:
1. feature 4 (0.492333)
2. feature 13 (0.116826)
3. feature 3 (0.086711)
4. feature 20 (0.080009)
5. feature 1 (0.064020)
6. feature 7 (0.035143)
7. feature 18 (0.032208)
8. feature 10 (0.027808)
9. feature 17 (0.026450)
10. feature 6 (0.014625)
11. feature 19 (0.007935)
12. feature 22 (0.007026)
13. feature 2 (0.003878)
14. feature 21 (0.003086)
15. feature 5 (0.000774)
16. feature 23 (0.000772)
17. feature 14 (0.000246)
18. feature 0 (0.000037)
19. feature 15 (0.000030)
20. feature 16 (0.000025)
21. feature 8 (0.000020)
22. feature 9 (0.000019)
23. feature 11 (0.000017)
24. feature 12 (0.000000)

[(0, 'Rainfall_Terni'), (1, 'doy'), (2, 'Month'), (3, 'Year'), (4, 'Infiltsum'), (5, 'P5'), (6, 'Week'), (7, 'Lupa_Mean99_2011'), (8, 'Rainfall_Terni_minET'), (9, 'Infiltrate'), (10, 'α10'), (11, 'α20'), (12, 'α10_30'), (13, 'Infilt_2YR'), (14, 'α1_negatives'), (15, 'ro'), (16, 'Infilt_M6'), (17, 'Rainfall_Terni_scale_12_calculated_index'), (18, 'SMroot'), (19, 'Neradebit'), (20, 'smian'), (21, 'DroughtIndex'), (22, 'Deficit'), (23, 'PET_hg')]

24

RandomForestRegressor(max_features=24, min_samples_split=4, n_estimators=6300,
                      n_jobs=3, random_state=1100, verbose=1)

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:    0.2s
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:    0.3s
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:    0.4s
[Parallel(n_jobs=3)]: Done 4994 tasks      | elapsed:    0.5s
[Parallel(n_jobs=3)]: Done 6044 tasks      | elapsed:    0.6s
[Parallel(n_jobs=3)]: Done 6300 out of 6300 | elapsed:    0.7s finished

-1.4967937735167514

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:    0.2s
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:    0.3s
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:    0.4s
[Parallel(n_jobs=3)]: Done 4994 tasks      | elapsed:    0.5s
[Parallel(n_jobs=3)]: Done 6044 tasks      | elapsed:    0.6s
[Parallel(n_jobs=3)]: Done 6300 out of 6300 | elapsed:    0.6s finished

-1.4967937735167514

(381,)

2019-06-01    114.74
2019-06-02    116.56
2019-06-03    118.29
2019-06-04    119.84
2019-06-05    121.34
Name: Flow_Rate_Lupa, dtype: float64

[107.1  108.12 107.16 ...  99.85  98.93  98.84]

2019-06-01    114.74
2019-06-02    116.56
2019-06-03    118.29
2019-06-04    119.84
2019-06-05    121.34
               ...  
2020-06-11     79.12
2020-06-12     78.63
2020-06-13     78.29
2020-06-14     77.90
2020-06-15     77.43
Name: Flow_Rate_Lupa, Length: 381, dtype: float64 <class 'pandas.core.series.Series'>

Mean Absolute Error: 19.769157906397744
Mean Squared Error: 664.2064720546967
Root Mean Squared Error: 25.77220347689923
Mean Absolute Percentage Error (MAPE): 21.84
Accuracy: 78.16

381

(381,) (381,)

(3834, 12)

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 8 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  3834 non-null   float64
 1   Flow_Rate_Lupa  3834 non-null   float64
 2   doy             3834 non-null   int64  
 3   Month           3834 non-null   int64  
 4   Year            3834 non-null   int64  
 5   ET01            3834 non-null   float64
 6   Infilt_         3834 non-null   float64
 7   Week            3834 non-null   UInt32 
dtypes: UInt32(1), float64(4), int64(3)
memory usage: 418.3 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3834 entries, 2010-01-01 to 2020-06-30
Data columns (total 8 columns):
 #   Column          Non-Null Count  Dtype  
---  ------          --------------  -----  
 0   Rainfall_Terni  3834 non-null   float64
 1   Flow_Rate_Lupa  3834 non-null   float64
 2   doy             3834 non-null   int64  
 3   Month           3834 non-null   int64  
 4   Year            3834 non-null   int64  
 5   ET01            3834 non-null   float64
 6   Infilt_         3834 non-null   float64
 7   Week            3834 non-null   UInt32 
dtypes: UInt32(1), float64(4), int64(3)
memory usage: 258.3 KB

2.891575378195096

0.0

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5', 'Flow_Rate_Lup',
       'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg',
       'Flow_Rate_root', 'Rainfall_shi_3d'],
      dtype='object')

(4162,) (4162, 15)

Date_excel
2020-06-28    8.552193
2020-06-29    8.536978
Name: Flow_Rate_root, dtype: float64

Date_excel
2019-06-12    11.390347
2019-06-13    11.421033
2019-06-14    11.447270
2019-06-15    11.471268
2019-06-16    11.486514
                ...    
2020-06-25     8.619165
2020-06-26     8.598256
2020-06-27     8.579044
2020-06-28     8.552193
2020-06-29     8.536978
Name: Flow_Rate_root, Length: 384, dtype: float64

(3449, 14) (3449,) (384, 14) (384,)

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    2.6s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:   11.7s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:   29.7s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:   53.2s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:  1.4min
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:  2.0min
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:  2.7min
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:  3.6min
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:  4.5min
[Parallel(n_jobs=3)]: Done 4500 out of 4500 | elapsed:  5.0min finished

ExtraTreesRegressor(criterion='absolute_error', max_depth=15,
                    min_samples_leaf=5, min_samples_split=4, n_estimators=4500,
                    n_jobs=3, random_state=1100, verbose=1)

ExtraTreesRegressor(criterion='absolute_error', max_depth=15,
                    min_samples_leaf=5, min_samples_split=4, n_estimators=4500,
                    n_jobs=3, random_state=1100, verbose=1)

Feature ranking:
1. feature 1 (0.273538)
2. feature 0 (0.195829)
3. feature 6 (0.103642)
4. feature 7 (0.101302)
5. feature 10 (0.100236)
6. feature 5 (0.062661)
7. feature 8 (0.053130)
8. feature 12 (0.030308)
9. feature 9 (0.028147)
10. feature 11 (0.026929)
11. feature 2 (0.009204)
12. feature 13 (0.008235)
13. feature 4 (0.004022)
14. feature 3 (0.002818)

[(0, 'Year'), (1, 'Infiltsum'), (2, 'Rainfall_Ter'), (3, 'P5'), (4, 'Infilt_m3'), (5, 'Week'), (6, 'Lupa_Mean99_2011'), (7, 'Infilt_2YR'), (8, 'SMroot'), (9, 'Neradebit'), (10, 'smian'), (11, 'DroughtIndex'), (12, 'Deficit'), (13, 'PET_hg'), (14, 'Rainfall_shi_3d')]

Feature ranking:
1. feature 2 (0.271921)
2. feature 0 (0.177159)
3. feature 9 (0.106217)
4. feature 14 (0.096870)
5. feature 6 (0.064012)
6. feature 19 (0.052880)
7. feature 12 (0.041068)
8. feature 16 (0.032990)
9. feature 20 (0.030165)
10. feature 21 (0.025331)
11. feature 15 (0.024527)
12. feature 18 (0.024177)
13. feature 17 (0.021384)
14. feature 13 (0.016362)
15. feature 3 (0.005768)
16. feature 1 (0.003542)
17. feature 5 (0.002234)
18. feature 10 (0.002120)
19. feature 4 (0.001081)
20. feature 8 (0.000103)
21. feature 11 (0.000054)
22. feature 7 (0.000034)

[(0, 'Year'), (1, 'ET01'), (2, 'Infiltsum'), (3, 'Rainfall_Ter'), (4, 'P5'), (5, 'Infilt_m3'), (6, 'Lupa_Mean99_2011'), (7, 'Rainfall_Terni_minET'), (8, 'Infiltrate'), (9, 'Infilt_2YR'), (10, 'α1_negatives'), (11, 'Infilt_M6'), (12, 'SMroot'), (13, 'Neradebit'), (14, 'smian'), (15, 'DroughtIndex'), (16, 'doy_sin'), (17, 'doy_cos'), (18, 'Month_sin'), (19, 'Month_cos'), (20, 'Week_sin'), (21, 'Week_cos')]

22

ExtraTreesRegressor(criterion='absolute_error', max_depth=14,
                    min_samples_leaf=5, min_samples_split=4, n_estimators=4500,
                    n_jobs=3, random_state=1100, verbose=1)

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:    0.2s
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:    0.3s
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:    0.4s
[Parallel(n_jobs=3)]: Done 4500 out of 4500 | elapsed:    0.5s finished

-0.13045741864455063

(384,)

Date_excel
2019-06-12    11.390347
2019-06-13    11.421033
2019-06-14    11.447270
2019-06-15    11.471268
2019-06-16    11.486514
2019-06-17    11.499565
2019-06-18    11.515642
2019-06-19    11.524756
2019-06-20    11.525190
2019-06-21    11.482596
Name: Flow_Rate_root, dtype: float64

[11.18 11.15 11.13 ... 11.11 11.09 11.06]

Mean Absolute Error: 0.6861577503036583
Mean Squared Error: 0.7671886632465948
Root Mean Squared Error: 0.8758930661025893
Mean Absolute Percentage Error (MAPE): 7.17
Accuracy: 92.83

Mean Absolute Error: 0.6861577503036583
Mean Squared Error: 0.7671886632465948
Root Mean Squared Error: 0.8758930661025893
Mean Absolute Percentage Error (MAPE): 7.17
Accuracy: 92.83

array([[   0. ,  164. ,    6. , ..., 1008.7, 2886.7,  157.4],
       [   0. ,  165. ,    6. , ..., 1008.7, 2874.5,  157.4],
       [   0.2,  166. ,    6. , ..., 1008.9, 2874.7,  157.6],
       ...,
       [   0. ,  180. ,    6. , ...,  995.2, 3022.1,   81. ],
       [   0. ,  181. ,    6. , ...,  995.2, 3004.3,   81. ],
       [   0. ,  182. ,    6. , ...,  995.2, 3004.3,   81. ]])

<AxesSubplot:xlabel='Data'>

<class 'pandas.core.frame.DataFrame'>
Index: 3833 entries, 2010-01-01 00:00:00 to 2020-06-29 00:00:00
Data columns (total 24 columns):
 #   Column                Non-Null Count  Dtype         
---  ------                --------------  -----         
 0   Rainfall_Terni        3833 non-null   float64       
 1   Flow_Rate_Lupa        3833 non-null   float64       
 2   doy                   3833 non-null   float64       
 3   Month                 3833 non-null   float64       
 4   Year                  3833 non-null   float64       
 5   ET01                  3833 non-null   float64       
 6   Infilt_               3833 non-null   float64       
 7   Infiltsum             3833 non-null   float64       
 8   Rainfall_Ter          3833 non-null   float64       
 9   Flow_Rate_Lup         3833 non-null   float64       
 10  Infilt_m3             3833 non-null   float64       
 11  Week                  3833 non-null   float64       
 12  Date_excel            3833 non-null   datetime64[ns]
 13  log_Flow              3833 non-null   float64       
 14  Lupa_Mean99_2011      3833 non-null   float64       
 15  Rainfall_Terni_minET  3833 non-null   float64       
 16  Infiltrate            3833 non-null   float64       
 17  log_Flow_10d          3833 non-null   float64       
 18  log_Flow_20d          3833 non-null   float64       
 19  α10                   3833 non-null   float64       
 20  α20                   3833 non-null   float64       
 21  log_Flow_10d_dif      3833 non-null   float64       
 22  log_Flow_20d_dif      3833 non-null   float64       
 23  α10_30                3804 non-null   float64       
dtypes: datetime64[ns](1), float64(23)
memory usage: 748.6+ KB

<matplotlib.lines.Line2D at 0x18d093a48e0>

Index(['Rainfall_Terni', 'Flow_Rate_Lupa', 'doy', 'Month', 'Year', 'ET01',
       'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5', 'Flow_Rate_Lup',
       'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex'],
      dtype='object')

86.33729205805807

0.06924470011718334

0.9999997887500074 0.9999559424390152

0.0006499999084583565 0.009386724300334744

2737.5

<AxesSubplot:ylabel='Frequency'>

Infiltrate        0
Flow_Rate_Lupa    0
Flow_Rate_Lup     0
dtype: int64

Index(['Infiltrate', 'Flow_Rate_Lupa', 'log_Flow'], dtype='object')

[]

Index(['Infiltrate', 'Flow_Rate_Lupa', 'log_Flow', 'log_Flow1', 'log_Flow2',
       'log_Flow3', 'log_Flow4', 'log_Flow5', 'log_Flow6', 'log_Flow7',
       ...
       'log_Flow310', 'log_Flow311', 'log_Flow312', 'log_Flow313',
       'log_Flow314', 'log_Flow315', 'log_Flow316', 'log_Flow317',
       'log_Flow318', 'log_Flow319'],
      dtype='object', length=322)

Date_excel
2010-01-01    87.08
2010-02-01    95.84
2010-03-01    55.21
2010-04-01    42.12
2010-05-01    92.15
              ...  
2020-02-01    19.14
2020-03-01    39.64
2020-04-01    29.49
2020-05-01    16.01
2020-06-01    27.63
Freq: MS, Name: Infiltrate, Length: 126, dtype: float64

0.024051757400813913

<AxesSubplot:>

Rainfall_Terni    0
Flow_Rate_Lupa    0
Flow_Rate_Lup     0
dtype: int64

[]

Date_excel
2011-12-01    122.4
2012-01-01     38.8
2012-02-01     46.8
2012-03-01      4.6
2012-04-01    161.0
              ...  
2020-02-01     38.4
2020-03-01     71.4
2020-04-01     51.8
2020-05-01     57.8
2020-06-01     68.2
Freq: MS, Name: Rainfall_Terni, Length: 103, dtype: float64

Data
2009-01-01    4227.85
2009-02-01    4421.84
2009-03-01    5569.66
2009-04-01    5390.40
2009-05-01    3255.86
2009-06-01    4398.44
2009-07-01    3942.35
2009-08-01    3263.23
2009-09-01    2788.74
2009-10-01    1415.69
2009-11-01    2167.66
2009-12-01    2209.44
Freq: MS, Name: Portata, dtype: float64

DatetimeIndex(['2009-01-01', '2009-02-01', '2009-03-01', '2009-04-01',
               '2009-05-01', '2009-06-01', '2009-07-01', '2009-08-01',
               '2009-09-01', '2009-10-01',
               ...
               '2020-07-01', '2020-08-01', '2020-09-01', '2020-10-01',
               '2020-11-01', '2020-12-01', '2021-01-01', '2021-02-01',
               '2021-03-01', '2021-04-01'],
              dtype='datetime64[ns]', name='Date_excel', length=148, freq='MS')

Data
2009-01-01    4227.85
2009-02-01    4421.84
2009-03-01    5569.66
2009-04-01    5390.40
2009-05-01    3255.86
2009-06-01    4398.44
2009-07-01    3942.35
2009-08-01    3263.23
2009-09-01    2788.74
2009-10-01    1415.69
2009-11-01    2167.66
2009-12-01    2209.44
Freq: MS, Name: Flow_Rate_Lupa, dtype: float64

0.04079979649228839
-0.03188024511655861
-0.08573595288340796
-0.07830331345256117
-0.05310781514279004
-0.029061123471172543
-0.02443047413120684
-0.023899904761460318
-0.03369664506577493
-0.07748397365392057
-0.09201421161429409
-0.13476956565086048
-0.1851365509035896
-0.2286362926705608
-0.19884224183377563
-0.13933725753034013
-0.1226051810834394
-0.06656266626777582
-0.0269921960906338
-0.02408563367293273
-0.02470199878818502
0.028017444223247822
-0.008291151308897633

<AxesSubplot:>

0

C:\Users\Kurt\AppData\Local\Temp\ipykernel_13124\3760169820.py:8: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Lupa_excel['Infilt_M6'] = Lupa_excel.apply(lambda row: infiltration_M6(row), axis=1)

C:\Users\Kurt\AppData\Local\Temp\ipykernel_13124\180528490.py:2: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  Lupa_excel["Infilt_M6"]= np.where( Lupa_excel["Infilt_M6"]<0,0, Lupa_excel["Infilt_M6"])

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 3859 entries, 2010-01-01 to NaT
Data columns (total 28 columns):
 #   Column                Non-Null Count  Dtype  
---  ------                --------------  -----  
 0   Rainfall_Terni        3833 non-null   float64
 1   Flow_Rate_Lupa        3833 non-null   float64
 2   doy                   3833 non-null   float64
 3   Month                 3833 non-null   float64
 4   Year                  3833 non-null   float64
 5   ET01                  3833 non-null   float64
 6   Infilt_               3833 non-null   float64
 7   Infiltsum             3833 non-null   float64
 8   Rainfall_Ter          3833 non-null   float64
 9   P5                    3859 non-null   float64
 10  Flow_Rate_Lup         3833 non-null   float64
 11  Infilt_m3             3833 non-null   float64
 12  Week                  3833 non-null   float64
 13  log_Flow              3833 non-null   float64
 14  Lupa_Mean99_2011      3833 non-null   float64
 15  Rainfall_Terni_minET  3833 non-null   float64
 16  Infiltrate            3833 non-null   float64
 17  log_Flow_10d          3859 non-null   float64
 18  log_Flow_20d          3859 non-null   float64
 19  α10                   3833 non-null   float64
 20  α20                   3833 non-null   float64
 21  log_Flow_10d_dif      3833 non-null   float64
 22  log_Flow_20d_dif      3833 non-null   float64
 23  α10_30                3804 non-null   float64
 24  Infilt_7YR            3833 non-null   float64
 25  Infilt_2YR            3833 non-null   float64
 26  α1                    3833 non-null   float64
 27  α1_negatives          3833 non-null   float64
dtypes: float64(28)
memory usage: 874.3 KB

<class 'pandas.core.frame.DataFrame'>
DatetimeIndex: 4162 entries, 2010-01-01 to NaT
Data columns (total 38 columns):
 #   Column                                    Non-Null Count  Dtype  
---  ------                                    --------------  -----  
 0   Rainfall_Terni                            3833 non-null   float64
 1   Flow_Rate_Lupa                            3833 non-null   float64
 2   doy                                       3833 non-null   float64
 3   Month                                     3833 non-null   float64
 4   Year                                      3833 non-null   float64
 5   ET01                                      3833 non-null   float64
 6   Infilt_                                   3833 non-null   float64
 7   Infiltsum                                 3833 non-null   float64
 8   Rainfall_Ter                              3833 non-null   float64
 9   P5                                        3833 non-null   float64
 10  Flow_Rate_Lup                             3833 non-null   float64
 11  Infilt_m3                                 3833 non-null   float64
 12  Week                                      3833 non-null   float64
 13  log_Flow                                  3833 non-null   float64
 14  Lupa_Mean99_2011                          3833 non-null   float64
 15  Rainfall_Terni_minET                      3833 non-null   float64
 16  Infiltrate                                3833 non-null   float64
 17  log_Flow_10d                              3833 non-null   float64
 18  log_Flow_20d                              3833 non-null   float64
 19  α10                                       3833 non-null   float64
 20  α20                                       3833 non-null   float64
 21  log_Flow_10d_dif                          3833 non-null   float64
 22  log_Flow_20d_dif                          3833 non-null   float64
 23  α10_30                                    3804 non-null   float64
 24  Infilt_7YR                                3833 non-null   float64
 25  Infilt_2YR                                3833 non-null   float64
 26  α1                                        3833 non-null   float64
 27  α1_negatives                              3833 non-null   float64
 28  ro                                        3833 non-null   float64
 29  Infilt_M6                                 3833 non-null   float64
 30  Infilt_M6_diff                            3833 non-null   float64
 31  Rainfall_Terni_scale_12_calculated_index  3833 non-null   float64
 32  SMroot                                    3833 non-null   float64
 33  Neradebit                                 3833 non-null   float64
 34  smian                                     4008 non-null   float64
 35  DroughtIndex                              4139 non-null   float64
 36  Deficit                                   3988 non-null   float64
 37  PET_hg                                    4162 non-null   float64
dtypes: float64(38)
memory usage: 1.2 MB

Index(['Unnamed: 0', 'Date_excel', 'Rainfall_Terni', 'Flow_Rate_Lupa', 'doy',
       'Month', 'Year', 'ET01', 'Infilt_', 'Infiltsum', 'Rainfall_Ter', 'P5',
       'Flow_Rate_Lup', 'Infilt_m3', 'Week', 'log_Flow', 'Lupa_Mean99_2011',
       'Rainfall_Terni_minET', 'Infiltrate', 'log_Flow_10d', 'log_Flow_20d',
       'α10', 'α20', 'log_Flow_10d_dif', 'log_Flow_20d_dif', 'α10_30',
       'Infilt_7YR', 'Infilt_2YR', 'α1', 'α1_negatives', 'ro', 'Infilt_M6',
       'Infilt_M6_diff', 'Rainfall_Terni_scale_12_calculated_index', 'SMroot',
       'Neradebit', 'smian', 'DroughtIndex', 'Deficit', 'PET_hg', 'GWETTOP',
       'α1_OK', 'α4'],
      dtype='object')

C:\Users\VanOp\AppData\Local\Temp\ipykernel_16064\1855936153.py:1: SettingWithCopyWarning: 
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  values.Rainfall_Terni_minET = values.Rainfall_Terni_minET.rolling(90, min_periods=30).sum().fillna(values.Rainfall_Terni_minET.median() )

(3833,) (3833, 14)

3831    4.292375
3832    4.288814
Name: log_Flow, dtype: float64

3449    4.865532
3450    4.870913
3451    4.875503
3452    4.879691
3453    4.882347
          ...   
3828    4.307976
3829    4.303119
3830    4.298645
3831    4.292375
3832    4.288814
Name: log_Flow, Length: 384, dtype: float64

(3449, 14) (3449,) (384, 14) (384,)

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    2.6s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:   11.9s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:   27.2s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:   50.0s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:  1.3min
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:  1.9min
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:  2.5min
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:  3.3min
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:  4.2min
[Parallel(n_jobs=3)]: Done 4994 tasks      | elapsed:  5.2min
[Parallel(n_jobs=3)]: Done 6044 tasks      | elapsed:  6.4min
[Parallel(n_jobs=3)]: Done 7194 tasks      | elapsed:  7.6min
[Parallel(n_jobs=3)]: Done 8444 tasks      | elapsed:  8.9min
[Parallel(n_jobs=3)]: Done 8500 out of 8500 | elapsed:  8.9min finished

ExtraTreesRegressor(criterion='absolute_error', max_depth=19,
                    min_samples_leaf=3, min_samples_split=3, n_estimators=8500,
                    n_jobs=3, random_state=1100, verbose=1)

[(0, 'Rainfall_Terni_minET'), (1, 'Week'), (2, 'Month'), (3, 'Lupa_Mean99_2011'), (4, 'Infilt_M6'), (5, 'α1_OK'), (6, 'α10'), (7, 'SMroot'), (8, 'Neradebit'), (9, 'smian'), (10, 'DroughtIndex'), (11, 'Deficit'), (12, 'PET_hg'), (13, 'GWETTOP')]

14

ExtraTreesRegressor(criterion='absolute_error', max_depth=19,
                    min_samples_leaf=3, min_samples_split=3, n_estimators=8500,
                    n_jobs=3, random_state=1100, verbose=1)

[Parallel(n_jobs=3)]: Using backend ThreadingBackend with 3 concurrent workers.
[Parallel(n_jobs=3)]: Done  44 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 194 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 444 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 794 tasks      | elapsed:    0.0s
[Parallel(n_jobs=3)]: Done 1244 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 1794 tasks      | elapsed:    0.1s
[Parallel(n_jobs=3)]: Done 2444 tasks      | elapsed:    0.2s
[Parallel(n_jobs=3)]: Done 3194 tasks      | elapsed:    0.3s
[Parallel(n_jobs=3)]: Done 4044 tasks      | elapsed:    0.4s
[Parallel(n_jobs=3)]: Done 4994 tasks      | elapsed:    0.5s
[Parallel(n_jobs=3)]: Done 6044 tasks      | elapsed:    0.6s
[Parallel(n_jobs=3)]: Done 7194 tasks      | elapsed:    0.8s
[Parallel(n_jobs=3)]: Done 8444 tasks      | elapsed:    0.9s
[Parallel(n_jobs=3)]: Done 8500 out of 8500 | elapsed:    0.9s finished

-0.5157557944576399

(384,)

3449    4.865532
3450    4.870913
3451    4.875503
3452    4.879691
3453    4.882347
3454    4.884618
3455    4.887412
3456    4.888995
3457    4.889070
3458    4.881665
Name: log_Flow, dtype: float64

Mean Absolute Error: 0.15350245473821933
Mean Squared Error: 0.042698843785711114
Root Mean Squared Error: 0.20663698552222232
Mean Absolute Percentage Error (MAPE): 3.42
Accuracy: 96.58

[(0, 'Rainfall_Terni_minET'), (1, 'Week'), (2, 'Month'), (3, 'Lupa_Mean99_2011'), (4, 'Infilt_M6'), (5, 'α1_OK'), (6, 'α10'), (7, 'SMroot'), (8, 'Neradebit'), (9, 'smian'), (10, 'DroughtIndex'), (11, 'Deficit'), (12, 'PET_hg'), (13, 'GWETTOP')]

[0.2097965713219379,
 0.06792202647376755,
 0.07428773039335997,
 0.0974038842153177,
 0.0008356471314680961,
 0.015203932895454469,
 0.06719737686418081,
 0.09091478074594014,
 0.053506716612165564,
 0.15409055351287118,
 0.05196824707092393,
 0.03588412288803437,
 0.009109146134424892,
 0.07187926374015352]

[]

	Portata
Data
2010-12-18	189.60
2010-12-19	NaN
2010-12-20	191.03

	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020
2020-01-01	135.47	82.24	203.08	59.00	112.44	142.09	84.21	52.43	65.94	77.67	74.78	107.92
2020-01-02	135.24	88.90	203.68	58.75	112.31	141.89	83.68	52.36	65.69	80.26	74.64	108.04
2020-01-03	135.17	93.56	204.52	58.60	112.20	141.12	83.37	52.36	65.09	82.56	74.26	108.16
2020-01-04	134.87	96.63	205.48	58.55	112.28	140.69	82.97	52.57	64.72	84.72	74.03	108.28
2020-01-05	134.80	98.65	206.31	58.18	112.35	140.65	82.89	52.53	64.73	86.36	73.83	108.41
...	...	...	...	...	...	...	...	...	...	...	...	...
2020-12-27	76.73	199.84	59.97	112.28	143.02	85.10	53.12	67.04	65.93	75.83	105.88	NaN
2020-12-28	77.58	201.31	59.70	112.08	142.90	84.91	52.93	66.70	70.47	75.53	106.70	NaN
2020-12-29	78.18	202.14	59.31	112.18	142.67	84.69	52.83	66.62	73.81	75.29	107.37	NaN
2020-12-30	78.65	202.65	59.15	112.30	142.40	84.51	52.63	66.42	75.54	75.02	107.80	NaN
2020-12-31	NaN	NaN	NaN	112.33	NaN	NaN	NaN	66.17	NaN	NaN	NaN	NaN

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	ET01	Infilt_	Infiltsum	Rainfall_Ter	Flow_Rate_Lup	Infilt_m3	Week
Date
2010-01-01	3.27	136.20	16.0	1.0	2010.0	1.15	2.13	33.32	412398.00	11767.65	150293.32	7.39
2010-02-01	3.74	181.53	45.5	2.0	2010.0	1.44	2.30	101.46	471114.00	15684.62	167252.92	6.50
2010-03-01	2.51	234.50	75.0	3.0	2010.0	1.74	0.77	145.70	316008.00	20261.22	85275.81	10.74
2010-04-01	3.17	235.53	105.5	4.0	2010.0	2.30	0.86	170.11	398790.00	20349.45	103800.77	15.07
2010-05-01	4.10	239.19	136.0	5.0	2010.0	2.63	1.47	205.44	516600.00	20665.85	146546.67	19.42
...	...	...	...	...	...	...	...	...	...	...	...	...
2020-02-01	1.32	107.80	46.0	2.0	2020.0	1.75	-0.42	-488.36	166841.38	9314.04	16091.67	7.28
2020-03-01	2.30	103.03	76.0	3.0	2020.0	1.78	0.53	-465.05	290206.45	8901.57	71997.11	11.58
2020-04-01	1.73	97.95	106.5	4.0	2020.0	2.28	-0.55	-495.90	217560.00	8463.20	20912.97	15.93
2020-05-01	1.86	88.32	137.0	5.0	2020.0	3.03	-1.17	-518.43	234929.03	7630.93	2665.01	20.26
2020-06-01	2.27	77.50	167.5	6.0	2020.0	3.49	-1.22	-528.31	286440.00	6695.88	10401.15	24.67

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	Diff	pct_ch	Flow_log	Flow_log_pct_ch
Date
2009-06-07	21.43	1088.03	1085	42	14063	-3.14	-2.02	35.37	-0.40
2009-06-14	21.43	1050.71	1134	42	14063	-5.60	-3.71	35.13	-0.74
2009-06-21	21.43	1010.76	1183	42	14063	-5.66	-3.90	34.86	-0.78
2009-06-28	21.43	976.24	1232	42	14063	-4.52	-3.23	34.61	-0.65
2009-07-05	12.50	943.78	1281	47	14063	-3.84	-2.83	34.38	-0.57
...	...	...	...	...	...	...	...	...	...
2019-12-01	17.80	631.07	2324	78	14133	0.74	0.82	31.59	0.18
2019-12-08	15.60	635.77	2373	84	14133	0.08	0.09	31.64	0.02
2019-12-15	19.80	636.98	2422	84	14133	0.27	0.30	31.65	0.06
2019-12-22	49.60	643.10	2471	84	14133	4.72	5.15	31.72	1.10
2019-12-29	0.80	727.19	2520	84	14133	10.90	10.91	32.57	2.31

	coef	std err	z	P>\|z\|	[0.025	0.975]
ar.L1	0.6446	0.042	15.280	0.000	0.562	0.727
ma.L1	0.2685	0.056	4.791	0.000	0.159	0.378
sigma2	1630.9933	18.660	87.408	0.000	1594.421	1667.565

	Date	Rainfall_Terni	Flow_Rate_Lupa
0	01/01/2009	2.8	NaN
1	02/01/2009	2.8	NaN
2	03/01/2009	2.8	NaN
3	04/01/2009	2.8	NaN
4	05/01/2009	2.8	NaN

	Date	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	ET01	Infilt_	Infiltsum	Rainfall_Ter	Flow_Rate_Lup	Infilt_m3
0	2009-01-01	2.8	135.47	1.0	1.0	2009.0	NaN	NaN	NaN	352422.0	11704.61	NaN
1	2009-01-02	2.8	135.24	2.0	1.0	2009.0	NaN	NaN	NaN	352422.0	11684.74	NaN
2	2009-01-03	2.8	135.17	3.0	1.0	2009.0	NaN	NaN	NaN	352422.0	11678.69	NaN
3	2009-01-04	2.8	134.87	4.0	1.0	2009.0	NaN	NaN	NaN	352422.0	11652.77	NaN
4	2009-01-05	2.8	134.80	5.0	1.0	2009.0	NaN	NaN	NaN	352422.0	11646.72	NaN

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	ET01
Date
2020-06-25	0.0	74.29	177	6	2020	4.03
2020-06-26	0.0	73.93	178	6	2020	4.17
2020-06-27	0.0	73.60	179	6	2020	4.45
2020-06-28	0.0	73.14	180	6	2020	4.51
2020-06-29	0.0	72.88	181	6	2020	4.51
2020-06-30	0.0	72.53	182	6	2020	4.88

	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020
2020-01-31	0.96	0.85	0.99	0.95	0.75	0.85	0.94	0.91	0.95	0.97	0.96	0.98
2020-02-29	0.92	0.86	0.97	0.96	0.93	0.89	0.91	0.82	0.96	0.96	0.93	0.97
2020-03-31	0.99	0.97	0.98	0.95	0.94	0.93	0.90	0.89	0.95	0.71	0.96	0.99
2020-04-30	1.00	0.98	0.99	0.91	0.99	0.95	0.96	0.99	0.95	0.96	0.95	0.96
2020-05-31	0.94	0.91	0.93	0.97	0.94	0.99	0.96	0.94	0.95	0.97	0.83	0.94
2020-06-30	0.93	0.94	0.91	0.93	0.93	0.93	0.92	0.99	0.93	0.94	0.97	0.94
2020-07-31	0.94	0.92	0.91	0.89	0.90	0.91	0.90	0.93	0.92	0.91	0.95	NaN
2020-08-31	0.93	0.88	0.91	0.96	0.90	0.91	0.92	0.91	0.93	0.89	0.91	NaN
2020-09-30	0.94	0.91	0.94	0.99	0.90	0.92	0.94	0.92	0.97	0.91	0.93	NaN
2020-10-31	0.94	0.92	0.94	0.86	0.94	0.93	0.91	0.92	0.97	0.91	0.93	NaN
2020-11-30	0.92	0.78	0.94	0.61	0.75	0.97	0.94	0.98	0.96	0.93	0.91	NaN
2020-12-31	0.91	0.91	0.95	0.95	0.97	0.97	0.96	0.93	0.63	0.97	0.88	NaN

	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020
2020-01-31	0.95	0.51	0.95	0.90	0.61	0.82	0.90	0.81	0.89	0.84	0.94	0.97
2020-02-29	0.82	0.76	0.95	0.91	0.86	0.68	0.62	0.73	0.88	0.91	0.62	0.93
2020-03-31	0.95	0.89	0.96	0.91	0.85	0.88	0.86	0.70	0.79	0.50	0.91	0.97
2020-04-30	0.99	0.95	0.97	0.84	0.97	0.91	0.87	0.99	0.91	0.88	0.89	0.93
2020-05-31	0.88	0.83	0.84	0.95	0.87	0.98	0.91	0.88	0.89	0.92	0.78	0.88
2020-06-30	0.86	0.88	0.83	0.85	0.87	0.85	0.84	0.97	0.87	0.87	0.88	0.88
2020-07-31	0.87	0.84	0.83	0.83	0.80	0.83	0.82	0.85	0.86	0.81	0.88	NaN
2020-08-31	0.85	0.78	0.84	0.93	0.81	0.83	0.84	0.83	0.88	0.78	0.83	NaN
2020-09-30	0.88	0.82	0.89	0.96	0.82	0.85	0.86	0.84	0.93	0.82	0.85	NaN
2020-10-31	0.88	0.84	0.88	0.81	0.89	0.87	0.87	0.87	0.93	0.84	0.87	NaN
2020-11-30	0.86	0.74	0.89	0.53	0.57	0.93	0.89	0.96	0.93	0.89	0.75	NaN
2020-12-31	0.87	0.71	0.91	0.69	0.94	0.95	0.92	0.87	0.44	0.93	0.84	NaN

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	Diff	pct_ch	Flow_log	Flow_log_pct_ch
Date
2010-01-23	3.27	157.56	23	1	2010	-0.03	0.10	5.07	1.99e-02
2010-01-25	3.27	158.08	25	1	2010	-0.30	0.18	5.07	3.60e-02
2010-01-26	3.27	158.23	26	1	2010	-0.61	0.09	5.07	1.86e-02
2010-01-27	3.27	158.19	27	1	2010	-0.98	-0.03	5.07	-4.96e-03
2010-01-28	3.27	158.41	28	1	2010	-0.67	0.14	5.07	2.72e-02
...	...	...	...	...	...	...	...	...	...
2020-06-26	0.00	73.93	178	6	2020	-0.16	-0.48	4.32	-1.11e-01
2020-06-27	0.00	73.60	179	6	2020	-0.10	-0.45	4.31	-1.02e-01
2020-06-28	0.00	73.14	180	6	2020	-0.11	-0.62	4.31	-1.43e-01
2020-06-29	0.00	72.88	181	6	2020	-0.03	-0.36	4.30	-8.16e-02
2020-06-30	0.00	72.53	182	6	2020	-0.25	-0.48	4.30	-1.10e-01

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	Diff	pct_ch	Flow_log	Flow_log_pct_ch	FlowDiff_log	FlowDiff_log_pct_ch
Date
2009-01-04	11.19	540.75	10	4	8036	0.00	-0.44	19.66	-0.09	0.00	0.00
2009-01-11	19.58	946.12	56	7	14063	0.00	0.38	34.40	0.08	0.00	0.00
2009-01-18	19.58	951.18	105	7	14063	0.00	0.53	34.43	0.11	0.00	0.00
2009-01-25	19.58	951.85	154	7	14063	0.00	0.46	34.44	0.09	0.00	0.00
2009-02-01	19.55	979.86	203	8	14063	0.00	3.74	34.64	0.75	0.00	0.00
...	...	...	...	...	...	...	...	...	...	...	...
2019-12-01	17.80	631.07	2324	78	14133	-1.86	0.82	31.59	0.18	2.45	64.02
2019-12-08	15.60	635.77	2373	84	14133	3.13	0.09	31.64	0.02	2.54	-57.79
2019-12-15	19.80	636.98	2422	84	14133	0.51	0.30	31.65	0.06	1.23	-163.82
2019-12-22	49.60	643.10	2471	84	14133	1.07	5.15	31.72	1.10	-0.46	-156.31
2019-12-29	0.80	727.19	2520	84	14133	12.73	10.91	32.57	2.31	6.84	-68.59

Dep. Variable:	y	No. Observations:	553
Model:	SARIMAX(1, 1, 1)	Log Likelihood	-2825.307
Date:	Sat, 08 May 2021	AIC	5656.615
Time:	13:02:08	BIC	5669.555
Sample:	0	HQIC	5661.671
	- 553
Covariance Type:	opg

Ljung-Box (L1) (Q):	0.00	Jarque-Bera (JB):	157713.33
Prob(Q):	0.95	Prob(JB):	0.00
Heteroskedasticity (H):	2.41	Skew:	-4.51
Prob(H) (two-sided):	0.00	Kurtosis:	85.31

	Flow_Rate_Lupa	Flow_Rate_Lupa (t-1)	Flow_Rate_Lupa (t-2)	Flow_Rate_Lupa (t-3)	Flow_Rate_Lupa (t-4)	Flow_Rate_Lupa (t-5)	Flow_Rate_Lupa (t-6)	Flow_Rate_Lupa (t-7)	Flow_Rate_Lupa (t-8)	Flow_Rate_Lupa (t-9)	Flow_Rate_Lupa (t-10)	Flow_Rate_Lupa (t-11)	Flow_Rate_Lupa (t-12)	Flow_Rate_Lupa (t-13)	Flow_Rate_Lupa (t-14)	Flow_Rate_Lupa (t-15)	Flow_Rate_Lupa (t-16)	Flow_Rate_Lupa (t-17)	Flow_Rate_Lupa (t-18)	Flow_Rate_Lupa (t-19)	Flow_Rate_Lupa (t-20)	Flow_Rate_Lupa (t-21)	Flow_Rate_Lupa (t-22)	Flow_Rate_Lupa (t-23)	Flow_Rate_Lupa (t-24)
Date
2009-06-01	4398.44	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
2009-07-01	3942.35	4398.44	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
2009-08-01	3365.59	3942.35	4398.44	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
2009-09-01	2788.74	3365.59	3942.35	4398.44	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
2009-10-01	2512.17	2788.74	3365.59	3942.35	4398.44	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...	...
2019-08-01	3266.60	3867.76	3853.07	2916.13	2956.20	3431.76	3073.87	2234.34	2334.30	2321.38	2724.62	3219.38	4232.48	5391.37	6177.01	7186.71	6813.94	4425.24	2754.38	2795.29	1461.65	1012.44	1138.28	1196.26	1355.55
2019-09-01	2640.87	3266.60	3867.76	3853.07	2916.13	2956.20	3431.76	3073.87	2234.34	2334.30	2321.38	2724.62	3219.38	4232.48	5391.37	6177.01	7186.71	6813.94	4425.24	2754.38	2795.29	1461.65	1012.44	1138.28	1196.26
2019-10-01	2306.65	2640.87	3266.60	3867.76	3853.07	2916.13	2956.20	3431.76	3073.87	2234.34	2334.30	2321.38	2724.62	3219.38	4232.48	5391.37	6177.01	7186.71	6813.94	4425.24	2754.38	2795.29	1461.65	1012.44	1138.28
2019-11-01	2467.15	2306.65	2640.87	3266.60	3867.76	3853.07	2916.13	2956.20	3431.76	3073.87	2234.34	2334.30	2321.38	2724.62	3219.38	4232.48	5391.37	6177.01	7186.71	6813.94	4425.24	2754.38	2795.29	1461.65	1012.44
2019-12-01	2948.94	2467.15	2306.65	2640.87	3266.60	3867.76	3853.07	2916.13	2956.20	3431.76	3073.87	2234.34	2334.30	2321.38	2724.62	3219.38	4232.48	5391.37	6177.01	7186.71	6813.94	4425.24	2754.38	2795.29	1461.65

Spring	Elevation	Outflow (l/s)
Scheggino	300	200
Lupa	365	125
Pacce	475	80
Castellone	450-325	115

	Flow_Rate_Lupa
Date
2020-01-05	108.16
2020-01-12	108.89
2020-01-19	109.74
2020-01-26	110.59
2020-02-02	111.41
2020-02-09	110.11
2020-02-16	108.41
2020-02-23	106.55
2020-03-01	104.41
2020-03-08	104.22
2020-03-15	103.46
2020-03-22	102.51
2020-03-29	102.21
2020-04-05	101.39
2020-04-12	99.67
2020-04-19	97.84
2020-04-26	95.92
2020-05-03	94.20
2020-05-10	91.66
2020-05-17	89.02
2020-05-24	86.47
2020-05-31	83.85
2020-06-07	81.52
2020-06-14	79.03
2020-06-21	76.56
2020-06-28	74.25
2020-07-05	72.70

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	ET01	Infilt_	Infiltsum	Rainfall_Ter	Flow_Rate_Lup	Infilt_m3	Week
Date
2010-01-01	40.8	82.24	1	1	2010	1.34	1.93	1.93	4.12e+05	7105.54	143639.37	53
2010-01-02	6.8	88.90	2	1	2010	1.70	1.57	3.51	4.12e+05	7680.96	130966.87	53
2010-01-04	4.2	96.63	4	1	2010	1.00	2.28	8.12	4.12e+05	8348.83	155554.40	1
2010-01-05	26.0	98.65	5	1	2010	1.28	1.99	10.11	4.12e+05	8523.36	145736.74	1
2010-01-06	18.0	102.15	6	1	2010	1.21	2.06	12.17	4.12e+05	8825.76	148019.01	1
...	...	...	...	...	...	...	...	...	...	...	...	...
2020-06-15	4.8	77.43	167	6	2020	3.00	1.80	-519.74	6.05e+05	6689.95	174476.48	25
2020-06-16	0.6	77.14	168	6	2020	3.00	-2.40	-522.14	7.56e+04	6664.90	-69920.07	25
2020-06-17	10.0	76.89	169	6	2020	3.07	6.93	-515.21	1.26e+06	6643.30	474524.27	25
2020-06-18	2.8	76.42	170	6	2020	3.31	-0.51	-515.72	3.53e+05	6602.69	47188.25	25
2020-06-19	0.2	76.39	171	6	2020	3.46	-3.26	-518.99	2.52e+04	6600.10	-109228.97	25

	Monthly rainfall	Flow_Rate_Lupa	flowrate_6	Volume	RainVolume	Volume_6
2009-09-01	107.10	NaN	NaN	NaN	4.28e+05	NaN
2009-10-01	186.52	NaN	894.0	NaN	7.46e+05	2.35e+06
2009-11-01	308.23	72.00	894.0	189343.01	1.23e+06	2.35e+06
2009-12-01	472.78	71.00	894.0	186713.24	1.89e+06	2.35e+06
2010-01-01	574.24	110.00	894.0	289274.04	2.30e+06	2.35e+06
...	...	...	...	...	...	...
2020-02-01	560.64	105.24	NaN	276758.99	2.24e+06	NaN
2020-03-01	615.64	103.01	NaN	270902.51	2.46e+06	NaN
2020-04-01	667.84	97.95	NaN	257595.03	2.67e+06	NaN
2020-05-01	783.04	88.32	NaN	232263.30	3.13e+06	NaN
2020-06-01	851.24	77.50	NaN	203804.96	3.40e+06	NaN

	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)
6	39.56	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN
7	29.45	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN

	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)	(Rainfall_Terni, sum)
136	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	115.2	NaN
137	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	68.2	NaN

	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020
118	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	402.46	NaN	NaN
119	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	459.01	NaN	NaN
120	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	528.51	NaN	NaN
121	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	579.53	NaN	NaN
122	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	611.80	NaN	NaN
123	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	690.13	NaN	NaN
124	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	799.99	NaN	NaN
125	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	823.33	NaN	NaN
126	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	74.46	NaN
127	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	97.19	NaN
128	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	187.19	NaN
129	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	231.70	NaN
130	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	504.22	NaN
131	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	584.82	NaN
132	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	605.22	NaN
133	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	657.82	NaN
134	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	712.82	NaN
135	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	765.02	NaN
136	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	880.22	NaN
137	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	NaN	948.42	NaN

	Rainfall_Terni
Date
2009-01-01	86.71
2009-02-01	77.36
2009-03-01	64.36
2009-04-01	83.70
2009-05-01	35.31
...	...
2020-02-01	52.60
2020-03-01	55.00
2020-04-01	52.20
2020-05-01	115.20
2020-06-01	68.20

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	diff	pct_ch	Flow_log	Flow_log_pct_ch	Diff
Date
2020-06-26	0.0	73.15	178	6	2020	-0.19	-0.27	4.31	-0.06	-0.19
2020-06-27	0.0	72.96	179	6	2020	-0.19	-0.27	4.30	-0.06	-0.19
2020-06-28	0.0	72.76	180	6	2020	-0.19	-0.27	4.30	-0.06	-0.19
2020-06-29	0.0	72.57	181	6	2020	-0.19	-0.27	4.30	-0.06	-0.19
2020-06-30	0.0	72.37	182	6	2020	-0.19	-0.27	4.30	-0.06	-0.19

	Rainfall
2014-01-01	1.0
2014-01-02	1.2
2014-01-03	0.6
2014-01-04	16.0
2014-01-05	0.2

	Rainfall_Anca
2019-01-01	0.0
2019-01-02	0.0
2019-01-03	0.0
2019-01-04	0.0
2019-01-05	0.0

	Rainfall
2014-05-31	0.0
2014-06-01	0.0
2014-06-02	0.0
2014-06-03	0.0
2014-06-04	0.0
...	...
2020-05-27	0.0
2020-05-28	0.0
2020-05-29	11.4
2020-05-30	1.2
2020-05-31	0.2

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year
Date
2009-01-01	14.4	135.47	1.0	1.0	2009.0
2009-01-02	0.2	135.24	2.0	1.0	2009.0
2009-01-03	0.2	135.17	3.0	1.0	2009.0
2009-01-04	0.0	134.87	4.0	1.0	2009.0
2009-01-05	0.0	134.80	5.0	1.0	2009.0
...	...	...	...	...	...
2022-05-21	NaN	64.89	NaN	NaN	NaN
2022-05-22	NaN	65.22	NaN	NaN	NaN
2022-05-23	NaN	65.03	NaN	NaN	NaN
2022-05-24	NaN	64.62	NaN	NaN	NaN
2022-05-25	NaN	64.50	NaN	NaN	NaN

	Rainfall_Terni_x	Flow_Rate_Lupa_x	doy_x	Month_x	Year_x	Rainfall_Terni_y	Flow_Rate_Lupa_y	doy_y	Month_y	Year_y
Date_excel
2010-01-01	40.8	82.24	1	1	2010	NaN	NaN	NaN	NaN	NaN
2010-01-02	6.8	88.90	2	1	2010	NaN	NaN	NaN	NaN	NaN
2010-01-03	0.0	93.56	3	1	2010	NaN	NaN	NaN	NaN	NaN
2010-01-04	4.2	96.63	4	1	2010	NaN	NaN	NaN	NaN	NaN
2010-01-05	26.0	98.65	5	1	2010	NaN	NaN	NaN	NaN	NaN
...	...	...	...	...	...	...	...	...	...	...
2020-06-25	0.0	74.29	177	6	2020	NaN	NaN	NaN	NaN	NaN
2020-06-26	0.0	73.93	178	6	2020	NaN	NaN	NaN	NaN	NaN
2020-06-27	0.0	73.60	179	6	2020	NaN	NaN	NaN	NaN	NaN
2020-06-28	0.0	73.14	180	6	2020	NaN	NaN	NaN	NaN	NaN
2020-06-29	0.0	72.88	181	6	2020	NaN	NaN	NaN	NaN	NaN

Table of Contents

Italian waterbodies data: water source Lupa¶

Data set description¶

Aquifer¶

Auser¶

Petrignano Aquifer¶

Doganella Aquifer¶

Luco Aquifer¶

Water spring¶

Amiata¶

Madonna di Canneto¶

Lupa¶

River Arno¶

Lake Bilancino¶

Introduction¶

Methodology¶

Water spring Lupa¶

References¶

Evolution of the outflow over the last 11 years¶

taking the differences¶

logarithms of the flow rate.¶

pmdArima: introduction¶

2020¶

Rainfall¶

Annual Water Budget Ratio (AWBR)¶

Arrone rainfall addition¶

Ancaiano daily pluviometry¶

Ancaiano pluviometry 2009 and 2020-2022¶

SPI¶

SPI 12 calculation via module standard-precip¶

SPEI¶

Soil moisture condition¶

TR-55: 'Urban Hydrology for Small Watersheds'¶

SCS CN- or curve number method¶

Runoff Equation¶

SCS Hydrologic Soil Groups: Soil textures¶

Runoff curve numbers for cultivated/other agricultural lands and soil types¶

Curve-number map for Umbria Region¶

Calculate runoff depth¶

The determination of CN values: by formula method, or by conversion table method¶

Infiltration coefficients method¶

Calculate the infiltrate amount i.f.o mm/day rainfall¶

Temperature¶

Heat index per year¶

T2M_MAX, T2M_MIN, relative humidity¶

AET/PET - Drought Index¶

Flow rate original data¶

Rolling sums monthly¶

The variability index (Meinzer, 1927)¶

First attempt to gather better flow rate data...¶

resample flow rate 2010-2019 monthly¶

The 2012 series¶

Maximum 1999-2011, Minimum 1999-2011, Mean 1999-2011¶

Mean of period 2010-2020 vs. the Mean of 1999-2011¶

Set list of distributions to test¶

Conversion of units etc...¶

Yearly and monthly aggregates¶

statsmodels api¶

Netto infiltration - outflow¶

statsmodels SARIMAX¶

pmdArima examples¶

Fitting an auto_arima model¶

Displaying key timeseries statistics¶

Array differencing¶

Modeling quasi-seasonal trends with date features¶

Rolling sums in m³¶

Rainfall rolling sums¶

Cumulative sums for the rainfall and outflow in cubic meters.¶

Water balance m³ rainfall - water spring outflow¶

Cross-correlation and Auto-correlation¶

Cross-correlation daily and weekly data¶

Cross-correlation daily and weekly data¶

Recession coefficient or coefficient of depletion¶

Plots of absolute vs. mean values of Q / t¶

Outflow of 2017¶

Calculating evapotranspiration method 1¶

Calculating evapotranspiration method 2¶

Evapotranspiration data for water spring Peschiera¶

Solar Radiation¶

Hargreaves method¶

Solar Radiation ¶

	Infiltrate	Flow_Rate_Lupa	Rainfall_Terni
Date_excel
2010-01-01	2.81	136.20	6.06
2010-02-01	3.42	181.53	6.09
2010-03-01	1.78	234.50	3.40
2010-04-01	1.40	235.53	3.69
2010-05-01	2.97	239.19	7.25
...	...	...	...
2020-02-01	0.66	107.80	1.32
2020-03-01	1.28	103.03	2.30
2020-04-01	0.98	97.95	1.73
2020-05-01	0.52	88.32	1.86
2020-06-01	0.95	77.67	2.35

	AMC class	moisture	Dormant season	Growing season
0	AMC I	dry	P<12.7	P<35.6
1	AMC II	medium	12.7<P<27.9	35.6<P<53.3
2	AMC III	wet	P>27.9	P>53.3

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Week	Dormant
Date
2014-04-15	2.14	92.86	105	4	16	1
2014-04-16	2.14	92.87	106	4	16	1
2014-04-17	2.14	92.88	107	4	16	1
2014-04-18	2.14	92.89	108	4	16	1
2014-04-19	2.14	92.90	109	4	16	1
2014-04-20	2.14	92.91	110	4	16	1
2014-04-21	2.14	92.92	111	4	17	0

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Week	Dormant
Date
2015-04-14	1.95	96.50	104	4	16	1
2015-04-15	1.95	96.51	105	4	16	1
2015-04-16	1.95	96.52	106	4	16	1
2015-04-17	1.95	96.53	107	4	16	1
2015-04-18	1.95	96.54	108	4	16	1
2015-04-19	1.95	96.57	109	4	16	1
2015-04-20	1.95	96.58	110	4	17	0
2015-04-21	1.95	96.59	111	4	17	0

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Week	Dormant
Date
2010-11-03	7.96	80.25	307	11	44	1
2010-11-04	7.96	80.26	308	11	44	1
2010-11-05	7.96	80.27	309	11	44	1
2010-11-06	7.96	80.28	310	11	44	1
2010-11-07	7.96	80.29	311	11	44	1
...	...	...	...	...	...	...
2019-12-02	2.60	113.45	336	12	49	1
2020-03-02	18.80	103.27	62	3	10	1
2020-03-03	8.80	104.06	63	3	10	1
2020-03-05	0.20	104.57	65	3	10	1
2020-03-06	8.60	104.56	66	3	10	1

	Cover type	Treatment	Hydrologic condition	A	B	C	D
0	Fallow	Bare soil	—	77	86	91	94
1	Woods	-	Poor	45	66	77	83
2	Woods	-	Fair	36	60	73	79
3	Woods	-	Good	30	55	70	77

	Rainfall_Terni	Flow_Rate_Lupa	doy	Month	Year	ET01	Infilt_	Infiltsum	Rainfall_Ter	Flow_Rate_Lup	Infilt_m3	Week	Date_excel	log_Flow	Lupa_Mean99_2011	Rainfall_Terni_minET	Infiltrate
Date
2017-10-01	0.0	38.02	274.0	10.0	2017.0	2.53	-2.53e+00	-824.71	0.00e+00	3284.93	-8.83e+04	39.0	2017-10-01	8.10	85.80	0.00	0.00
2017-10-02	0.0	37.91	275.0	10.0	2017.0	2.89	-2.89e+00	-827.59	0.00e+00	3275.42	-1.01e+05	40.0	2017-10-02	8.09	84.69	0.00	0.00
2017-10-03	0.0	37.81	276.0	10.0	2017.0	3.32	-3.32e+00	-830.92	0.00e+00	3266.78	-1.16e+05	40.0	2017-10-03	8.09	85.28	0.00	0.00
2017-10-04	0.0	37.69	277.0	10.0	2017.0	3.46	-3.46e+00	-834.38	0.00e+00	3256.42	-1.21e+05	40.0	2017-10-04	8.09	85.25	0.00	0.00
2017-10-05	0.0	37.59	278.0	10.0	2017.0	3.22	-3.22e+00	-837.60	0.00e+00	3247.78	-1.12e+05	40.0	2017-10-05	8.09	85.21	0.00	0.00
2017-10-06	18.2	37.55	279.0	10.0	2017.0	3.50	1.47e+01	-822.90	2.29e+06	3244.32	9.36e+05	40.0	2017-10-06	8.08	85.32	14.70	9.87
2017-10-07	7.0	37.47	280.0	10.0	2017.0	2.06	4.94e+00	-817.96	8.82e+05	3237.41	3.35e+05	40.0	2017-10-07	8.08	83.46	4.94	4.31
2017-10-08	4.4	37.42	281.0	10.0	2017.0	2.62	1.78e+00	-816.17	5.54e+05	3233.09	1.65e+05	40.0	2017-10-08	8.08	85.18	1.78	1.67
2017-10-09	0.2	37.34	282.0	10.0	2017.0	3.05	-2.85e+00	-819.02	2.52e+04	3226.18	-9.48e+04	41.0	2017-10-09	8.08	84.95	0.00	0.00
2017-10-10	3.4	37.25	283.0	10.0	2017.0	2.50	8.98e-01	-818.13	4.28e+05	3218.40	1.10e+05	41.0	2017-10-10	8.08	84.73	0.90	0.86
2017-10-11	1.8	37.16	284.0	10.0	2017.0	3.06	-1.26e+00	-819.39	2.27e+05	3210.62	-2.18e+03	41.0	2017-10-11	8.07	82.76	0.00	0.00
2017-10-12	0.0	37.09	285.0	10.0	2017.0	2.87	-2.87e+00	-822.26	0.00e+00	3204.58	-1.00e+05	41.0	2017-10-12	8.07	84.06	0.00	0.00
2017-10-13	3.0	37.05	286.0	10.0	2017.0	3.00	2.24e-03	-822.26	3.78e+05	3201.12	6.99e+04	41.0	2017-10-13	8.07	83.37	0.00	0.00
2017-10-14	21.2	36.91	287.0	10.0	2017.0	3.18	1.80e+01	-804.24	2.67e+06	3189.02	1.12e+06	41.0	2017-10-14	8.07	82.23	18.02	10.86
2017-10-15	9.0	36.80	288.0	10.0	2017.0	3.20	5.80e+00	-798.44	1.13e+06	3179.52	4.12e+05	41.0	2017-10-15	8.06	82.67	5.80	4.96
2017-10-16	0.0	36.73	289.0	10.0	2017.0	3.16	-3.16e+00	-801.60	0.00e+00	3173.47	-1.10e+05	42.0	2017-10-16	8.06	81.87	0.00	0.00
2017-10-17	0.0	36.68	290.0	10.0	2017.0	2.88	-2.88e+00	-804.49	0.00e+00	3169.15	-1.01e+05	42.0	2017-10-17	8.06	82.08	0.00	0.00
2017-10-18	0.8	36.65	291.0	10.0	2017.0	2.91	-2.11e+00	-806.59	1.01e+05	3166.56	-5.50e+04	42.0	2017-10-18	8.06	81.59	0.00	0.00
2017-10-19	0.0	36.63	292.0	10.0	2017.0	2.93	-2.93e+00	-809.53	0.00e+00	3164.83	-1.02e+05	42.0	2017-10-19	8.06	81.40	0.00	0.00
2017-10-20	0.0	36.47	293.0	10.0	2017.0	2.74	-2.74e+00	-812.26	0.00e+00	3151.01	-9.55e+04	42.0	2017-10-20	8.06	81.21	0.00	0.00
2017-10-21	10.6	36.34	294.0	10.0	2017.0	3.06	7.54e+00	-804.73	1.34e+06	3139.78	5.10e+05	42.0	2017-10-21	8.05	80.99	7.54	6.17
2017-10-22	0.2	36.17	295.0	10.0	2017.0	2.29	-2.09e+00	-806.81	2.52e+04	3125.09	-6.82e+04	42.0	2017-10-22	8.05	80.90	0.00	0.00
2017-10-23	0.0	36.25	296.0	10.0	2017.0	1.94	-1.94e+00	-808.76	0.00e+00	3132.00	-6.79e+04	43.0	2017-10-23	8.05	80.69	0.00	0.00
2017-10-24	0.0	36.20	297.0	10.0	2017.0	2.14	-2.14e+00	-810.90	0.00e+00	3127.68	-7.47e+04	43.0	2017-10-24	8.05	80.48	0.00	0.00
2017-10-25	0.0	35.89	298.0	10.0	2017.0	2.22	-2.22e+00	-813.12	0.00e+00	3100.90	-7.76e+04	43.0	2017-10-25	8.04	79.96	0.00	0.00
2017-10-26	6.4	35.78	299.0	10.0	2017.0	3.17	3.23e+00	-809.89	8.06e+05	3091.39	2.62e+05	43.0	2017-10-26	8.04	79.73	3.23	2.93
2017-10-27	24.8	35.70	300.0	10.0	2017.0	2.62	2.22e+01	-787.71	3.12e+06	3084.48	1.35e+06	43.0	2017-10-27	8.03	79.51	22.18	11.47
2017-10-28	0.0	35.62	301.0	10.0	2017.0	2.18	-2.18e+00	-789.89	0.00e+00	3077.57	-7.59e+04	43.0	2017-10-28	8.03	79.76	0.00	0.00
2017-10-29	0.0	35.53	302.0	10.0	2017.0	2.54	-2.54e+00	-792.43	0.00e+00	3069.79	-8.85e+04	43.0	2017-10-29	8.03	79.12	0.00	0.00
2017-10-30	0.0	35.35	303.0	10.0	2017.0	2.20	-2.20e+00	-794.62	0.00e+00	3054.24	-7.66e+04	44.0	2017-10-30	8.02	78.53	0.00	0.00

	Infiltrate	Flow_Rate_Lupa	Flow_Rate_Lup	Flow_shift1	Flow_m3_shift1	Flow_shift3	Flow_shift2
Date
2009-07-01	394.47	38569.66	3.33e+06	40955.66	4.10e+04	40955.66	40955.66
2010-07-01	467.01	63232.60	5.46e+06	38569.66	3.33e+06	40955.66	40955.66
2011-07-01	371.19	24915.43	2.15e+06	63232.60	5.46e+06	40955.66	38569.66
2012-07-01	675.78	46107.22	3.98e+06	24915.43	2.15e+06	38569.66	63232.60
2013-07-01	548.24	60580.10	5.23e+06	46107.22	3.98e+06	63232.60	24915.43
2014-07-01	322.42	42235.07	3.65e+06	60580.10	5.23e+06	24915.43	46107.22
2015-07-01	350.05	33402.58	2.89e+06	42235.07	3.65e+06	46107.22	60580.10
2016-07-01	415.18	29680.90	2.56e+06	33402.58	2.89e+06	60580.10	42235.07
2017-07-01	473.94	37878.33	3.27e+06	29680.90	2.56e+06	42235.07	33402.58
2018-07-01	447.66	38688.90	3.34e+06	37878.33	3.27e+06	33402.58	29680.90
2019-07-01	372.62	35221.52	3.04e+06	38688.90	3.34e+06	29680.90	37878.33

	LAT	LON	YEAR	DOY	T2M_MAX	T2M_MIN	T2M	ALLSKY_SFC_LW_DWN	RH2M	PRECTOT
Date
2010-01-01	42.59	12.77	2010	1	9.50	5.56	7.57	28.31	95.38	20.00
2010-01-02	42.59	12.77	2010	2	10.08	0.29	5.16	25.23	89.86	2.02
2010-01-03	42.59	12.77	2010	3	3.86	-2.43	0.12	22.25	81.25	0.58
2010-01-04	42.59	12.77	2010	4	3.45	-0.69	1.38	27.21	93.79	2.18
2010-01-05	42.59	12.77	2010	5	7.34	3.00	5.23	28.38	98.94	26.46

	PETmm	SoilStorage	SoilWaterDeficit	RO_mm	AET
2010-01-01	5.31	200	0	96.08	5.31
2010-02-01	9.85	200	0	81.51	9.85
2010-03-01	21.9	200	0	22.82	21.9
2010-04-01	45.62	200	0	17.17	45.62
2010-05-01	71.05	200	0	26.21	71.05
...	...	...	...	...	...
2021-01-01	26.02	200	0	160.93	26.02
2021-02-01	44.25	200	0	25.15	44.25
2021-03-01	54.74	190.26	0	0	54.74
2021-04-01	74.84	191.21	0	0	74.84
2021-05-01	113.6	125.51	0	0	113.6