For this project me and three colleagues of mine tried to forecast the price of energy in Spain using the KNIME platform, training four different models, Random Forest, XGBoost, ARIMA and LSTM on hourly energy production data of different sources from 2015 to 2018, and then we compared their performances.
- Check out the report
From the dataset all the columns containing only zeros and missing values have been removed.
The dataset has been divided in training set, which was composed by the 67% of the whole records, and the test set, composed by the remaining percentage. Afterwards a 10-folds cross validation has been performed.
The best performing model was XGBoost, and its performaces on the test set were:
XGBOOST | |
---|---|
R2 | 0.81 |
MAE | 4.50 |
RMSE | 6.23 |
MAPE | 0.1 |
ARIMA | LSTM | |
---|---|---|
MAE | 10.59 | 3.99 |
RMSE | 12.79 | 4.74 |
MAPE | 0.18 | 0.06 |
Credits to: Greta Gravina, Alessio Pasinato, Niccolò Rocchi, Marco Scatassi.