• No results found

Selection of Input Variables for a Thermal Load Prediction Model

N/A
N/A
Protected

Academic year: 2021

Share "Selection of Input Variables for a Thermal Load Prediction Model"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

1876-6102 © 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Peer-review under responsibility of the CENTRO CONGRESSI INTERNAZIONALE SRL doi: 10.1016/j.egypro.2015.11.704

Energy Procedia 78 ( 2015 ) 3001 – 3006

ScienceDirect

6th International Building Physics Conference, IBPC 2015

Selection of Input Variables for a Thermal Load Prediction Model

Dimitrios-Stavros Kapetanakis

a

*, Eleni Mangina

b

, El Hassan Ridouane

c

, Konstantinos

Kouramas

c

, Donal Finn

a

aSchool of Mechanical and Materials Engineering, University College Dublin (UCD) b

School of Computer Science and Informatics, University College Dublin (UCD)

c

United Technologies Research Center (UTRC), Cork, Ireland

Abstract

Forecasting of building thermal loads, without the use of simulation software, can be achieved using data from Building Energy Management Systems (BEMS). Experience in building load prediction using historical data has shown that data analysis is a key factor in order to produce accurate results. This paper examines issues related to the selection of appropriate input variables from wider datasets obtained from BEMS sensors. These variables will be introduced to a new data-driven model, which estimates building space loads. Results indicate that ambient temperature and relative humidity along with solar radiation are the predominant variables that should be considered as input variables to the predictive model.

© 2015 The Authors. Published by Elsevier Ltd.

Peer-review under responsibility of the CENTRO CONGRESSI INTERNAZIONALE SRL. Keywords: commercial building; thermal loads; input selection; predictive model

1. Introduction

The building sector consumes 35% of global final energy use and is responsible for about 17% of total direct energy-related CO2 emissions from final energy consumers. Space heating, water heating and space cooling account

for nearly 55% of global building energy use and represent the largest opportunity to reduce buildings energy consumption [1]. The use of efficient energy management systems in buildings is expected to generate a maximum saving of 8% of the energy consumption in the EU [2]. In order to decrease the energy usage and increase

* Corresponding author. Tel.: +353-1-7161726; fax: +353-1-283-0921.

E-mail address: [email protected]

Available online at www.sciencedirect.com

© 2015 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Peer-review under responsibility of the CENTRO CONGRESSI INTERNAZIONALE SRL

(2)

compliance with the European Directives on the energy performance of buildings [3], it is of fundamental importance to control efficiently the existing Heating Ventilation and Air Conditioning (HVAC) systems. Predicting the load of HVAC systems is important for energy management especially during peak energy demand hours [4].

Predictions of building thermal load can be estimated using appropriate simulation software [5] when detailed data such as building geometry, occupancy as well as environmental variables are available. In reality, such data are often unknown, especially for older buildings, where uncertainty arising from parameter and occupancy estimation can lead to significant additional modelling efforts [6]. An alternative way to forecast these loads is to take advantage of BEMS recorded data. These data records include underlying information regarding building thermal response and can be introduced to data-mining models, which utilise extensive assessment of input and output variables, in order to produce accurate predictions [7]. In the context of developing data driven models capable of forecasting thermal loads of commercial buildings, pre-analysis of BEMS data, is a prerequisite.

The contribution of the current paper is to address the task of identifying and prioritising input variables for data-driven predictive models. To date, input variables are identified by model developers based on a combination of domain expertise and performance heuristics. A robust and repeatable approach is described based on a statistical analysis of historic performance data, which for commercial buildings is typically available from BEMS. The proposed methodology is intended to be broadly applicable, such that it can be applied to any commercial building for any global climate. This pre-selection process is critical for the development of predictive models, since the use of redundant input variables introduces unnecessary additional complexity during the development and execution of the models. A commercial building, located in Cork, Ireland is utilised as a testbed building for the development stage of the predictive model. Data are extracted on a time step basis in order to analyse them for the investigation of possible correlations between variables. Incomplete datasets are often acquired when extracting the data. In order to preclude missing data, a detailed representation of the testbed building is implemented using EnergyPlus [8].

2. Background

The most common methods used in the literature to achieve forecasting of building thermal loads are Regression, Artificial Neural Network (ANN) and Support Vector Machine (SVM).

Aranda et al. [9] used linear regression models to predict the annual energy consumption in the Spanish banking sector. The energy consumption of a single building was predicted as a function of its construction characteristics, climatic area and energy performance. Catalina et al. [10] worked on the development of regression models to predict monthly heating demand for single-family residential sector. The inputs for the regression models were building shape factor, envelope U-value, window to floor area ratio, building time constant and climate. An update to the aforementioned work [11], in an attempt to simplify the model to obtain fast predictions, used as inputs the building global heat loss, south equivalent surface and difference between indoor and ambient temperature.

ANNs have been applied to analyse various types of building energy consumption, as well as heating loads. Kalogirou et al. [12,13] implemented ANN at an early design stage to predict the required heating load of buildings. Input data included the areas and types of windows, walls and floors, roof classification and room temperature. Gonzalez and Zamarreno [14] used an ANN approach to predict the hourly energy consumption in buildings. The inputs of the network were current and forecasted values of temperature, the current load, the hour and the day.

SVM models have been used more recently for predicting energy consumption in buildings. Dong et al. [15] were the first to introduce the use of SVM for the prediction of the energy consumption of four commercial buildings. The input variables were mean outdoor dry-bulb temperature, relative humidity and global solar radiation. Li et al. [16] used a SVM model in regression to predict hourly building cooling load for an office building. The outdoor dry bulb temperature and solar radiation intensity were the input parameters for this model. A comparison of the developed SVM model against a back propagation neural network, a radial basis function neural network and a general regression neural network was published by Li et al. [17]. Simulation results indicated that the SVM and general regression neural network methods achieved better accuracy.

The research field related to building thermal loads forecasting has been very active, involving various scientific domains. Nevertheless, it is clear from the literature that little attention has been given to the justification of the selection of the input variables to their predictive models. Hence, the investigation of which variables and why should be considered as inputs to such a model, should be a priority towards the development stage of the model.

(3)

3. Methodology

The process of data analysis to detect interrelationship between variables within the current research consists of three stages: (1) the examination of the distribution of each variable (e.g. adhering to a Gaussian distribution), (2) investigation of linear correlation by calculating Pearson correlation coefficient, and (3) investigation of monotonic correlation by calculating Spearman correlation coefficient.

The initial task of the data analysis process is the selection of BEMS variables to be assessed. All of the variables are selected based on the sensors already installed at the testbed building. These variables are divided into two categories, input and output variables. Inputs are the ones introduced to the predictive model and outputs are the thermal loads, which will be forecasted from the model. Selection of input variables is validated with the implementation of widely known statistical techniques, which reveals the existence and importance of correlation with the output variables. Data analysis is based on simulated data obtained by an EnergyPlus model of the testbed building, due to the fact that available measured data was for a period of less than one year. Furthermore, the use of simulated data precludes the possibility of dealing with incomplete data sets that are often acquired when extracting recorded data in BMS systems. The EnergyPlus model provides one year simulated data at 15-minute intervals. The examination of the correlation between input and output variables was carried out with the use of two different weather data files from Cork, Ireland and London, United Kingdom, available from EnergyPlys [2]. The Cork data includes two years of data, 2011 and 2012, whereas London data is based on nine years data from 1983 until 1991.

3.1. Testbed Building Description

The NIMBUS building [18] is a two storey quadrangle-type office building located in Cork, Ireland. It has a low pitch roof varying in height from 7.7 to 8.7 m and the ratio of transparent to opaque envelope is approximately 34 %. A view of the building and the EnergyPlus model are shown in Figure 1. The heating system of the building incorporates a 50 kW electrical / 82 kW thermal combined heat and power (CHP) unit, two gas boilers of 175 kW, a 2000 litre water calorifier. Radiators are used to distribute the heat. An extensive network of meters and sensors has been deployed to facilitate measurement and necessary data collection for the control and monitoring of electrical and thermal systems. These measurements together with relevant information about gas and electricity power consumption measurements and prices, as well as thermal/electrical loads and weather forecast, are available from the Supervisory Control and Data Acquisition System and the BEMS.

Figure 1. (a) NIMBUS building; (b) EnergyPlus model of NIMBUS building.

3.2. Data Analysis

The procedure followed for analysing the data and examining the existence of a correlation between input and output variables was based on statistical techniques. Initially the selected input and output variables were tested to check if they follow a Gaussian distribution. This was achieved with the use of statistical tests [19]. Next, the existence of a linear correlation between input and output variables was investigated by performing a Pearson correlation [20], which measures the linearity between paired data. In a sample of data, it is denoted by rP and is

constrained between -1 and 1, where positive values denote positive linear correlation, while negative values denote negative linear correlation and a value of zero denotes no linear correlation. The correlation coefficient does not relate to the gradient beyond sharing its positive or negative sign. The strength of correlation can be verbally described using the following guide for the absolute value of rP where: 0.00–0.19 represents “very weak”, 0.20–0.39

represents “weak”, 0.40–0.59 represents “moderate”, 0.60–0.79 represents “strong” and 0.80–1.00 represents “very strong” correlation.

(4)

Furthermore, the existence of monotonic relationships can be identified as well through the Spearman correlation coefficient [20]. The Spearman correlation coefficient is a statistical measure of the strength of a monotonic relationship between paired data and is denoted by rS. The principles of Spearman correlation are the same as the

Pearson correlation coefficient.

The statistical analysis was carried outing using of the IBM SPSS Statistics 20 software [21]. Input variables were grouped according to weather data and indoor variables, as given in Table 1. Heating load of the testbed building is the output variable and as an extra output, the variable of gas consumption is examined regarding its correlation to heating load.

Table 1. Input variables under examination.

Weather Data Indoor Data Ambient Temperature Zone Air Temperature Ambient Relative Humidity Zone Relative Humidity Wind Speed

Solar Radiation Sky Clearness

Zone Occupancy Zone CO2 level

3.3. Selection of Input Variables

The selection process of the input variables that will be introduced to the predictive model is based on the calculated Pearson and Spearman correlations between input and output variables. Absolute values of the calculated Pearson and Spearman coefficients are used to simplify this process. Only “moderate”, “strong” and “very strong” relationships are of interest, hence the threshold value for introducing an input variable to the predictive model is 0.5. It is important to select carefully the input variables in order to avoid unnecessary increases in the complexity during the development of the model.

4. Results and Discussion

The first task of the data analysis was the examination of the hypothesis that individual parameters follow a Gaussian (normal) distribution. The results are illustrated in Table 2, for the Cork and London weather data. It can be seen that certain variables are normally distributed, including Ambient Temperature, Zone Air Temperature and Zone Relative Humidity. In addition, the results reveal that the distribution is not influenced by the different weather data. The only variable that is not consistent is Wind Speed.

Table 2. Normality Tests results for input and output variables.

Variable Normally distributed, Cork W.D. Normally distributed, London W.D. Ambient Temperature YES YES

Ambient Relative Humidity NO NO

Wind Speed YES NO

Solar Radiation NO NO

Sky Clearness NO NO

Zone Air Temperature YES YES Zone Relative Humidity YES YES Zone CO2 level NO NO

Zone Occupancy NO NO

Heating Load NO NO

(5)

The next step was the examination of linear correlation using the Pearson correlation analysis and the results are shown in Table 3. Both normally and non-normally distributed variables were included in the examination process in order to apprehend the linear relationship between all variables.

Table 3. Absolute values of Pearson correlation coefficient results.

Variable 1 Variable 2 Cork W.D. Cork W.D. London W.D. London W.D.

rP Qualitatively rP Qualitatively

Solar Radiation Sky Clearness 0.920 very strong 0.911 very strong Zone CO2 level Zone Occupancy 0.838 – 0.870 very strong 0.790 – 0.835 very strong

Zone Air Temperature Zone Air Temperature Zone Air Temperature Zone Relative Humidity Heating Load Heating Load Gas Consumption Ambient Temperature Sky Clearness Solar Radiation

Ambient Relative Humidity Gas Consumption Ambient Temperature Zone Air Temperature

0.600 – 0.770 0.549 – 0.724 0.515 – 0.637 0.572 – 0.721 0.628 0.753 0.675 – 0.892 strong moderate to strong moderate to strong moderate to strong strong strong strong 0.839 – 0.914 0.531 – 0.644 0.579 – 0.586 0.564 – 0.737 0.365 0.369 0.311 – 0.371 strong moderate to strong moderate to strong moderate to strong weak weak weak

The Pearson coefficient indicates a correlation between well-established variable pairs, including Solar Radiation with Sky Clearness and Zone Occupancy with Zone CO2 levels. The correlation of Heating Load and Gas

Consumption with other variables is “strong” when Cork weather data is used but is “weak” when London weather data is applied. The lack of consistency when using different weather data implies that the correlation may be monotonic instead of linear. To clarify if this case is valid, a Spearman analysis is performed. The outcome of the examination for monotonic relationships between data is displayed in Table 4.

Table 4. Absolute values of Spearman correlation coefficient results.

Variable 1 Variable 2 Cork W.D. Cork W.D. London W.D. London W.D.

rS Qualitatively rS Qualitatively

Solar Radiation Sky Clearness 0.885 very strong 0.916 very strong Zone CO2 level Zone Occupancy 0.678 – 0.751 strong 0.678 – 0.752 Strong

Zone Air Temperature Zone Air Temperature Zone Air Temperature Zone Relative Humidity Heating Load Heating Load Gas Consumption Ambient Temperature Sky Clearness Solar Radiation

Ambient Relative Humidity Gas Consumption Ambient Temperature Zone Air Temperature

0.570 – 0.764 0.572 – 0.769 0.455 – 0.690 0.601 – 0.742 0.864 0.820 0.660 – 0.880 moderate to strong moderate to strong moderate to strong strong very strong very strong strong 0.802 – 0.915 0.512 – 0.681 0.540 – 0.661 0.522 – 0.742 0.835 0.790 0.311 – 0.371 very strong moderate to strong moderate to strong moderate to strong very strong strong weak

Based on the Spearman analysis, the existence of correlation between variables that had been discovered with Pearson coefficient is verified. Table 4 reveals that the correlation between Heating Load and Gas Consumption can be better described as a monotonic one, since results yield a “moderate” to “strong” correlation. Another interesting finding from the Spearman analysis is the correlation between Heating Load and Ambient Temperature, which is consistent. The relationship between Gas Consumption and Zone Air Temperature is not consistent when changing the location, but the fact that there is a “strong” to “very strong” correlation for one of them cannot be neglected.

Through this analysis, two pairs of input variables that include double-information were identified. Sky Clearness and Solar Radiation is the first one. These two variables have a “very-strong” correlation, but this was expected since when the sky is clear, higher solar radiation reaches the building. Heating Load and Gas Consumption is the second pair of variables. Initial results from the Pearson correlation coefficient indicated that using different weather data can interfere with their linear correlation. A deeper analysis of the Spearman correlation coefficient revealed nevertheless that they have a “very-strong” monotonic correlation.

(6)

5. Conclusions

It is clearly shown that the variables that affect Heating Load are Ambient Temperature, Ambient Relative Humidity and Solar Radiation. Moreover, the relationship between Zone Air Temperature and Heating Load requires deeper examination, since this might be a critical variable for the predictive model. To summarise, the input variables under consideration for the predictive model are Ambient Temperature, Ambient Relative Humidity and Solar Radiation, while Zone Air Temperature will be further assessed. Future research plans include the application of the methodology to different types of commercial buildings in various climates in order to investigate the existence of a pattern related to the selection of input variables.

Acknowledgements

This research has been conducted with financial support from the Irish Research Council and United Technologies Research Centre (UTRC). The research was also financially assisted by the Electricity Research Centre, University College Dublin, which is supported by the Commission for Energy Regulation, Bord Gis Energy, Bord na Mona Energy, Cylon Controls, EirGrid, Electric Ireland, Energia, EPRI, ESB International, ESB Networks, Gaelectric, Intel and SSE Renewables. The EnergyPlus model for the testbed building was provided by UTRC.

References

[1] International Energy Agencies. Transition to Sustainable Buildings Paris: IEA; 2013.

[2] Ferreira P, Ruano A, Silva S, Conceicao E, Neural networrks based predictive control for thermal comfort and energy savings in public buildings. Energy Build. 2012 August; 55: p. 238-251.

[3] European Commission. Directive 2010/31/EU of the European Parliament and Council on Energy Performance of Buildings (recast). Brussels; 2010.

[4] Kudiak A, Li M, Zhang Z, A data-driven approach for steam load prediction in buildings. Appl. Energy 2010; 87: p. 925-933.

[5] U.S. DOE. Building Energy Software Tools Directory. 2011. Accessed 8 Jan. 2015. Available: http://apps1.eere.energy.gov/buildings/ tools_directory.

[6] Kwok S, Lee E, A study of the importance of occupancy to building cooling load in prediction by intelligent approach. Energ Convers

Manage 2011 March; 52: p. 2555-2564.

[7] Foucquier A, Robert S, Suard F, Stéphan L, Jay A, State of the art in building modelling and energy performances prediction: A review.

Renewable and Sustainable Energy Reviews 2013; 23: p. 272-288.

[8] U.S. D.O.E. EnergyPlus Simulation Software. 2015. Accessed 8 Jan. 2015. Available: http://apps1.eere.energy.gov/buildings/ energyplus. [9] Aranda A, Ferreira G, Mainar-Toledo M, Scarpellini S, Sastresa EL, Multiple regression models to predict the annual energy consumption

in the Spanish banking sector. Energy Build. 2012; 49: p. 380-387.

[10] Catalina T, Virgone J, Blanco E, Development and validation of regression models to predict monthly heating demand for residential buildings. Energy Build. 2008; 40: p. 1825-1832.

[11] Catalina T, Iordache V, Caracaleanu B, Multiple regression model for fast prediction of the heating energy demand. Energy Build. 2013; 57: p. 302-312.

[12] Kalogirou SA, Neocleous CC, Schizas CN. Building heating load estimation using artificial neural networks. In Int. Conf. on Parallel Architectures and Compilation Techniques; San Francisco, California, US; 1997. p. 1-8.

[13] Kalogirou S, Florides G, Neocleous C, Schizas C. Estimation of Daily Heating and Cooling Loads Using Artificial Neural Networks. In 7th REHVA World Congress; Napoli, Italy; 2001: Clima 2000.

[14] González PA, Zamarreño JM, Prediction of hourly energy consumption in buildings based on a feedback artificial neural network. Energy

Build. 2005; 37(6): p. 595-601.

[15] Dong B, Cao C, Lee SE, Applying SVM to predict building energy consumption in tropical region. Energy Build. 2005; 37(5): p. 545-553. [16] Li Q, Meng Q, Cai J, Yoshino H, Mochida A, Applying SVM to predict hourly cooling load in the building. Appl. Energy 2009; 86(10): p.

2249-56.

[17] Li Q, Meng Q, Cai J, Yoshino H, Mochida A, Predicting hourly cooling load in the building: A comparison of support vector machine and different artificial neural networks. Energ Convers Manage 2009; 50(1): p. 90-96.

[18] Valdivia V, O' Connell S, Gonzalez-Espin F, El-din Mady A, Kouramas K, De Tommasi L, et al. Sustainable building integrated energy test-bed. In IEEE 5th Int. Sym. on Power Electronics for Distributed Generation Systems (PEDG); Hiroshima, Japan; 2014. p. 1-6. [19] Justel A, Peña D, Zamar R, A multivariate Kolmogorov-Smirnov test of goodness of fit. Statist. Probab. Lett 1997 October; 35(3): p. 251-9. [20] Spiegel M, Schiller J, Srinivasan A. Probability and Statistics. 4th ed.: McGraw Hill Professional; 2013.

[21] IBM Corp. IBM SPSS Statistics Version 22.0. Accessed 12 Jan. 2015. Available: http://www-01.ibm.com/software/analytics/spss/products/ statistics.

References

Related documents

• Follow up with your employer each reporting period to ensure your hours are reported on a regular basis?. • Discuss your progress with

This unique quality of HRV/HRC analyses has led to interest in its application in pre-hospital triage and emergency clinical decision making; interest which has been bol- stered by

It was decided that with the presence of such significant red flag signs that she should undergo advanced imaging, in this case an MRI, that revealed an underlying malignancy, which

To illustrate the estimation of the covariate effect θ , N = 100 samples were simulated with sample sizes n = 100, 250, 500, and 1000. Again, the true value.. The results of

For instance, North America and Europe, both being mature markets, are the most consolidated markets where the top three distributors hold 30 to 40% and 15 to 20% of the

Field experiments were conducted at Ebonyi State University Research Farm during 2009 and 2010 farming seasons to evaluate the effect of intercropping maize with

Regurgitation Area Segmentation Using the Particle Swarm Optimization and Multilevel Threshold Selection.. Kalpana Chauhan 1*