Genetic algorithm-determined deep feedforward neural network architecture for predicting electricity consumption in real buildings

(1)

ContentslistsavailableatScienceDirect

Energy

and

AI

journalhomepage:www.elsevier.com/locate/egyai

Genetic

algorithm-determined

deep

feedforward

neural

network

architecture

for

predicting

electricity

consumption

in

real

buildings

X.J.

Luo

a

,

Lukumon

O.

Oyedele

a ,∗

,

Anuoluwapo

O.

Ajayi

a

,

Olugbenga

O.

Akinade

a

,

Juan

Manuel

Davila

Delgado

a

,

Hakeem

A.

Owolabi

a

,

Ashraf

Ahmed

b

a_Big_Data_Enterprise_and_Artiﬁcial_Intelligence_Laboratory_(Big-DEAL),_University_of_the_West_of_England_(UWE),_Frenchay_Campus,_Bristol,_United_Kingdom b_Department_of_Civil_and_{Environmental}_Engineering,_Brunel_University_London,_London,_United_Kindom

h

i

g

h

l

i

g

h

t

s

g

r

a

p

h

i

c

a

l

a

b

s

t

r

a

c

t

•A GA-determined DFNN architecture for

real building electricity consumption. •Genetic algorithm to determine the op-

timal architecture of DFNN model. •Both day-ahead hourly and week-ahead

daily electricity consumption are consid- ered.

•One year and six months of measurement data from a real campus building is used for training and testing the proposed GA-DFNN model.

•The proposed model has better performance than ANN, LSTM and TCN.

a

r

t

i

c

l

e

i

n

f

o

Articlehistory: Received 30 April 2020

Received in revised form 16 June 2020 Accepted 30 June 2020

Available online 7 July 2020 Keywords:

Prediction Deep learning

Feedforward neural network Genetic algorithm Electricity consumption

a

b

s

t

r

a

c

t

A genetic algorithm-determined deep feedforward neural network architecture (GA-DFNN) is proposed for both day-ahead hourly and week-ahead daily electricity consumption of a real-world campus building in the United Kingdom. Due to the comprehensive relationship between affecting factors and real-world building electricity consumption, the adoption of multiple hidden layers in the deep neural network (DFNN) algorithm would improve its prediction accuracy. The architecture of a DFNN model mainly refers to its quantity of hidden layers, quantity of neurons in the hidden layers, activation function in each layer and learning process to obtain the connecting weights. The optimal architecture of DFNN model was generally determined through a trial-and-error process, which is an exponential combinatorial problem and a tedious task. To address this problem, genetic algorithm (GA) is adopted to automatically design an optimal architecture with improved generalization ability. One year and six months of measurement data from a campus building is used for training and testing the proposed GA- DFNN model, respectively. To demonstrate the effectiveness of the proposed GA-DFNN prediction model, its prediction performance, including mean absolute percentage error, coefficient of determination, root mean square error and mean absolute error, was compared to the reference feedforward neural network models with single hidden layer, DFNN models with other architecture, random search determined DFNN model, long-short-term- memory model and temporal convolutional network model. The comparison results show that the proposed GA-DFNN predictive model has superior performance than all the reference prediction models, demonstrating the optimization effectiveness of GA and the prediction effectiveness of DFNN model with multiple hidden layers and optimal architecture.

∗_{Corresponding}_author.

E-mailaddresses:[email protected](X.J. Luo)., [email protected], [email protected](L.O. Oyedele).. https://doi.org/10.1016/j.egyai.2020.100015

(2)

Nomenclature

d Dayoftheweek

E Electricityconsumption

f Activationfunction

h Houroftheday

m Monthoftheyear

MAPE Meanabsolutepercentageerror

MPE Meanpercentageerror

n Quantityofhiddenlayer

N Quantity

R2 _Coeﬃcient_of_{determination} RMSE Rootmeansquareerror

T Totaltrainingsamplesortemperature

W Setofweightingfactors

x Neuronintheinputlayer

X Inputdataset

Y Neuroninthehiddenlayer

z Neuronintheoutputlayer ̂

𝑍 EstimatedoutputthroughtheDFNNmodel

Z Outputdataset

Subscripts

d Daily

db Dry-bulb

dew Dew-point

i Numberofneuronsintheinputlayer

j Numberofneuronsinthehiddenlayer

k Numberofhiddenlayers

h Hourly

H Hiddenlayer

IN Inputlayer

O Outputlayer

t Numberofsamples,i.e.timestep

Abbreviations

ADAM Adaptivemomentestimation ANN Artiﬁcialneuronnetwork elu Exponentiallinearfunction DFNN Deepfeedforwardneuralnetwork FNN Feedforwardneuralnetwork

GA Geneticalgorithm

GA-DFNN Geneticalgorithmdetermineddeepfeedforward neu-ralnetworkarchitecture

LSTM Long-term-short-memory

NADAM Nesterov-acceleratedAdaptiveMomentEstimation PSO Particleswarmoptimization

relu Rectiﬁedlinearunitfunction SGD Stochasticgradientdescentapproach TCN Temporalconvolutionalnetworks

1. Introduction

Therapideconomicandpopulationgrowthareacceleratingthe con-sumptionofelectricalenergy[1] .AsindicatedbyWorldEnergyOutlook 2017,theprimaryenergyconsumptionallovertheworldwasprojected togrowatacompoundannualgrowthrateof1.0%over2016–2040 [2] .Duetoitsphysicalcharacteristics,electricalenergymustbe con-sumedassoonasitisgeneratedinthepowerplant.Therefore,accurate predictionofelectricitydemandisessentialforstablepowersupply.On theotherhand,duetothefactthatpeoplespendasubstantialfraction oftimeinbuildings,electricityconsumptioninbuildingsisincreasing rapidly[3 ,4] .Asaresult,itisofsigniﬁcantimportancetoaccurately predictelectricityconsumptionandfacilitatethebuildingenergy sys-temmanagement.

1.1. Literaturereview

Withtheadventoftheeraofbigdata,buildingshavebecomenot onlyenergy-intensivebutalsodata-intensive.Data-miningtechnologies havebeenwidelyutilisedtoinvestigatethevaluesofmassiveamounts ofbuildingoperationdatawiththeaimofimprovingtheoperating per-formanceofbuildingenergysystems[5] .Asrevealedbythereview pa-pers[6–9] ,variousartificialintelligencetechniqueshavebeenapplied inforecastingbuildingenergyconsumption,includingmultiplelinear regression,statisticalregression,decisiontree,autoregressiveintegrated movingaveragemodel,supportvectorregression,neuralnetworkand theirensemblemodels.Duetoitsabilitytoapproximatethecomplicated nonlinearrelationshipbetweentheinputandtheoutputdatasetsofa complicatedsystemwitharbitraryandprecision,neuralnetworkshave beenwidelyadoptedtoinvestigatetheperformanceofbuildingenergy. Theneuralnetworkmodelsmainlyincludefeedforwardneural net-work(FNN),convolutionneuralnetworkandrecurrentneuralnetwork. TheFNNwithsinglehiddenlayeristhemostpopulartypeofneural networks inbuildingenergyanalysis.Luoetal.[10 ,11] proposed an FNNpredictionmodelforbuildingheatingandcoolingdemands.The quantityofneuronsinthehiddenlayerwaschosenthroughsensitivity analysisintherangeof60–80whilerectifiedlinearunitwasadoptedas theactivationfunctioninboththehiddenandoutputlayer.Singhetal. [12] proposedanFNNpredictivemodelwithsinglehiddenlayerfor electricityconsumptioninurbanarea.Inthehiddenlayer,therewere20 neuronswhilethesigmoidactivationfunctionwasadopted.Menaetal. [13] proposedtwoFNNpredictivemodelsforelectricityconsumption predictioninabioclimaticbuilding.Thequantityofneuronsintheinput andhiddenlayerwas17and10inthecompleteFNNmodel,while6and 15intheparticularFNNmodel.Thehyperbolictangentwasadoptedas activationfunctioninbothFNNmodels.Kusiaketal.[14] evaluated theperformanceof anFNNpredictivemodelforbuildingsteamload predictionwithdifferentarchitecture.Thequantityof neuronsin the singlehiddenlayerwastestedintherangebetween4and9.Asfor acti-vationfunctions,logistic,hyperbolictangentandexponentialfunctions weretestedinthehiddenlayerwhilesine,identityandlogistic func-tionsweretestedattheoutputlayer,respectively.Debetal.[15] pro-posedaneuralnetworkpredictivemodelforcoolingenergy consump-tionin aninstitutional buildingusingthefeedforwardstructure.The optimalquantityofneuronsinthehiddenlayer,20,wasdetermined throughatrial-and-errorprocess.Thesigmoidactivationfunctionwas adoptedinboththehiddenandoutputlayers.Ahmadetal.[16] pro-posedthreeFNNpredictivemodelsusingthemultiplelinearregression, GaussianprocessregressionandLevenberg–Marquardtbackpropagation approach,respectively.TherewasonlyonehiddenlayerineachANN model,whilethedifferentquantitiesofneuronsinthehiddenlayerwere testedat10,15and20.Yangetal.[17] proposedanFNNmodelfor en-ergyconsumptionpredictioninanofficebuildinginCanada.The quan-tityofneuronsinthehiddenlayerwasdeterminedusingtheempirical equationin[18] ,whichisonemorethantwicethequantityofinput neurons.Sigmoidandlinearactivationfunctionwasadoptedinthe hid-denandoutputlayer,respectively.Wangetal.[19] proposedanFNN modelforcoolingloadinahigh-riseofficebuildinginHongKong.The quantityofneuronsinthehiddenlayerwasdecidedthroughthe em-piricalequationin[20] ,whichisthetotalvalueofquantityof input andoutputneuronsandthesquarerootoftrainingsamples.Hyperbolic tangentandlinearactivationfunctionwereadoptedinthehiddenand outputlayer,respectively.Bagnascoetal.[21] proposedanFNNmodel forpredictingelectricityconsumptioninahospitalbuilding.Tofindout theoptimalquantityofneuronsinthehiddenlayer,itwastestedinthe rangeof6–33,whilethebestvaluewasfoundtobe10and30intwo differentcases.Moreover,differentactivationfunctionsweretestedin thehiddenandoutputlayers,withthecombinationof logsig/tansig, tansig/purelin,logsig/purelinandtansig/tansig.Thetansigactivation functionwasfoundtobeoptimalinboththehiddenandoutputlayers. Muhammadetal.[22] comparedthepredictionperformancebetween

(3)

anFNNandarandomforestmodelforenergyconsumptionofair con-ditioningsysteminahotel.SensitivityanalysisoftheFNNmodelwas conductedfordiﬀerentquantityofneuronsinthehiddenlayerinthe rangeof10–15.Meanwhile,sigmoidactivationfunctionwasadoptedin bothhiddenandoutputlayers.

Intheabove-mentionedFNNpredictivemodelswithsinglehidden layer,Levenberg–Marquardt backpropagationwas adopted toobtain theweighting factorsin most of thestudies. Toachieve faster con-vergenceandhigherefficiency,evolutionaryoptimizationalgorithms, suchasteaching-learningalgorithm[23] ,particleswarmoptimization (PSO)[24–26] andgeneticalgorithm(GA)[26] ,wereadoptedto au-tomaticallyadjustweightingfactorsandthresholdvaluesofFNN mod-els.Lietal.[23] proposed aFNNpredictive modelforbuilding en-ergyconsumptionandadopteditintwocampusbuildingslocatedin theUSAandChina,respectively.Thequantityofneuronsinthe hid-denlayerwasfixedat20whilehyperbolictangentactivationfunction wasadoptedinboththehiddenandoutputlayers.Toevaluatethe ef-fectsofchannellength,depth,widthandairmassflowrateonenergetic performanceofabuildingintegratedphotovoltaic/thermalsystem, Al-naqietal.[24] proposedaFNNmodel.SensitivityanalysisoftheFNN modelwasconductedfordifferentquantityofneuronsin thehidden layerwithintherangeof1–8,while8wasfoundtobetheoptimalone. Hyperbolictangentandsigmoidactivationfunctionweretestedinboth thehiddenandoutputlayers,respectively.PSOwasadoptedtoadjust theweightingfactorsintheFNNtoaccelerateitsconvergence.Lietal. [25] proposedaFNNmodelforpredictingelectricityconsumptionin alibraryinEastChina.Thequantityofneuronswaschosenbasedon theempiricalequationwhilehyperbolictangentsigmoidwasadopted inthehiddenandoutputlayersasactivationfunction.Particleswarm optimizationwasadoptedtosearchfortheoptimalweightingfactors inFNN.Muralitharanetal.[26] proposedaFNNapproachtopredict energydemandattheconsumerend.TheweightingfactorsoftheFNN wereautomaticallyadjustedbyGA.

Deepfeedforwardneuralnetworkmodels(DFNN)havedeeper archi-tectures,whichenabletheinputdatabeingtransformedmultipletimes beforederivingtheoutput.Owingtoitsmultiplehiddenlayers,DFNN is moreappropriatefor comprehensivedata [27 ,28] . Marijana etal. [29] adoptedtheDFNNmodelswithdiﬀerentarchitectureforenergy consumptionpredictioninpublicbuildings.Thearchitectureincluded 2hiddenlayerswiththecombinationof75–59,71–87,57–63,aswellas 3hiddenlayerswiththecombinationof87–99–94,14–31–28and61– 70–69.Sigmoidfunctionwasadoptedastheactivationfunctionwhile adaptivemomentestimationoptimizationalgorithmwasadoptedto ob-taintheweightingfactors.Bendaetal.[30] proposedaDFNNmodelfor predictingelectricityconsumptioninCapeTownControlcenter.There weretwodiﬀerentarchitectureofDFNNmodelwithtwohiddenlayers, onewiththecombinationof56–15,andtheotherwith58–20.Torres etal.[31] proposedaDFNNpredictionmodelforelectricity consump-tioninSpain.Randomsearchwasadoptedtotunethehyper-parameters, includingthequantityof hiddenlayers,quantity ofneurons ineach hiddenlayerandthelearningrate.Thequantityofhiddenlayerswas searchedintherangeof1–5whilethequantityofneuronsinthehidden layerintherangeof10–100.Parketal.[32] proposedaDFNNmodelfor predictingenergyproductionofagroundsourceheatpump.Througha trial-and-errorprocess,thearchitectureoftheDFNNmodelwas deter-minedtobe10–5–2.Hyperbolictangentandlinearactivationfunction wereadoptedinthehiddenandoutputlayer,respectively. Lu etal. [33] proposedaDFNN-basedpredictivemodelforpriceandload fore-castinginthesmartgrid.Throughseveralaccuracytests,theselected DFNNmodelhadthreehiddenlayers,with30,20and10neuronsin eachlayer.Sigmoidfunctionservedastheactivationfunctionforeach layer,whileweightandbiasadjustmentineachlayerwastrainedby Levenberg–Marquardtbackpropagationalgorithm.

Theresultsfromtheabove-mentionedresearchworksindicatedthat theDFNN-basedpredictivemodelhadhigherpredictionaccuracythan theconventionalFNNmodelswithsinglehiddenlayer.However,the

optimalDFNNarchitectureiscontingentonthetypeofproblemtobe solved,suchasthequantityoftrainingsamplesaswellasthequantityof inputandoutputneurons.Asaresult,thefullabilityoftheDFNN-based predictivemodelshouldbeexploitedbyoptimizingitsarchitecture. Al-thoughthearchitectureoftheDFNNmodelswasdesignedbyheuristic trial-and-errorapproachesinsomeofthepreviousresearchworks[31– 33] ,itistime-consumingandmightreachthelocal-optimalarchitecture ofDFNN.

1.2. Researchgapandcontribution

Basedupontheliteraturereview,itisfoundthattheprevious re-searchworksinbuildingenergypredictionhavethefollowing limita-tions:

• IntheFNNmodelswithsinglehiddenlayer,thequantityof neu-ronsinthehiddenlayerwaschosenbyexperience[12 ,13 ,23] ,using diﬀerent empiricalequations[17 ,19 ,25] ,through sensitivitytests [10 ,11 ,14 ,16 ,21 ,22] and[24] orviatrial-and-errorprocess[15] . Al-thoughsomeevolutionaryalgorithmswerehybridwithFNNmodels toimprovethepredictionaccuracyandspeedupthecomputation convergence,theyweremainlyfocusingonautomaticallyadjusting theweightingfactors.

• TheDFNNmodelswithmultiplehiddenlayerswouldbeableto re-vealamorecomprehensiverelationshipamongdatasets.However, thearchitectureoftheDFNNmodelwouldbecomemorecomplex withtheincreaseofhiddenlayers.Thereisnoempiricalequations orrule-of-thumbtoselectthequantityofhiddenlayers,quantityof neuronsineachlayer,activationfunctionineachlayeraswellas learningprocessforweightingfactors.However,whenthequantity ofhiddenlayersistoosmall,thenetworkcouldresultinunder-ﬁtting andlargepredictionerrors.Whereas,ifthequantityofhiddenlayers istoolarge,thenetworkmightlearnthenoiseinthetrainingdataset andresultinoverﬁtting.

• InpreviousresearchonDFNNmodels,thequantityofhidden lay-ersandthequantityofneuronsineachhiddenlayerwereselected basedonexperience[29 ,30] ortrial-and-errorprocessthrough enu-meration[31–33] ,whichmightnotbeabletoachievetheoptimal performanceorcauseexpensivecomputationload.Moreover,the authorsdidnotmentiontheapproachorcriteriainselectingthe ac-tivationfunctionsandlearningprocess.

Therefore,theobjectivesofthisstudyaresummarizedasfollows: • Duetothecomplex aﬀectingfactorsandbuildingelectricity

con-sumption,theDFNNmodelwithmultiplehiddenlayersisadopted toconstructthepredictivemodeltoimprovetheforecasting accu-racyinbuildingelectricityconsumption;

• Toguaranteetheeﬀectiveness oftheDFNNmodel, GA optimiza-tionisutilisedtodetermineitsoptimalarchitecture,includingthe quantityofhiddenlayers,quantityofneuronsinthehiddenlayers, activationfunctioninboththehiddenandoutputlayeraswellas thelearningprocessfordeterminingweightingfactors;

• Tofacilitate buildingenergymanagement,bothday-aheadhourly and week-ahead daily electrical energy consumption are fore-castedbytheproposedGA-determinedDFNN(GA-DFNN)prediction model.

Therestofthispaperisorganizedasfollows.Section 2 elucidatesthe proposed GA-DFNNpredictivemodel.Section 3 illustratesthe prepa-rationofthehistoricaldatabase.Section 4 discussestheperformance of GAoptimization andtheoptimalarchitectureof DFNNpredictive model.Section 5 presentstheperformancecomparisonagainstvarious referencepredictionmodels.Section 6 clariﬁesthepracticalimplication andfutureapplication,whileSection7summarizestheconclusions.

(4)

Fig.1. Diagram of DFNN model.

2. Geneticalgorithm-determineddeepneuronnetwork architecture

Toimprovethepredictionaccuracyofbuildingelectricity consump-tion,theGA-DFNNpredictionmodelisproposedinthisstudy.Tobe morespeciﬁc,theGAiscollaborativelyhybridwiththeDFNNalgorithm inordertoﬁgureouttheoptimalDFNNarchitecture.

2.1. BrieftheoryoftheDFNN

TheDFNNisamathematicalmodelwhichwasdesignedby mimick-ingthewaythehumanbrainprocessesinformation.TheDFNNmodels aregenerallycomprisedofthreeprincipallayers:aninputlayer,several hiddenlayersandanoutletlayer.Thequantityofhiddenlayersdeﬁnes thedepthofthearchitecture.Withlargequantityofhiddenlayersand quantityofneuronsineachhiddenlayer,DFNNcanprovidea multi-levelrepresentationofdataset.ThediagramofageneralDFNNmodel ispresentedinFig. 1 .

2.1.1. DFNNalgorithm

The training database is obtained through the historical data, containing bothinput dataset𝐗={𝑋_𝑖|𝑖=1, 2,…,𝑁_𝐼𝑁} andoutput dataset𝐙={𝑍_𝑡|𝑡=1,2,…, 𝑇}.ThetotaltypesofinputvariablesisNIN, whilethetotaltrainingsamplesisT.Foreachtypeofinputvariables,Xi containsTtimesamples,and𝑋𝑖={𝑥𝑖,𝑡|𝑡=1,2,…,𝑇}.Therearen hid-denlayersandNH,kneuronsinthekthhiddenlayer.Therefore,forthetth (𝑡=1,2,…, 𝑇)trainingsample,the𝑗₁𝑡ℎ(𝑗₁|𝑗₁=1, 2,…,𝑁_𝐻,₁)neuron intheﬁrsthiddenlayercanbecalculatedas:

𝑌1 ,𝑗1=𝑓 (_𝑖₌_𝑁 𝐼𝑁 ∑ 𝑖=1 ( 𝑊𝐼𝑁,𝑖,1 ,𝑗1𝑋𝑖 )) (1) whilethe𝑗𝑡ℎ

𝑘 neuroninthekth(n≥k≥2)hiddenlayercanbeobtained as: 𝑌𝑘,𝑗𝑘=𝑓 ⎛ ⎜ ⎜ ⎝ 𝑗𝑘−1=∑𝑁𝐻,𝑘−1 𝑗𝑘−1=1 ( 𝑊𝑘−1 ,𝑗𝑘−1,𝑘,𝑗𝑘𝑌𝑘−1 ,𝑗𝑘−1 )⎞ ⎟ ⎟ ⎠ (2)

Theneuronzintheoutputlayerisdeterminedas:

𝐙=𝑓 ⎛ ⎜ ⎜ ⎝ 𝑗𝑛= _∑𝑁𝐻,𝑛 𝑗𝑛=1 ( 𝑊𝑛,𝑗𝑛,𝑂𝑌𝑛,𝑗𝑛 )⎞ ⎟ ⎟ ⎠ (3)

wherewistheconnectionweightingfactorsbetweenthetwoneurons,i

isthenumberofneuronsintheinputlayer,kisthenumberofhidden layers,jkisthenumberofneuronsinthekthhiddenlayer,andfisthe activationfunction.

2.1.2. Activationfunction

Toachievebetterpredictionperformance,fourdiﬀerentactivation functions are tested,including sigmoid function, hyperbolic tangent (tanh)function,rectiﬁedlinearunit(relu)functionandexponential lin-ear(elu)function[34] .Thefouractivationfunctionscanbeexpressed asthefollows.

• sigmoidfunction: 𝑓(𝑥)= 1

1+𝑒− 𝑥 (4)

• hyperbolictangentfunction:

𝑓(𝑥)=tanh(𝑥) (5)

• rectiﬁedlinearunitfunction: 𝑓(𝑥)=

{

0𝑓𝑜𝑟𝑥<0

𝑥𝑓𝑜𝑟𝑥≥0 (6)

• exponentiallinearfunction: 𝑓(𝑥)=

{

𝑒𝑥₋₁_𝑓𝑜𝑟_𝑥_<₀

𝑥𝑓𝑜𝑟𝑥≥0 (7)

2.1.3. Learningapproach

The aim of the learning process of DFNN model is to minimize the mean absolute percentageerror MAPE between the produced ̂𝐙 and desired output Z by adjusting the group of weighting factors 𝑊 ={𝑊_𝐼_𝑁_,𝑖,₁_,_𝑗₁|𝑁_𝐼𝑁≥𝑖≥1,𝑁_𝐻,₁≥𝑗₁≥1}∪ {𝑊_𝑘₋₁_,_𝑗_𝑘₋₁_,𝑘,_𝑗_𝑘|𝑛≥𝑘≥ 2, 𝑁_𝐻,_𝑘−1≥𝑗𝑘≥1}∪ {𝑊𝑛,𝑗𝑛,𝑂|𝑁𝐻,𝑘≥𝑗𝑛≥1}.

Inthisstudy,sevendiﬀerentoptimizationapproachesareadopted totraintheproposed DFNNmodel,includingstochasticgradient de-scent(SGD)[35] ,adaptivemomentestimation(ADAM)[36] , Nesterov-acceleratedadaptivemomentestimation(NADAM)[37] andavariant ofADAMbasedontheinﬁnitynorm(ADAMAX)[38] .

2.1.4. Performanceindicators

Toevaluatetheperformanceof theproposed model,four perfor-manceindicatorsaredeﬁned,includingthemeanabsolutepercentage error,coeﬃcientofdetermination,meanpercentageerroraswellasroot meansquareerror:

• Meanabsolutepercentageerror 𝑀𝐴𝑃𝐸= 1 𝑇 𝑡= 𝑇 ∑ 𝑡=1 ||𝑧𝑡−̂𝑧_𝑡|| 𝑧𝑡 × 100% (8) • Coeﬃcientofdetermination 𝑅2 = [ ∑𝑡=𝑇 𝑡=1 ( 𝑧𝑡− ∑𝑙=𝑇 𝑙=1𝑧𝑡 𝑇 ) ⋅ ( ̂𝑧𝑡− ∑𝑡=𝑇 𝑡=1̂𝑧𝑡 𝑇 )]2 ∑𝑡=𝑇 𝑡=1 ( 𝑧𝑡− ∑𝑡=𝑇 𝑡=1𝑧𝑡 𝑇 ) ⋅∑𝑡=𝐿 𝑡=1 ( ̂𝑧𝑡− ∑𝑡=𝑇 𝑡=1̂𝑧𝑡 𝑇 ) (9)

• Rootmeansquareerror

𝑅𝑀𝑆𝐸=

√_∑

𝑡= 𝑇

𝑡=1 (𝑧𝑡−̂𝑧𝑡)2

𝑇 (10)

• Meanabsoluteerror 𝑀𝐴𝐸= 1 𝑇 𝑡=𝑇 ∑ 𝑡=1|| 𝑧𝑡−̂𝑧_𝑡|| (11) 2.2. Geneticalgorithm

GAiscapable ofsearchingfortheglobaloptimumintheinﬁnite, complex,multimodal,andnon-diﬀerentiablesearchspacetodetermine theoptimalDFNNarchitecture[39 ,40] .Inthebeginning,apopulation ofchromosomesinthesolutionspaceisrandomlyinitialized.The qual-ityofeachchromosomeisassessedusingtheobjectivefunction;thus theprobabilityofadoptingeachoftheminthenextgenerationcanbe

(5)

Table1

Summary of architecture of FNN and DFNN in literature review.

References Quantity of neurons in input layer Quantity of neurons in hidden layers Quantity of neurons in output layer Training samples

[10] 9 60–80 1 8760 [11] 6 15or20 1 8760 [12] 5 20 1 26,280 [13] 17 10 1 700,000 6 15 [14] 2 4~9 1 722 [15] 5 20 1 250 [16] 5 10or15or20 1 8928 [17] 12 25 1 8760 [19] 8 36 1 742 [21] 8 10or30 1 7200 [22] 10 10 1 10,972 [23] 3 20 1 2472 [24] 4 1~8 1 20 [25] 3 11 1 2472 [29] 82 75–59,71–87,57–63 2 17,000 87–99–94,14–31–28and61–70–69 3 [30] 56 56–15 2 17,520 58 58–20 [31] 168 10~100 1~5 497,832 [32] 13 10–5–2 3 217,440 [41] 4 30–30 2 2700 [42] 7 512 3 19,704 [43] 4 12–12 2 279 [44] 784 500perlayer 9 600,000

determined.Generally,thenewchromosomeisgeneratedthroughthe threegeneticoperators:selection,crossoverandmutation.Basedonthe fitnessvaluesoftheobjectivefunction,somechromosomesareselected toremainwhile othersarechosenforcrossoverandmutation. Chro-mosomeswhichareselectedforthecrossoveroperationarenamedas theparents.Throughcrossoveroperation,featuresofparentsare ex-changedbetweeneachothertoproducetheoffspring.Meanwhile,the mutationoperatorimplementstheprocessofalteringthefeatureswithin thechromosome.Throughconductingthethreegeneticoperations,the initialpopulationisupdatedwiththeoffspringgeneratedbycrossover andmutation.Atthefinalstage,theoffspringwiththebestfitnessvalues isreturnedtorepresenttheoptimumsolution.

2.3. GA-DFNNpredictivemodel

Althoughtherearethreedifferentempiricalequationsin determin-ingtheoptimalquantityofneuronsinthesinglehiddenlayerofFNN, thereisno rule-of-thumbforchoosingthequantityof hiddenlayers, quantityofneuronsinthehiddenlayers,activationfunctionsineach layeraswellaslearningprocessfordeterminingweightingfactors. How-ever,thesearchitecturalattributesareresponsiblefortheperformance ofDFNN.Thenetworkwithasmallsizecancauseunder-fittingwhile alargenetworkmayleadtooverfitting.Theactivationfunction deter-minestheconnectingperformanceamongeachneuron,whilethe learn-ingprocessaffectstheconvergenceofDFNN.Theseimplythatthe ar-chitecturaldesignofDFNNmodelisverycrucialandcanbedefinedas anoptimizationproblem.

ToimprovethepredictionaccuracyofDFNNpredictivemodel,GAis appliedtodeterminetheoptimalarchitectureoftheDFNNmodel.The decisionvariablesofGAincludethetotalquantityofhiddenlayers,the quantityofneuronsineachhiddenlayer,theactivationfunctionaswell asthelearningprocesstocomputetheweightingfactorsoftheDFNN inthetrainingphase.TheoptimizationobjectivefunctionistheMAPE

valueoftheDFNNmodel.Inotherwords,eachsetofdecisionvariables determinesonetypeofarchitectureofDFNNmodel.Throughtraining theDFNNmodelsusinghistoricaldatasets,theMAPEvalueoftheDFNN model,namely,optimizationobjectivefunction,canbeobtained.

ThedecisionvariablesaresummarizedinTable 2 .Togeneratethe poolofdecisionvariablesregardingquantityofhiddenlayersand

quan-Table2

Decision variables of GA for DFNN.

Quantityofneuronsineachhiddenlayer {5,6,7,8,9,10} Quantityofhiddenlayers {2,3,4}

Activationfunctions {sigmoid,tanh,relu,elu} Optimizationapproaches {SGD,ADAM,NADAM,ADAMAX}

tityofneuronsineachhiddenlayer,anin-depthliteraturereviewon FNNandDFNNpredictionmodelswasconducted,assummarized in Table 1 .AswillbediscussedinSection 3 ,thereare8760training sam-plesand9inputneuronsforday-aheadhourlyprediction;while365 trainingsamplesand10inputneuronsforweek-aheaddailyenergy con-sumptionprediction.Inviewofthehiddenlayersandneuronsadopted inthepreviousFNNandDFNNmodels,thequantityofhiddenlayers wastestedintherangeof1–5whilethequantityofneuronsineach layerwastestedintherangeof5–10.Theprocedureoftheproposed predictivemodelisshowninFig. 2 .

Inthebeginning,apopulationof20 DFNNmodelsaregenerated whilerandomdecisionvariablesareassignedtodeterminethe architec-tureofeachDFNNmodel.EachDFNNmodelistrainedusingtheinput andoutputdatasetsfromthehistoricaldatabase.Basedontheﬁtness valueoftheobjectivefunction(i.e.MAPEvalueofDFNN),selection, crossoverandmutationoperatorsofGAareconductedforthe popula-tionofDFNNmodels,respectively.Suchaprocedureisrepeateduntil thepresetoptimizationcriteriaaremet.Finally,optimalDFNN archi-tecturecanbedetermined.

2.4. Researchmethodology

TheproposedGA-DFNNpredictivemodelisimplementedinPython withtheKerasandTensorFlowlibrariesasthebackend.Kerasisa li-braryofopensourcesoftheneuralnetworkdevelopedinPython,which isgoodatthefastcomputationofDFNNandwhichisfocusedon mini-mization,modularity,andscalability[44] .Meanwhile,TensorFlowisan open-sourcesoftwarelibrarywhichprovidesaninterfaceforexpressing andexecutingvariousmachinelearningalgorithms[45] .

(6)

Fig.2. Schematic ﬂowchart of the proposed GA-DFNN predictive model.

Fig.3. Distribution of training, validation and testing datasets.

3. Structureofhistoricaldatabase

TotesttheperformanceoftheproposedGA-DFNNpredictivemodel, itwasimplementedonacampusbuildingintheUnitedKingdom.The energydatawascollectedfromtheNorthavonHouseinUniversityof theWestofEngland,Bristoloverthepast1yearand6months(1July 2018– 31December2019)atthetimestepof1h.

3.1. Compositionofthehistoricaldatabase

Sincethebuildingenveloppropertiesgenerallyremainstableover longtime,theaffectingfactorsofbuildingelectricityconsumption in-cludeweatherconditions,occupancyandoperatingequipmentinthe building.Duetothedifficultyincollectinginformationofoccupancy andoperatingequipment,timeindicators,suchashouroftheday,day oftheweek,monthoftheyear,areusuallychosenasschedule-related inputstorepresentoccupancyandoperatingequipmentscenarios.In addition,thehistoricalelectricityconsumptionduringthepreviousday orweekcanindicatethetrendofprofileinamathematicalway. Con-sequently,threetypesofinputvariables,includingweatherconditions, timeindicatorsandhistoricalenergyconsumption,areselectedforthe proposed GA-DFNNpredictive model.The weatherconditioncan be well-describedusingoutdoorairdry-bulbtemperature,outdoorair dew-pointtemperature,windspeedandsolarradiation.Thehistorical out-doorweatherdatawascollectedthroughtheweatherstationinBristol [46] .Althoughsolarradiationisaccessiblefromthehistoricalweather data, itis generallynot availablefrom weatherforecasting websites [47–49] .Therefore,itisnotconsideredinthisstudy.

3.2. Datapre-processing

Tofacilitatethepredictionofbothday-aheadhourlyandweek-ahead dailyelectricityconsumption,twosetsofthehistoricaldatabasewere formulated.AsshowninFig. 3 ,therandom80%ofthedatasetsfromthe firstoneyear,theremaining20%ofthedatasetsfromthefirstoneyear, anddatasetsfromthefollowing6monthswereadoptedfortraining, validationandtestingpurposes,respectively.Toincreasetheprediction accuracyofDFNNwhilepreventitfromoverfitting,theaveragevalueof

MAPEfortrainingandtestingcaseswastreatedastheobjectivefunction inGAoptimization.

3.2.1. Databaseforday-aheadhourlyelectricityconsumptionprediction

Thetrainingdatabasefortheday-aheadhourlyelectricity consump-tionpredictionincludesthehourofthedayh(t),thedayoftheweek

d(t),themonthoftheyearm(t),hourlyoutdoorairdry-bulb temper-atureTdb(t),hourlyoutdoorairdew-pointtemperatureTdew(t),hourly windspeedVwind(t)aswellasthehourlyelectricityconsumptionatthe samehourofthepreviousdayE_h(t-24).Torepresentthecyclicalnature ofthetime,h(t)wasrepresentedbyitssineandcosinevalueas[13 ,37] : ℎ𝑠(𝑡)=sin2𝜋ℎ₂₄(𝑡)ℎ𝑐(𝑡)=cos2𝜋ℎ₂₄(𝑡) (13) Owingtothediﬀerentoccupancyandoperatingequipment scenarios duringtheholidays,theholidaysweremarkedasanadditionaldayof theweek.Meanwhile,thecategoricaldata,includingdayoftheweek andmonthoftheyearwereencodedusingone-hotencodingapproach. Dayoftheweekwasrepresentedbyaconcatenationof8binary vari-ables. Forinstance,bankholidayswereexpressed as[1000000

(7)

Fig.4. Hourly outdoor weather data.

0],Sundaywasexpressedas[01000000].Thefirstcolumn corre-spondedtoaflagwhetherornotthedaywasabankholiday,whilethe secondcolumnreferredtoaflagwhetherornotthedaywasSunday. Similarly,theremainingsix columnscorrespondedtoaflagforeach ofthesixotherdaysoftheweek[50] .Meanwhile,monthoftheyear wasreferredbyaconcentrationof 12binaryvariables.Forexample, Januarywasillustratedas[100000000000],withthefirst col-umncorrespondedtoaflagwhetherornotthemonthwasJanuary.In

addition,thevariablesofoutdoorweatherdata(i.e.Tdb,h,Tdew,hand

Vwind,h)werenormalizedthroughthemin-maxscalingapproach:

𝑥′₌ 𝑥 (𝑡) − min 1 ≤𝑡≤365 × 24 𝑥(𝑡) max 1≤𝑡≤365× 24𝑥(𝑡)−1 ≤𝑡≤min365 × 24 𝑥(𝑡) (14)

The hourlyoutdoorair dry-bulbtemperature, outdoorair dew-point temperatureandwindspeedareshowninFig. 4 .Thepeakdry-bulb

(8)

Fig.5. Hourly electricity consumption.

anddew-pointtemperaturehappenedinJulywhilethevalleyoccurred inJanuary.Thehighestandlowestdry-bulbtemperatureis30°Cand −3°C,respectively.Meanwhile,thehighestandlowestdry-bulb tem-peratureis29°Cand−3°C,respectively.Theoveralltrendofdry-bulb temperatureanddew-pointtemperatureissimilar duringthe second-halfyearin2018and2019.Duringmosttimeoftheyear,windspeed islowerthan10m/s,whileitwouldreach15m/sonafewdays.

ThehourlyelectricityconsumptionispresentedinFig. 5 .InFig. 5 , thepinkdotsstoodforweekdayelectricityconsumptionwhilethe or-angedotsrepresentedtheweekendandbankholidays.Owingtothe normaloperatingscheduleoftheinvestigatedoﬃcebuilding,electricity consumptiononweekdayswasmuchhigherthanthatduringthe week-endsandbankholidays.Moreover,thepeakelectricityconsumptionwas identiﬁedinJulyduetoitshighdry-bulbanddew-pointtemperature.

In conclusion, the input dataset at time step t is denoted by training sample 𝑋𝑡=[𝑥1,𝑡𝑥2,𝑡𝑥3,𝑡𝑥4,𝑡𝑥5,𝑡𝑥6,𝑡𝑥7,𝑡], where 𝑥1,𝑡=ℎ𝑠(𝑡),

𝑥2 ,𝑡=ℎ𝑐(𝑡), 𝑥3 ,𝑡=𝑑(𝑡),𝑥4 ,𝑡=𝑚(𝑡), 𝑥5 ,𝑡=𝑇𝑑𝑏,ℎ(𝑡),𝑥6 ,𝑡=𝑇𝑑𝑒𝑤,ℎ(𝑡),𝑥7 ,𝑡=

𝑉𝑤𝑖𝑛𝑑,ℎ(𝑡),whiletheoutputdatasetis𝑍𝑡=𝐸_ℎ(𝑡).

3.2.2. Databaseforweek-aheaddailyprediction

Thetrainingdatabasefortheweek-aheaddailyelectricity consump-tionprediction includesthe day of theweek d(t), themonth of the yearm(t),dailyaverage outdoorair dry-bulbtemperatureTdb,ave,d(t), daily maximum outdoor air dry-bulb temperature Tdb,max,d(t), daily minimum outdoor air dry-bulb temperature T_db,_min_,d(t), daily av-erage outdoor air dew-point temperature Tdew,ave,d(t), daily maxi-mum outdoor air dew-point temperature Tdew,max,d(t), daily mini-mum outdoor air dew-point temperature T_dew,_min_,d(t), daily aver-age wind speed Vwind,d(t) as well as the total daily electricity con-sumption Ed(t). The variables of the daily outdoor weather data (i.e. Tdb,ave,d,Tdb,min,d,Tdb,max,d,Tdew,ave,d,Tdew,min,d,Tdew,max,d and

V_wind,_d)werealsonormalized: 𝑥′₌ 𝑥(𝑡) − min 1≤𝑡≤365𝑥(𝑡) max 1 ≤𝑡≤365 𝑥(𝑡)−1 ≤min𝑡≤365 𝑥(𝑡) (15) Thedailyaverageoutdoorairdry-bulbtemperature,outdoorair dew-pointtemperatureand windspeedareshown in Fig. 6 . Theoverall trendsofdaily-averagedry-bulbanddew-pointtemperatureissimilar tothoseofhourlydry-bulbanddew-point,respectively.Thepeakvalue ofdaily-averagedry-bulbtemperaturewasinJulyat23°C,whilethe lowestvaluewasinearlyJanuaryat−1°C.Moreover,thepeakvalue ofdaily-averagedew-pointtemperaturewasinAugustat17°C,while thelowestvaluewasinearlyJanuaryat−2°C.Thehighestwindspeed wasfoundinOctoberat12.5m/s.

Thedaily totalelectricityconsumptionispresentedin Fig. 7 .The pinkdotsstoodforweekdayelectricityconsumptionwhiletheorange dotsrepresentedtheweekendandbankholidays.Owingtothe nor-maloperatingscheduleof theinvestigatedoﬃce building,electricity

consumptiononweekdayswasmuchhigherthanthatduringthe week-endsandbankholidays.Thedailyelectricityconsumptionwasbetween 700=1300kWh onweekdays while 250–520kWh on weekendsand bankholidays. Therelativelyhigherelectricityconsumption was oc-curredinJanuaryduetorelativelyhigherdry-bulbanddew-point tem-perature.Itisalsonoticedthatthedailyelectricityconsumptionwas 500kWhonthe1stFebruary,whichwasmuchlowerthanother week-days.Itisbecausethatthecampuswasclosedduetoslowonthatday. Therefore,thisdaywasregardedasaholidaytoincreasetheprediction accuracyoftheproposedGA-DNNpredictivemodel.

As a whole, the input dataset at time step t is denoted by 𝑋𝑇=[𝑥₁_,𝑡𝑥₂_,𝑡𝑥₃_,𝑡𝑥₄_,𝑡𝑥₅_,𝑡𝑥₆_,𝑡𝑥₇_,𝑡𝑥₈_,𝑡𝑥₉_,𝑡], where 𝑥₁_,𝑡=𝑑′(𝑡),𝑥₂_,𝑡=

𝑚′₍_𝑡₎_,_𝑥

3 ,𝑡=𝑇𝑑𝑏,𝑎𝑣𝑒,𝑑′ (𝑡),𝑥4 ,𝑡=𝑇𝑑𝑏,𝑚𝑎𝑥,𝑑′ (𝑡),𝑥5 ,𝑡=𝑇𝑑𝑏,𝑚𝑖𝑛,𝑑′ (𝑡),𝑥6 ,𝑡=

𝑇′

𝑑𝑒𝑤,𝑎𝑣𝑒,𝑑(𝑡),𝑥7 ,𝑡=𝑇𝑑𝑒𝑤,𝑚𝑎𝑥,𝑑′ (𝑡), 𝑥8 ,𝑡=𝑇𝑑𝑒𝑤,𝑚𝑖𝑛,𝑑′ (𝑡),𝑥9 ,𝑡=𝑉𝑤𝑖𝑛𝑑,𝑑′ (𝑡), while theoutputdatasetis𝑧𝑡=𝐸_𝑑(𝑡).

4. PerformanceevaluationoftheproposedGA-DFNNpredictive model

Topreventtheoptimizationfrombeingtrappedinlocaloptimum, theGAparameterswerechosenusingsensitivityanalysis.The conver-genceperformanceoftheGAoptimizationwasassessed.Therefore,the optimalarchitectureoftheDFNNpredictionmodelcanbedetermined.

4.1. DeterminationofGAparameters

Topreventtheoptimizationfrombeingconvergedtolocaloptimum, diﬀerent GAparameters,includingﬁveretainingprobabilities (i.e.0.5, 0.6,0.7,0.8and0.9),fourselectionprobabilities(i.e.0.1,0.2,0.3and0.4) andfourmutationprobabilities(i.e.0.1,0.2,0.3and0.4)wereadopted intheoptimizationofDFNNarchitectureforweek-aheaddaily electric-ityconsumption.TheoptimizationresultsareshowninFig. 8 .Foreach variablebar,theaveragevalueofothervariablesisobtained.For exam-ple,whencalculatingthebarvalueofretainingprobability=0.5,the av-eragevalueof4×4optimizationresultswasadopted.Accordingtothe resultedMAPEvalue,theretainingprobability,selectionprobability and

mutationprobabilitywerechosenas80%,20%,and20%,respectively (Fig. 9 .).

4.2. ConvergenceperformanceofGAoptimization

Theconvergenceperformanceoftheday-aheadhourlyandthe week-aheaddailyelectricityconsumptionpredictionisshowninFig. 10 .The minimumvalueofday-aheadhourlyandweek-aheaddailyelectricity consumption reached convergentafter 31 and 22 iterations, respec-tively.Toensuretheconvergencewouldbesteadythereafter,the max-imumiterationswerechosenas40and30forday-aheadhourlyand week-aheaddailyelectricityconsumption,respectively.

(9)

Fig.6. Daily-average value of weather data.

4.3. OptimalarchitectureoftheproposedGA-DFNNmodel

AftertheGA optimization,the optimalarchitectureof theDFNN modelcouldbedetermined,asshowninFig. 10 .Thereare4and2 hid-denlayersforday-aheadhourlyandweek-aheaddailyelectricity con-sumption,respectively.Thereweremorehiddenlayersforday-ahead hourlyelectricityconsumptionpredictionowingtothereasonthatthere

existsahighernumberoftrainingsamples(i.e.8736).Rectiﬁedlinear unitandexponentiallinearactivationfunctionsweremainlyadoptedin hiddenlayersforday-aheadhourlypredictionwhileexponentiallinear andhyperbolictangentactivationfunctionswereusedinweek-ahead dailyelectricityconsumptionprediction.Exponentiallinearactivation functionwas adoptedin theoutputyearforbothpredictionmodels. NADAMandADAMAXwereadoptedastrainingprocessforday-ahead

(10)

Fig.7. Daily total electricity consumption.

Fig.8. Optimization result at diﬀerent GA parameters.

(11)

Fig.10. Optimal architecture of the proposed GA-DFNN prediction model.

hourlyandweek-aheaddailyelectricityconsumptionprediction, respec-tively.

Toreducethe computationtimewhileprevent theproposed GA-DFNN predictive model from overﬁtting, early-stopping is adopted [51 ,52] .Duringthetrainingphase,theMAPEvalueofthetrainingand validationdatasetsareshowninFig. 11 .After400and12,000epochs, theMAPEvalueofdailyandweeklypredictionreachedrelatively con-stant.Therefore,theMAPEvaluefordailyandweeklypredictionwas 9.8%and4.9%,respectively.

5. Performancecomparisonwithreferencemodels

To assess the effectiveness of the proposed GA-DFNN predictive model,fourgroupsofreferencescaseswereintroduced.Toinvestigate thepredictionperformanceoftheDFNNmodelwithmultiplehidden layers,thefeedforward neuralnetworkmodel withsingle layerwas adoptedasreference.Toassessthepredictionperformance improve-mentthroughDFNNmodelwithoptimalarchitecture,theDFNNmodels withdifferentarchitectureareadoptedasreferencecases.To demon-stratethe comprehensive relationship between affecting factors and electricityconsumption,thetime-seriespredictionapproachincluding long-term-short-memory(LSTM)neuralnetworkmodelandtemporal convolutionalnetworks(TCN)modelisadoptedasreference.Toexhibit

theoptimizationperformanceofGA,therandomsearch-basedDFNN modelisusedasareferencecase.

5.1. PerformancecomparisonagainstFNNmodelswithsinglehiddenlayer

Firstly,toinvestigatethepredictionperformanceoftheDFNNmodel withmultiplehiddenlayers,theANNmodelwithsinglelayerisadopted asreference.Thequantityofneuronswasselectedaccordingtothe em-piricalequationsin[18] and[20] ,respectively.Sigmoidactivation func-tionwasadoptedinthehiddenlayerwhileexponentiallinearactivation functionwasusedintheoutputlayer.ADAMwasadoptedasthe train-ingprocess,assummarizedinTable 3 .

TheperformanceoftheproposedGA-DFNNmodelandtworeference FNNmodelsaresummarizedinTable 4 .

• Forday-ahead hourlyprediction, although theprediction perfor-manceofGA-DFNN duringtrainingphasewasalittleworsethan thereferenceFNNmodelswithsinglelayer,itshowedbetter perfor-manceattestingphase.ComparedtoFNN1andFNN2models,there was3.4%and1.8%reductioninMAPE,1.3%and6.1%reductionin RMSE,4.8and3.8%reductioninMAE,aswellas0.10%and0.52% increaseinR2_,_{respectively.}

• Forweek-aheaddailyprediction, theproposed GA-DFNNshowed better performancein both trainingandtesting phases. For test-ingcasesoftheproposedGA-DFNNmodel,thereductioninMAPE, RMSEandMAEwas34.5%,38.9%and29.3%whiletheincrease inR2_was_6.3%_compared_to_FNN1_model;_the_reduction_in_MAPE, RMSEandMAEwas34.3%,38.9%and29.2%whiletheincreasein R2was6.3%comparedtoFNN2model.

Thepredictionresultsfromtwodaysintraining,twodaysintesting, twoweeksintrainingandtwoweeksintestingareshowninFigs. 12 and 13 ,respectively.TheproposedGA-DFNNmodelshowedbetterability attrackingthevariationofelectricityconsumption,suchasatthe4th, 13th,17th,18th,20th,21stand22ndhinFig. 13 (c);atthe1st,2nd,4th, 6th,12th-14th,17th,21sthinFig. 13 (d);duringTuesday,Wednesday, ThursdayandFridayinFig. 14 (c);aswellasduringTuesday-Saturday inFig. 14 (d).

5.2. PerformancecomparisonagainstDFNNmodelswithotherarchitecture

ToverifytheoptimalarchitecturedeterminedbyGAoptimization, theDFNNmodelswithdiﬀerentneuronsineachhiddenlayer,activation functionandlearningapproachwereadoptedasreference.The selec-tionofthereferencearchitectureisbasedonthecriteriathatonlyone aﬀectingfactorwaschangedwhileotherfactorswerekeptthesameas theGA-determinedarchitecture.Thearchitectureofthefourreference DFNNmodels,alongwiththeGA-determinedDFNN,aresummarizedin Table 5 .

TheperformanceoftheproposedGA-DFNNmodelandthefour ref-erenceDFNNmodelsaresummarizedinTable 6 .

• Forday-aheadhourlyenergyconsumptionprediction,althoughthe predictionperformanceofGA-DFNNduringtrainingphaseisalittle worsethanthereferenceFNNmodelswithsinglelayer,itshowed better performance at testing phase. Compared to the reference DFNN1, DFNN2, DFNN3 and DFNN4 models, the proposed GA-DFNNmodelhad3.4%,0.59%,5.0%and0.97%reductioninMAPE

value;0.21%,0.31%,0.52%and0.31%increaseinR2_;_1.4%,_0.25%, 0.34%and0.27%reductioninRMSE;aswellas1.6%,3.0%,3.3% and0.34%reductioninMAE.

• Forweek-aheaddailyenergyconsumptionprediction,theproposed GA-DFNNshowedbetterperformanceinbothtrainingandtesting phases.FortheproposedGA-DFNNmodel,thereductioninMAPE valuewas27.4%,20.7%,33.3%and16.7%;theincreaseinR2_was 2.9%, 4.2%, 5.8%and1.9%; thereductionin RMSE was25.3%, 32.0%,37.4%and18.9%;thereductioninMAEwas23.0%,24.2%,

(12)

Fig.11. MAPE convergence of the proposed GA-DFNN model.

Fig.12. Day-ahead hourly prediction of GA-DFNN and FNN models.

30.5%and16.7%,comparedtoDFNN1,DFNN2,DFNN3andDFNN4 models,respectively.

Sincethearchitectureofreferencemodelswerechosenbasedonthe GA-determinedarchitecturebychangingaffectingfactorsone-by-one, thereductioninMAPE,RMSEandMAEaswellastheincreaseinR2 valueverifiesthattheproposedGA-DFNNmodelwaseffectiveinfinding itsoptimalarchitecture.

Thepredictionresultsfromtwodaysintraining,twodaysin test-ing, two weeks in training and two weeks in testing are shown in Figs. 14 and15 ,respectively.TheproposedGA-DFNNmodelshowed

betterabilityattrackingthevariationofelectricityconsumption,such asat2nd,11th,12th,13thand16th−20thhinFig. 15 (c);at0th-3rd, 8th,11th-13th,2nd,4th,6th,12th-14th,20th-22ndhinFig. 15 (d);on Tuesday,WednesdayandFridayinFig. 16 (c);andonMonday,Tuesday, WednesdayandFridayinFig. 16 (d).

5.3. Performancecomparisonagainstrandomsearch-basedDFNNmodel

ToassesstheperformanceofGAinﬁndingtheoptimalarchitecture, theDFNNmodeloptimizedbyrandomsearch(RS-DFNN)wasadopted as reference. Thedecision variablesof random searchalso included Table3

Design parameters of reference DFNN models.

Day-ahead hourly prediction Week-ahead daily prediction

Referencemodels FNN1 FNN2 FNN1 FNN2

Neuronsinhiddenlayer 17 98 21 25

Activationfunction Sigmoidinhiddenlayer,exponentiallinearinoutputlayer Optimizationapproach ADAM

(13)

Fig.13. Week-ahead daily prediction of GA-DFNN and FNN models.

Fig.14. Day-ahead hourly prediction of GA-DFNN and FNN models.

quantityof hiddenlayers,quantity of neuronsin each layer, activa-tionfunctionandlearningapproach. Theoptimizationobjective was stilltheMAPEvaluebetweenthepredictedandactualmeasurement. Theoptimalarchitecturedeterminedbyrandomsearchissummarized inTable 7 .

TheperformanceoftheproposedGA-DFNNmodelandthefour ref-erenceDFNNmodelsaresummarizedinTable 8 .

• For day-aheadhourlyprediction, althoughthe prediction perfor-manceofGA-DFNNduringtrainingphaseisalittleworsethanthe reference FNNmodelswithsinglelayer, itshowedbetter perfor-manceattestingphase.ComparedtoRS-DFNNmodel,theproposed GA-DFNNmodelhas9.7%,6.8%and8.2%reductioninMAPE,RMSE

andMAE,while0.52%increaseinR2_,_{respectively.}

• Forweek-aheaddaily prediction, theproposedGA-DFNN showed betterperformanceinbothtrainingandtestingphases.Comparedto

RS-DFNNmodel,thereductioninMAPE,RMSEandMAEwas33.8%, 36.2%and31.8%and16.7%,respectively,whiletheincreaseinR2 was5.3%.

TheworseperformanceoftheRS-DFNNmodelmaybeowingtothe factthatrandomsearchwastrappedtolocaloptimalratherthanglobal optimal.

Thepredictionresultsfromtwodaysintraining,twodaysintesting, twoweeksintrainingandtwoweeksintestingareshowninFigs. 16 and 17 ,respectively.TheproposedGA-DFNNmodelshowedbetterability attrackingthevariationofelectricityconsumptioninbothtrainingand testingcasesduringmostofthetime.

(14)

Fig.15. Week-ahead daily prediction of GA-DFNN and DFNN models.

Table4

Performance indicators of GA-DFNN and reference FNN models with single hidden layer.

GA-DFNN FNN1 FNN2

Train Test Train Test Train Test Daily MAPE(%) 9.870 7.885 9.574 8.163 9.669 8.032 R2 _0.958 _0.970 _0.961 _0.969 _0.963 _0.965 RMSE(kWh) 4.440 3.652 4.281 3.702 4.143 3.891 MAE(kWh) 2.958 2.675 2.827 2.811 2.722 2.781 Weekly MAPE(%) 4.908 5.326 10.04 8.13 10.04 8.11 R2 _0.958 _0.966 _0.837 _0.909 _0.837 _0.909 RMSE(kWh) 55.25 47.24 108.6 77.29 108.7 77.27 MAE(kWh) 32.49 35.52 60.26 50.30 60.30 50.17

5.4. PerformancecomparisonagainstLSTMandTCNtime-series predictionmodel

Todemonstratethecomprehensiverelationshipbetween aﬀecting factorsandelectricityconsumption,thetime-seriespredictionapproach includingLSTM[53] andTCNmodelisadoptedasreference.The per-formanceoftheproposedGA-DFNNmodelandthefourreferenceDFNN modelsaresummarizedinTable 9 .

• Forday-aheadhourly prediction, althoughthe prediction perfor-manceofGA-DFNNduringtrainingphasewasalittleworsethanthe

referenceLSTMandTCNmodels,itshowedbetterperformanceat testingphase.ComparedtothereferenceLSTMmodel,theproposed GA-DFNNmodelhas12.2%,40.1%and32.6%reductioninMAPE, RMSEandMAE,while5.9%increaseinR2_,_{respectively.}_Compared to thereference TCN model, the proposed GA-DFNN model has 23.8%,9.8%and17.7%reductioninMAPE,RMSEandMAE,while 0.73%increaseinR2_,_{respectively.}

• Forweek-aheaddailyprediction, theproposed GA-DFNNshowed betterperformanceinbothtrainingandtestingphases.Compared toLSTMpredictionmodel,thereductioninMAPE,RMSEandMAE

was32.7%,46.4%and26.4%,respectively,whiletheincreaseinR2 is9.6%.ComparedtoTCNpredictionmodel,thereductioninMAPE, RMSEandMAEwas65.0%,60.6%and64.5%,respectively,while theincreaseinR2_is_24.0%.

ItdemonstratedthattheproposedGA-DFNNmodelhasbetter accu-racythanLSTMandTCNmodelsforbothday-aheadhourlyprediction andweek-aheaddailyprediction.

Thepredictionresultsfromtwodaysintraining,twodaysintesting, twoweeksintrainingandtwoweeksintestingareshowninFigs. 18 and 19 ,respectively.TheproposedGA-DFNNmodelshowedbetterability attrackingthevariationofelectricityconsumptioninbothtrainingand testingcasesduringmostofthetimethanthatofreferenceLSTMand TCNmodels.

Table5

Architecture of the GA-DFNN and four reference DFNN models.

Models GA-DFNN DFNN1 DFNN2 DFNN3 DFNN4

Neuronsinhiddenlayers {8,5,7,9} {10,10,10,10} {5,5,5,5} {8,5,7,9} {8,5,7,9} Quantityofhiddenlayers 4

Activationfunctionsinhiddenlayers relu,relu,relu,elu relu,relu,relu,elu relu,relu,relu,elu sigmoid,sigmoid,sigmoid,sigmoid relu,relu,relu,elu Activationfunctioninoutputlayer elu

Learningapproach NADAM NADAM NADAM NADAM ADAM

Neuronsinhiddenlayers {5,9} {10,10} {5,5} {5,9} {5,9} Quantityofhiddenlayers 2

Activationfunctioninhiddenlayer elu,tanh elu,tanh elu,tanh Sigmoid,sigmoid elu,tanh Activationinoutputlayer elu

(15)

Table6

Prediction performance of the GA-DFNN and four reference DFNN models.

GA-DFNN DFNN1 DFNN2 DFNN3 DFNN4

Train Test Train Test Train Test Train Test

Daily MAPE(%) 9.870 7.885 9.574 8.163 9.669 7.932 9.521 8.303 9.751 7.962 R2 _0.958 _0.973 _0.961 _0.971 _0.959 _0.970 _0.956 _0.968 _0.957 _0.970 RMSE(kWh) 4.440 3.652 4.277 3.602 4.379 3.643 4.520 3.782 4.483 3.662 MAE(kWh) 2.958 2.675 2.843 2.718 2.897 2.683 2.926 2.765 2.955 2.684 Weekly MAPE(%) 4.908 5.326 7.085 7.337 6.693 6.713 9.662 7.989 5.812 6.388 R2 _0.958 _0.966 _0.890 _0.939 _0.860 _0.927 _0.851 _0.913 _0.942 _0.948 RMSE(kWh) 55.25 47.24 89.35 63.21 100.8 69.42 103.9 75.49 65.04 58.24 MAE(kWh) 32.49 35.52 48.22 46.11 56.53 46.89 59.45 51.13 39.76 42.65

Fig.16. Day-ahead hourly prediction of GA-DFNN and RS-DFNN models.

(16)

Fig.18. Day-ahead hourly prediction of GA-DFNN, LSTM and TCN models.

Fig.19. Week-ahead daily prediction of GA-DFNN, LSTM and TCN models.

6. Practicalimplicationandfutureapplication

Togeneratethedatabasefortrainingandtestingtheproposed GA-DFNNpredictivemodel,morethanoneandahalfyear’shistorical out-doorweatherdatashouldbecollectedfromthelocalweatherstation whiletheelectricityconsumptiondataoughttobeobtainedfromthe buildingenergysystem.UpontrainingandtestingtheproposeGA-DFNN predictivemodel,itcanbeadoptedinthebuildingmanagement sys-temtopredicttheday-aheadhourlyandweek-aheaddailyelectricity consumptionusingthelatestforecastofweatherproﬁleacquiredfrom theweatherreportingwebsite[45–47] .Itcanalsobefurtherreﬁned

totackletheproblemcausedbytheprobablefaultymeasurementdata owingtosensorandequipmentfaults.

However,due tothefactthatthecampusbuildinghasrelatively stable buildingoperatingschedule, theeﬀectivenessof theproposed GA-DFNNpredictivemodelshouldbefurtherevaluatedwhenadopted inresidential,hospital,hotelorothertypesofbuildings.Moreover,it is worthwhiletoinvestigatewhetherotherevolutionaryoptimization algorithms,suchasparticleswarmoptimization,antcolony optimiza-tionandartiﬁcialbeecolonyalgorithmwouldshowbetteroptimization performancethantheGAalgorithmindeterminingtheoptimalDFNN architecture.

(17)

Table7

Optimal architecture of DFNN model determined by random search.

Models Day-ahead hourly prediction Week-ahead daily prediction Neuronsinhiddenlayers {6,8,7,9} {7,8} Quantityofhiddenlayers 4 2 Activationfunctionsinhiddenlayers tanh,tanh,relu elu Activationfunctioninoutputlayer elu elu Learningapproach ADAMAX ADAM

Table8

Prediction performance of the GA-DFNN and the reference RS-DFNN model.

GA-DFNN RS-DFNN

Train Test Train Test Daily MAPE(%) 9.870 7.885 9.124 8.734 R2 _0.958 _0.970 _0.960 _0.965 RMSE(kWh) 4.440 3.652 4.305 3.920 MAE(kWh) 2.958 2.675 2.808 2.917 Weekly MAPE(%) 4.908 5.326 9.592 8.049 R2 _0.958 _0.966 _0.851 _0.917 MAE(kWh) 55.25 47.24 103.9 73.99 RMSE(kWh) 32.49 35.52 60.34 52.10 Table9

Prediction performance of the GA-DFNN and the reference LSTM and TCN model.

GA-DFNN LSTM TCN

Train Test Train Test Train Test Daily MAPE(%) 9.870 7.885 9.591 8.983 10.70 10.36 R2 _0.958 _0.970 _0.910 _0.916 _0.970 _0.963 RMSE(kWh) 4.440 3.652 6.499 6.104 3.741 4.050 MAE(kWh) 2.958 2.675 3.966 3.969 3.049 3.251 Weekly MAPE(%) 4.908 5.326 7.013 7.909 16.12 15.21 R2 _0.958 _0.966 _0.885 _0.881 _0.756 _0.779 RMSE(kWh) 55.25 47.24 88.79 88.11 130.8 119.9 MAE(kWh) 32.49 35.52 44.13 48.26 101.7 100.1 6. Conclusion

Toenhancethepredictionaccuracyofbuildingelectricalenergy con-sumption,ahybridpredictivemodelbasedonthecombinationofGA optimizationandDFNNalgorithmisproposed.InconventionalFNNor DFNN-basedpredictivemodels,thearchitectureoftheFNNandDFNN networkwasselectedbased ontheexperience, intuitionortrialand errorprocess,whichwastime-consumingandlackofaccuracy. How-ever,intheproposedGA-DFNNpredictivemodel,thequantityof hid-denlayers,thequantityofneuronsineachhiddenlayer,theactivation algorithmineachlayeraswellasthelearningprocessforweighting fac-torsaredeterminedthroughGAoptimization.Asaresult,theproposed GA-enhancedDFNNpredictivemodelisaself-directed,systematicand well-behavedprocesstoguaranteetheoptimalDFNNarchitecturewith betterpredictionaccuracyandlesscomputationcomplexity.

Theactualoutdoorweatherdatafromweatherstationandthe mea-suredelectricityconsumptiondatafromthebuildingmanagement sys-temofareal-worldcampusbuildingfromoneentireyearwereadopted ashistoricaldatabasetotraintheproposedGA-enhancedDFNN predic-tivemodel,whilethedatasetsfromtheother6monthswereusedfor testingpurpose.Themainﬁndingsaresummarizedasbelow:

• Forday-aheadhourlyelectricityconsumptionprediction,the opti-malquantityofhiddenlayeris4,with8,5,7and9neuronsineach hiddenlayer,andwiththeactivationfunctionofrectiﬁedlinearunit intheﬁrstthreehiddenlayersandexponentiallinearinthefourth hiddenlayer.Theactivationfunctionin theoutputlayeris

expo-nentiallinearfunctionwhiletheoptimaloptimizationapproachis NADAM.

• Forweek-aheaddailyelectricityconsumptionprediction,the opti-malquantityofhiddenlayeris2.Thereare5neuronsintheﬁrst hiddenlayer,whileexponentiallinearfunctionisadoptedasthe ac-tivationfunction.Thereare9neuronsinthesecondhiddenlayer, whilehyperbolictangentisadopted.Theactivationfunctioninthe outputlayerisexponentiallinearwhiletheoptimaloptimization ap-proachisADAMAX.

• Forday-aheadhourlyelectricityconsumptionprediction,theMAPE, R2_,_RMSE_and_MAE_was_7.885%,_0.970,_3.652_kWh_and_2.675_kWh fortestingcase,respectively;Forweek-aheaddailyelectricity con-sumptionprediction, theMAPE,R2_,_MPE_and_RMSE_was_5.326%, 0.966,47.24kWhand35.52kWhfortestingcase,respectively. • Forday-aheadhourlyandweek-aheaddailyelectricityconsumption

prediction,theproposedGA-DFNNpredictivemodelhas3.4%and 34.5%reductioninMAPEcomparedtothereferenceFNNmodels withsinglehiddenlayer,demonstratingthatDFNNmodelswith mul-tiplelayerscanbetterrevealthecomplexrelationshipamongvarious inputdatasetsandoutputdataset.

• Forday-aheadhourlyandweek-aheaddailyelectricityconsumption prediction,theproposedGA-DFNNpredictivemodelhas5.0%and 33.3%reductioninMAPEcomparedtothereferenceDFNNmodels withotherarchitecture,demonstratingthatGAoptimizationhasthe abilitytoﬁndtheoptimalarchitectureforDFNNmodels.

• Forday-aheadhourlyand week-aheaddaily electricity consump-tionprediction,theproposedGA-DFNNpredictivemodelhas9.7% and33.8%reductioninMAPEthanthereferenceRS-DFNNmodel, demonstratingthatGAoptimizationhasbetteroptimization perfor-mancethanrandomsearchoptimizationindeterminingtheoptimal architectureformachinelearningmodels.

• Forday-aheadhourlyandweek-aheaddailyelectricityconsumption prediction,theproposedGA-DFNNpredictivemodelhas12.2%and 32.7%reductioninMAPEthanthereferenceLSTM,while23.8%and 65.0%reductioninMAPEthanthereferenceTCNmodels, demon-strating thatDFNNhasbetterperformancein buildingelectricity consumption prediction than time-series based machinelearning models.

Acknowledgment

Theauthorswouldlike toacknowledgeandexpresstheirsincere gratitude toTheDepartmentforBusiness,Energy& Industrial Strat-egy(BEIS)throughgrantprojectnumberTEIF-101-7025.Opinions ex-pressedandconclusionsarrivedatarethoseoftheauthorsandarenot tobeattributedtoBEIS.

References

[1] ZhaoGY,LiuZY,HeY,CaoHJ,GuoYB.Energyconsumptioninmachining: classi-ﬁcation,prediction,andreductionstrategy.Energy2017;133:142–57.

[2] SieminskiA.Internationalenergyoutlook.EnergyInfAdm2017:5–30.

[3] LuoXJ,FongKF.Developmentofmulti-supply-multi-demandcontrolstrategyfor combinedcooling,heatingandpowersystemprimedwithsolidoxidefuelcell-gas turbine.EnergyConversManag2017;154:538–61.

[4] LuoXJ,FongKF.Developmentofintegrateddemandandsupplysidemanagement strategyofmulti-energysystemforresidentialbuildingapplication.ApplEnergy 2019;242:570–87.

[5] Pérez-ChacónR,Luna-RomeraJM,TroncosoA,Martínez-ÁlvarezF,RiquelmeJC. Bigdataanalyticsfordiscoveringelectricityconsumptionpatternsinsmartcities. Energies2018;11(3):683.

[6] WangZ,SrinivasanRS.Areviewofartiﬁcialintelligencebasedbuildingenergyuse prediction:contrastingthecapabilitiesofsingleandensemblepredictionmodels. RenewSustainEnergyRev2017;1:796–808.

[7] ZhaoY,ZhangC,ZhangY,WangZ,LiJ.Areviewofdataminingtechnologiesin buildingenergysystems:loadprediction,patternidentiﬁcation,faultdetectionand diagnosis.EnergyBuiltEnviron2020;2:149–64.

[8] WeiY,ZhangX,ShiY,XiaL,PanS,WuJ,HanM,ZhaoX.Areviewofdata-driven approachesforpredictionandclassiﬁcationofbuildingenergyconsumption.Renew SustainEnergyRev2018;1:1027–47.

[9] AmasyaliK,El-GoharyNM.Areviewofdata-drivenbuildingenergyconsumption predictionstudies.RenewSustainEnergyRev2018;81:1192–205.

(18)

[10] LuoXJ,LukumonO,AnuoluwapoA,ChukwukaM,OlugbengaA,LukmanA. De-velopmentofanIoT-basedbigdataplatformforday-aheadpredictionofbuilding heatingandcoolingdemands.AdvEngInform2019;41:100926.

[11] LuoXJ.Anovelclustering-enhancedadaptiveartiﬁcialneuralnetworkmodelfor predictingday-aheadbuildingcoolingdemand.JBuildEng2020:101504. [12] Singh S, Hussain S, Bazaz MA. Short term load forecasting using artiﬁcial neural

network. 2017 Fourth International Conference on Image Information Processing (ICIIP), Shimla 2017:1–5. doi: 10.1109/ICIIP.2017.8313703.

[13] MenaR,RodríguezF,CastillaM,ArahalMR.Apredictionmodelbasedon neu-ralnetworksfortheenergyconsumptionofabioclimaticbuilding.EnergyBuild 2014;82:142–55.

[14] KusiakA,LiMY,ZhangZJ.Adata-drivenapproachforsteamloadpredictionin buildings.ApplEnergy2010;87:925–33.

[15] DebC, EangLS, Yang J,SantamourisM.Forecastingdiurnal coolingenergy load for institutional buildings usingartiﬁcial neural networks. EnergyBuild 2016;121:284–97.

[16] AhmadT,ChenH,ShairJ,XuC.Deploymentofdata-miningshortandmedium-term horizoncoolingloadforecastingmodelsforbuildingenergyoptimizationand man-agement.IntJRefrig2019;98:399–409.

[17] YangJ,HuguesR,RaduZ.On-linebuildingenergypredictionusingadaptive artiﬁ-cialneuralnetworks.EnergyBuild2005;37:1250–9.

[18] CloseR.Theoryofthebackpropagationneuralnetwork.In:ProceedingsofIEEE internationalconferenceonneuralnetworks,1;1989.p.593–605.

[19] WangL,EricWML,RichardKKY.Noveldynamicforecastingmodelforbuilding coolingloadscombininganartiﬁcialneuralnetworkandanensembleapproach. ApplEnergy2018;228:1740–53.

[20] .NeuroShell2manual.Frederick,MA:WardSystemGroup,Inc.;1993.

[21] BagnascoA,FresiF,SaviozziM,SilvestroF,VinciA.Electricalconsumption fore-castinginhospitalfacilities:anapplicationcase.EnergyBuild2015;103:261–70. [22] AhmadMW,MourshedM,RezguiY.TreesvsNeurons:comparisonbetween

ran-domforestandANNforhigh-resolutionpredictionofbuildingenergyconsumption. EnergyBuild2017;147:77–89.

[23] LiK,XieX,XueW,DaiX,ChenX,YangX.Ahybridteaching-learningartiﬁcial neuralnetworkforbuildingelectricalenergyconsumptionprediction.EnergyBuild 2018;174:323–34.

[24] AlnaqiAA,MoayediH,ShahsavarA,NguyenTK.Predictionofenergetic perfor-manceofabuildingintegratedphotovoltaic/thermalsystemthoroughartiﬁcial neu-ralnetworkandhybridparticleswarmoptimizationmodels.EnergyConversManag 2019;183:137–48.

[25] LiK,HuC,LiuG,XueW.Building’selectricityconsumptionpredictionusing op-timizedartiﬁcialneuralnetworksandprincipalcomponentanalysis.EnergyBuild 2015;108:106–13.

[26] MuralitharanK,SakthivelR,VishnuvarthanR.Neuralnetworkbased optimiza-tion approach for energy demand prediction in smart grid. Neurocomputing 2018;273:199–208.

[27] SchmidhuberJ.Deep learningin neuralnetworks: an overview.Neural Netw 2015;61:85–117.

[28] LeCunY,BengioY,HintonG.Deeplearning.Nature2015;321:436–44.

[29] Zekić-SušacM,Mitrović S,HasA.Machinelearningbasedsystemformanaging energyeﬃciencyofpublicsectorasanapproachtowardssmartcities.IntJInf Manag2020:102074.

[30] BandaE,FollyKA.Shorttermloadforecastingusingartiﬁcialneuralnetwork.In: 2007IEEElausannepowertech.IEEE;2007Jul1.p.108–12.

[31] TorresJF,Gutiérrez-AvilésD,TroncosoA,Martínez-ÁlvarezF.Random hyper-pa-rametersearch-baseddeepneuralnetworkforpowerconsumptionforecasting. Ad-vancesincomputationalintelligenceIWANN2019Lecturenotesincomputer sci-ence,11506.Cham:Springer;2019.

[32] ParkSK,MoonHJ,MinKC,HwangC,KimS.Applicationofamultiplelinear re-gressionandanartiﬁcialneuralnetworkmodelfortheheatingperformanceanalysis andhourlypredictionofalarge-scalegroundsourceheatpumpsystem.EnergyBuild 2018;165:206–15.

[33] LuR,HongSH.Incentive-baseddemandresponseforsmartgridwithreinforcement learninganddeepneuralnetwork.ApplEnergy2019;236:937–49.

[34] Schalkoﬀ RJ,YorkNew.Artiﬁcialneuralnetworks.McGraw-Hill;1997. [35] JohnD,EladH,SingerYoram.Adaptivesubgradientmethodsforonlinelearning

andstochasticoptimization.JMachLearnRes2011;12:2121–59.

[36] KingmaDP,BaJL.Adam:amethodforstochasticoptimization.In:International conferenceonlearningrepresentations;2015.p.1–13.(2015).

[37] DogoEM,AfolabiOJ,NwuluNI,TwalaB,AigbavboaCO.Acomparativeanalysisof gradientdescent-basedoptimizationalgorithmsonconvolutionalneuralnetworks. In:2018internationalconferenceoncomputationaltechniques,electronicsand me-chanicalsystems(CTEMS).IEEE;December2018.p.92–9.

[38] VaniS,RaoTM.Anexperimentalapproachtowardstheperformanceassessmentof variousoptimizersonconvolutionalneuralnetwork.In:20193rdinternational con-ferenceontrendsinelectronicsandinformatics(ICOEI).IEEE;April2019.p.331–6. [39] GoldbergD.Geneticalgorithmsinsearch,optimization,andmachinelearning.

Read-ing,Massachusetts:AddisonWesley;1989.

[40] MitchellM.Anintroductiontogeneticalgorithms.MITpress;1998.

[41] OhBK,GlisicB,ParkSW,ParkHS.Neuralnetwork-basedseismicresponse pre-dictionmodelforbuilding structuresusingartiﬁcialearthquakes.JSound Vib 2020;468:115109.

[42] RenT,LiuX,NiuJ,LeiX,ZhangZ.Real-timewaterlevelpredictionofcascaded channelsbasedonmultilayerperceptionandrecurrentneuralnetwork.JHydrol (Amst)2020:124783.

[43] BahmanAM,EbrahimSA.Predictionoftheminimumﬁlmboilingtemperatureusing artiﬁcialneuralnetwork.IntJHeatMassTransf2020;155:119834.

[44] Chollet F. (2015) keras, GitHub. https://github.com/fchollet/keras.

[45] Abadi M., Agarwal A., Barham P., Brevdo E., Chen Z., Citro C., Zheng X. TensorFlow: large-scale machine learning on heterogeneous distributed systems. 2015. [46] https://dpds.weatheronline.co.uk/historical_data/weather_stations_download. [47] https://www.accuweather.com.

[48] https://weather.com/en-GB/. [49] https://www.metoﬃce.gov.uk/.

[50] Berriel RF, Lopes AT, Rodrigues A, Varejao FM. and Oliveira-Santos T. Monthly energy consumption forecast: a deep learning approach. In 2017 international joint conference on neural networks (IJCNN), IEEE, 4283–4290

[51] ZurRM,JiangY,PesceLL,DrukkerK.Noiseinjectionfortrainingartiﬁcial neu-ral networks: acomparison withweight decay andearlystopping. Med Phys 2009;36(10):4810–18.

[52] PrecheltL.Earlystopping-butwhen?.In:Neuralnetworks:tricksofthetrade.Berlin, Heidelberg:Springer;1998.p.55–69.

[53] TorresJF,GaliciaA,TroncosoA,Martínez-ÁlvarezF.Ascalableapproachbased ondeeplearningforbigdatatimeseriesforecasting.IntegrComputAidedEng 2018;25(4):335–48.

100015

ScienceDirect

www.elsevier.com/locate/egyai

( http://creativecommons.org/licenses/by-nc-nd/4.0/

2017;133:142–57

2017:5–30

2017;154:538–61

2019;242:570–87

2018;11(3):683

2017;1:796–808

2020;2:149–64

2018;1:1027–47

2018;81:1192–205

2019;41:100926

2020:101504

doi: 10.1109/ICIIP.2017.8313703

2014;82:142–55

2010;87:925–33

2016;121:284–97

2019;98:399–409

2005;37:1250–9

593–605

2018;228:1740–53

1993

2015;103:261–70

2017;147:77–89

2018;174:323–34

2019;183:137–48

2015;108:106–13

2018;273:199–208

2015;61:85–117

2015;321:436–44

2020:102074

108–12

2019

2018;165:206–15

2019;236:937–49

1997

2011;12:2121–59

2015.

92–9

331–6

1989

1998

2020;468:115109

2020:124783

2020;155:119834

https://github.com/fchollet/keras

https://dpds.weatheronline.co.uk/historical_data/weather_stations_download

https://www.accuweather.com

https://weather.com/en-GB/

https://www.metoﬃce.gov.uk/

2009;36(10):4810–18

55–69

2018;25(4):335–48