ContentslistsavailableatScienceDirect
Sustainable
Cities
and
Society
j o ur n a l ho me p a g e :w w w . e l s e v i e r . c o m / l o c a t e / s c s
Engineering
advance
Advances
and
challenges
in
building
engineering
and
data
mining
applications
for
energy-efficient
communities
夽
Zhun
(Jerry)
Yu
a,
Fariborz
Haghighat
b,∗,
Benjamin
C.M.
Fung
c aCollegeofCivilEngineering,HunanUniversity,Changsha410082,Hunan,PRChinabDepartmentofBuilding,CivilandEnvironmentalEngineering,ConcordiaUniversity,Montreal,QC,CanadaH3G1M8 cSchoolofInformationStudies,McGillUniversity,Montreal,QC,CanadaH3A1X1
a
r
t
i
c
l
e
i
n
f
o
Articlehistory:Availableonline19December2015 Keywords:
DataMining BuildingEnergyUse OccupantBehavior BigData Review
a
b
s
t
r
a
c
t
Therapidlygrowingandgiganticbodyofstoreddatainthebuildingfield,coupledwiththeneedfordata analysis,hasgeneratedanurgentneedforpowerfultoolsthatcanextracthiddenbutusefulknowledge ofbuildingperformanceimprovementfromlargedatasets.Asanemergingsubfieldofcomputerscience, dataminingtechnologiessuitthisneedwellandhavebeenproposedforrelevantknowledgediscovery inthepastseveralyears.Aimedtohighlightrecentadvances,thispaperprovidesanoverviewofthe studiesundertakingthetwomaindataminingtasks(i.e.predictivetasksanddescriptivetasks)inthe buildingfield.Basedontheoverview,majorchallengesandfutureresearchtrendsarealsodiscussed.
©2015TheAuthors.PublishedbyElsevierLtd.ThisisanopenaccessarticleundertheCCBY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).
Contents
1. Introduction...33
2. DMapplicationsinthebuildingfield...34
2.1. Predictivetasks...34
2.1.1. Buildingenergydemandprediction...34
2.1.2. Buildingoccupancyandoccupantbehavior...34
2.1.3. Faultdetectiondiagnostics(FDD)forbuildingsystems...34
2.2. Descriptivetasks...35
2.2.1. Frameworkdevelopment...35
2.2.2. Occupantbehavior...35
2.2.3. Buildingmodelingandoptimalcontrol...35
2.2.4. Discoveringandunderstandingenergyusepatterns...36
3. Futuredirectionsandchallenges—Concludingremarks...36
Acknowledgements ... 37
References...37
1. Introduction
Currently theimportanceof improvingbuildingenergy per-formanceforsavingenergyandenhancingbuildingsustainability hasbeenwidelyrecognized.Oneeffectivewayofachievingthis objectiveistouncoverandextractusefulknowledgefrom build-ing operational data (e.g. temperature, flow rate, power and
夽 ThisisanEngineeringAdvancepaper. ∗ Correspondingauthor.
E-mailaddress:[email protected](F.Haghighat).
equipmentstates) that contains abundantvaluable information onactualbuildingperformance. Thewidespreaduseofbuilding automationsystems(BASs)enablesatremendousamountof build-ingoperationaldatatobestoredinbuildingdatabasesthatalso continue to expand. Thisrapidlygrowing and giganticbody of storeddata,coupledwiththeneedfor dataanalysis,has gener-atedanurgentneedforpowerfultoolsthatcanextracthiddenbut usefulknowledgefromlargebuildingdatabases.
Asanemergingandpromisingtechnology,datamining(DM) isapowerfulandversatiletooltoautomaticallyextractthe valu-able knowledge embeddedin huge amounts of data.It can be definedinmanydifferentways.AsdefinedbyCabena,Hadjinian,
http://dx.doi.org/10.1016/j.scs.2015.12.001
2210-6707/©2015TheAuthors.PublishedbyElsevierLtd.ThisisanopenaccessarticleundertheCCBY-NC-NDlicense(http://creativecommons.org/licenses/by-nc-nd/4. 0/).
Stadler,Verhees,and Zanasi(1998),DMis “aninterdisciplinary fieldbringingtogethertechniquesfrommachinelearning,pattern recognition,statistics,databases,andvisualizationtoaddressthe issueofinformationextractionfromlargedatabases.”Inthepast severaldecades,researchershavebeenvigorouslyandsuccessfully applyingDMinmanyscientific,medical,andapplicationdomains suchasbanking,bioinformaticsandnewmaterialsidentification. Recentlyithasalsobeenintroducedintothebuildingfieldthatisa well-fitapplicationareaforDM,sinceitgeneratesandcollectsvast amountsofdataonsystemoperation,occupantbehavior,power consumption,climaticconditionsandetc.
Ingeneral,DMincludessixcategoriesofwidelyacceptedand implementedtechniques(Usama,Gregory,&Smyth,1996):
• Dataclassification(e.g.thedecisiontreemethod,supportvector machine(SVM)andartificialneuralnetwork(ANN)),
• Clusteringanalysis,
• AssociationRuleMining(ARM), • Regression,
• Summarization,and • AnomalyDetection.
Readersmay refer to(Jia,Kamber, &Pei, 2012) for detailed informationofthesetechniques.Thesetechniquescanbebroadly categorizedintopredictivetasksanddescriptivetasks(Jiaetal., 2012).Thispaperreportsanoverviewoftherecentstudies under-takingthetwotasksinthebuildingfield.
2. DMapplicationsinthebuildingfield
2.1. Predictivetasks
2.1.1. Buildingenergydemandprediction
Thepredictionofbuildingenergydemandplaysanimportant rolein improvingbuildingperformance. Anaccurateprediction needstotakevarioussignificantinfluencing factors ofbuilding energydemand intoconsideration, suchasweather conditions, HVACequipment,buildingenvelopesandoccupantbehavior.The complexityanduncertaintyofthesefactorsfurtheradds difficul-tiestoimprovepredictionaccuracy.DMtechniquesmaymakea breakthroughindealingwiththiscomplexityanduncertainty.
Zhaoand Magoulès(2012)indicatedthatDMtechniquesare veryapplicabletobuildingenergydemandpredictionsincethey candeal withnon-linear problems. ANNand SVM arethe two most widely used DM techniques for this application (Ahmad etal.,2014).Kumar, Aggarwal,andSharma(2013)applied vari-ousANNmethods,includingbackpropagation,recurrentANN,auto associativeANNandgeneralregression ANN.TheadoptedANN architecturesignificantlyinfluencesthecoefficientofvariationthat rangesfrom2%to40%.TheyconcludedthatANNismoresuitable forthepredictionofalargesetofparametersthananystatistical techniques.Li,Ren,andMeng(2010)reportedthatinmanycases SVMshowshigherpredictionaccuracythanANN.However, train-ingSVMcanbeaveryslowprocessduetolargevolumeoftraining data.TheusageofparallelSVM(Zhao&Magoulès,2011)mightbe afeasiblealternative.
BothANNandSVMmodelsoperatelikea“blackbox”, mean-ingthatthemodelcanprovideapredictionbutcannotprovidea justificationforsupportingtheprediction.Inordertoovercome thislimitation,Yu,Haghighat,Fung,andYoshino(2010)developed abuildingenergy demandpredictivemodel basedonthe deci-siontreemethod.Itscompetitiveadvantageliesintheabilityto generateaccuratepredictivemodels(92%accuracyintheirstudy) withinterpretableflowchart-liketreestructuresthatenableusers toquicklyextractusefulinformation.However,thedecisiontree
methodisbasicallydevelopedforpredictingcategoricalvariables otherthanforpredictingnumericalvariables.
ConsideringthatdifferentDMtechniqueshavetheirown spe-cialties,strengthsandweaknesses,itisnoteasytofindthebest candidateforthebuildingenergydemandprediction.Therecent researchtrendis towardstheintegrationof differentDM tech-niquesformoreaccurateprediction.For example,ChouandBui
(2014) developed an ensemble model by combining ANN and
SVMtopredictcoolingand heatingdemand.Inordertopredict thenext-dayenergyconsumptionandpeakpowerdemand,Fan, Xiao,andWang(2014)developedensemblemodelsbycombining eightbaseDMmodelsandassigningeachDMmodalanoptimized weightbasedongeneticalgorithm(GA).Theresultsshowthatthe ensemblemodelsachievehigherpredictionaccuracythanthoseof individualbasemodels.
2.1.2. Buildingoccupancyandoccupantbehavior
Buildingoccupancyandoccupantbehavior arerecognizedas crucialfactorsinfluencingthediscrepancybetweenpracticaland simulatedbuildingenergyconsumption.However,itisdifficultto investigatethemanalyticallyandthentodevelopreliable predic-tionmodelsduetotheircomplicatedcharacteristicsandstochastic nature(Yu,Haghighat,Fung,Morofsky,&Yoshino,2011a).Tomeet thischallenge,differentstochasticmodels(e.g.usingprobabilistic methods(Sun,Yan,Hong,&Guo,2014;Stoppel&Leite,2014)and theMarkovChainmethod(Muratori,Roberts,Sioshansi,Marano,& Rizzoni,2013))havebeenproposed.However,theproposed mod-elsareseverelyrestrainedbythelimitation:Themodelingprocess tendstobecomplex and advanced mathematicalknowledge is required.
In order to remove the above limitation, researchers have attempted to establish DM-based models. For example, Basu, Hawarah,Arghira,Joumaa,&Ploix, 2013)developeda decision-treebasedmodelforpredictingoccupantbehaviorattheappliance levelinresidentialbuildings.D’OcaandHong(2015)proposeda DMmethodologytomodelofficeoccupancypatternsand work-inguserprofilesbasedonbigdatastreams.Theyfoundthatthe decisiontreemethodissuitableforpredictingtheoccupancy pres-ence,supportedbyZhao,Lasternas,Lam,Yun,andLoftness(2014), whobuilttheoccupantbehaviorpredictionmodelsbasedon appli-ancepowerconsumptiondatainamedium-sizeofficebuilding.The resultsofbothstudiesindicatethatthemodelingaccuracyisvery satisfactory.However,thedevelopeddecision-treebasedmodels arestaticmodelsthatarehardtosimulatethedynamicnatureof buildingoccupancyandoccupantbehavior.
Themainfocusoffutureresearchshouldbeplacedontesting andcomparingotherdynamicDM-basedmodelsandintegrating themintobuildingenergy modelingprogramslikeTRNSYSand EnergyPlus.Inaddition,moreresearchneedstobeconductedso thatarchitectsanddesignerswillbenefitfrombridgingthegap betweenactualandpredictedbuildingenergyperformance.
2.1.3. Faultdetectiondiagnostics(FDD)forbuildingsystems Automatingthe process of detectingequipmentand system malfunctionsandmakingaproperdiagnosiscanhelptoensure stableoroptimalbuildingoperation.Intermsoftheapproachto formulatingthediagnostics,FDDmethodscanbecategorizedas model-based methods, which are based onprior knowledge of underlyingsystemphysics,anddata-drivenmethods,whichare basedonhistoricaldata(Katipamula&Brambley,2005).
Therequirementof priorknowledge,togetherwithcomplex modeling processes and heavy computationalburden, imposes severeconstraintsontheapplicationofthemodel-based meth-ods.Comparatively,thedatadrivenmethodsaremucheasiertouse sincethemodelsarenormallyautomaticallygenerated.VariousDM techniqueshavebeenemployedasdatadrivenmethodsforFDD.
Forexample,Magoulès,Zhao,&Elizondo,2013)developedanANN modelbasedontherecursivedeterministicperceptron(RDP)ANN toimplementFDDatthewholebuildinglevel.Moreover,they pro-posedanewfaultdiagnosticprocedureforidentifyingandranking abnormalequipmentintheorderoffaultrisk.OtherDMtechniques havealsobeenemployedforFDDsuchasclusteranalysis(Khan, Capozzoli,Corgnati,&Cerquitelli,2013),fuzzylogic(Lauroetal., 2014)andSVM(Mulumba,Afshari,Yan,Shen,&Norford,2015).
In order to takeadvantage of different DM techniques and achieve more robust models, hybrid approaches have been proposedforFDDsuchasthecombinationoftheSVMand autore-gressivemodelwithexogenousinputs(ARX)(Yan,Shen,Mulumba, &Afshari,2014)andthecombinationofANNandsubtractive clus-tering analysis(Du,Fan, Jin, &Chi,2014).Experimental results suggestthesehybridapproacheshavehigherpredictionaccuracy andlowerfalsealarmratesthantheapplicationofsinglemodels. However,theaboveDM-basedmodels,bothindividualandhybrid, focusedonhistoricaldataandcannotbeusedforreal-timeFDD.
Recently,researchers haveshown interestin developing on-lineFDDtoolsthatcanprovidereal-timeornear-real-timeupdates onmodelparametersandpredictions(Bonvini,Sohn,Granderson, Wetter, & Piette, 2014). DM techniques, particularly the SVM, have been utilized in these studies and shown great potential forreal-timeFDDsincetheypreventperformingnumerous time-consumingsimulations(Yanetal.,2014).Futureresearchshould focusonverifyingpredictionofmodelsbasedonarealworlddata set.
2.2. Descriptivetasks
2.2.1. Frameworkdevelopment
ThediversityofDMtechniquesandtheirfunctionalities neces-sitatesestablishingandprovidingaframeworkforuserswhomay nothavemuchknowledgeandexperiencewiththeapplicationof DM.Basically,suchframeworkneedstospecifywhichDM tech-nique/algorithmcanbeusedforbuildingapplicationandwhich kindofpatterncanbeminedcorrespondingly.Theultimate objec-tiveistohelpuserstodevelopasetofdataanalysismethodologies sothattheycansolvetheproblemsbasedontheframework. More-over,casestudiesalsoneedtobeconductedinordertodemonstrate itseffectiveness.Theframework,togetherwiththecasestudies, enablesuserstoanalyzebuilding-relateddatamoreefficiently.
BothgeneralandspecificDMframeworkshavebeenproposedin thepastseveralyears(Yu,Fung,&Haghighat,2013).General frame-worksprovideaDMtechniquepoolanddifferentpatternswillbe foundgivendifferentDMtechniquesareselectedandcombined. Specificframeworksaretargetedatexploringa specificpattern suchas occupancypatternsand designated DMtechniquesare used.
Regarding general frameworks, Yu et al. (2013) estab-lished a four-component framework consisting of DM tech-niques/algorithms.Theyalsoproposedastep-by-stepdataanalysis processthatstartsfromproblemdefinitiontoknowledge discov-ery.Fan,Xiao,andYan(2015)developedaframeworkthatmainly includesfourphases:dataexploration,datapartitioning, knowl-edgediscoveryandpostmining.Regardingspecificframeworks,
D’OcaandHong(2015)developedaframeworktodiscover occu-pancy patterns and working user profiles in office spaces. The decision-treemethodandclusteranalysiswerecombinedto pro-videinsightsintopatternsofoccupancyschedules.However,these frameworksinvolvelimitedDMtechniquesandsofarrelatively veryfewproblemshasbeentackledwiththeinvolvedDM tech-niques.
DMframeworkscandeliverasetofdataanalysismethodologies topracticaloperation,andthusefficientlygeneralizingtheusage ofDMtechniquestoawiderpopulation.Theyprovideanoverview
andastandardizedprocessoftheusageofDMtechniques. There-fore,moreframeworks,particularlyspecificframeworks,andcase studieswithhighreplicationpotentialshouldbedevelopedand appliedfordifferentapplications.
2.2.2. Occupantbehavior
DMtechniqueshavealsobeenemployedtoconductdescriptive tasksforoccupantbehaviorinseveralstudies.Thesestudiesfocus oneitheronetypeofspecificbehaviorinofficebuildingsorfull patternsofbehaviorinresidentialbuildings.Theirmainobjectives aretodiscoverbehaviorpatternsandtoidentifyandimprovethe effectofoccupantbehavioronbuildingenergyconsumption.The obtainedknowledgecanbeintegratedintobuildingenergy simu-lationprogramsespeciallywhensettingupbehaviorschedules.
D’Oca and Hong (2014) combined clustering analysis with
ARM in order to discover occupants’ window opening/closing behavior patterns in a natural ventilatedoffice building. Based onclusteranalysis,Yu,Fung,Haghighat,Yoshino,andMorofsky
(2011b)developedanewmethodologyforexaminingthe
influ-encesofoccupantbehavioronbuildingenergyconsumptionwhile removingtheeffect ofother factorssuchasclimateand build-ing envelopes.Based on clusteranalysis, classification analysis, andARM, Yuetal.(2011a) developedanothermethodologyfor identifyingenergy-inefficientbehaviorinresidentialbuildingsand providingoccupantswithfeasiblerecommendationstoimprove them. It is worth mentioning that the effective usage of DM techniques,particularlyclusteranalysis,for describingoccupant behaviorisheavilydependentonthesizeofestablisheddatabases, whichnormallyarerestrictedbyvariousfactorssuchasbudgets andeffortinpractice.
Duetoitscomplexityandrandomness,theinteractionsbetween occupant behaviorand otherinfluencing factorsare stillpoorly understood.FurtherstudiesontheDM-basedmethodologiesare activelyencouraged,andthecombinationofclusteranalysiswith ARMdeservesextraattention.
2.2.3. Buildingmodelingandoptimalcontrol
Buildingoperationaldatacontainsvaluableinformationthatcan beextractedformodelingandidentifyingthehiddenrelationship betweendifferentvariables,therebyimprovingbuildingoperation. Also,couplingtheinformationintobuildingcontrolsystemscan assist in optimizingbuildingperformance. Anumber of studies havealreadybeenconductedonsuchinformationextraction.
Withthegoal ofimprovingbuildingperformance, DM tech-niques have been employed to model different performance variablessuchasfurnaces’cycleidletime(Li,Miao,&Shi,2014), thermodynamicproperties of refrigerants (Küc¸üksille,Selbas¸, & S¸encan,2011),applianceusage(Basu,Debusschere,&Bacha,2012) andcoefficientofperformanceforrefrigerationequipment(Chou, Hsu,&Lin,2014).Forexample,Ahmed,Korres,Ploennigs,Elhadi, and Menzel(2011)usedNaïveBayes,decision treeandSVM to simplifytheproceduretoestablishmodelsforpredictinginternal naturallightingandthermalcomfort.Theresultsshowthehigh predictionaccuracyandreliabilityofthesetechniques.Basedon theBASdatabaseofthetallest buildinginHongKong, Xiaoand Fan(2014)firstappliedclusteringanalysistoidentifythetypical buildingenergyconsumptionpatterns,andthenappliedARMto eachclusterfordiscoveringthehiddenassociationsamongpower consumptionsofmajorequipmentsuchaschillersandpumps.Asa result,deficitflowinthechilledwatersystemwasfoundand abnor-maloperationoftheprimaryandsecondarypumpswasdetected. DMtechniqueshavealsobeenemployedforHVACsystem opti-malcontrolwiththeaimofminimizingtheenergyconsumption while maintaining indoorcomfort levels. Generally these tech-niquesareintegratedintocontrollersinordertotakeadvantageof theirstrengths.Forexample,basedonthefuzzylogicmethod,ANN
andGeneticAlgorithm,fuzzylogiccontrollers(Hussain,Gabbar, Bondarenko, Musharavati, & Pokharel, 2014), ANN controllers (Moon, Lee,&Kim, 2014)andGA-based controllers(Congradac &Kulic, 2009)were developed. These controllers have already been widely applied in the buildingfield and a large number of studies can be found in literature that is not listed in this paper.
Existingstudieshavealreadydemonstratedthecapabilitiesof DMtechniquesasameanstoeffectivelyhelpimprovingand opti-mizingbuildingperformance.Potential futureresearchincludes developingmoreaccurateandrobustmodels,particularlya com-binationofminingtechniques,that canbereadilyusable. Also, morepractical applicationsofDM-based controllersneedtobe conductedinthebuildingfieldinordertoevaluatetheirusability.
2.2.4. Discoveringandunderstandingenergyusepatterns
Buildingenergyconsumptionpatterns,bothfrequentand infre-quent,usuallycanbeusedtohelpidentifyoperationabnormalities andfaults.Effectivepatternrecognitionandcorresponding knowl-edgeextractionbyminingbuildinghistoricaldataorsimulation datacanleadtoenergysavingsandperformance improvement. Relevantstudiesatboththewholebuildinglevelandsub-system levelhavebeenconductedinrecentyears.
Atthewholebuildinglevel,Miller,Nagy,andSchlueter(2015)
appliedclusteringanalysistoaggregatefrequentenergyuse pat-ternssoastogenerateprofilesthatmaybenefitfuturesimulation modelcalibration.Lange, Rodriguez,Puech,and Vasques(2013)
developeda3DvisualizationtoolthatcanbeintegratedintoBASs tovisualizedatainanunderstandableway.Theworkispartofthe RIDERprojectthatcomposedofasetofDMtechniquesaimedat gaininginsightintobuildingenergyusepatterns.BasedonARM,
Yu,Haghighat,Fung,andZhou(2012)developedamethodology for examining all associations and correlations between build-ingoperationaldata,therebypinpointingenergywasteaswellas equipmentfaults.Byminingsimulationdata,Kim,Stumpf,andKim (2011)developedanoveralldataanalysisprocesstofindimportant patternsinordertoimprovetheenergyefficiencyofbuildingdesign duringtheearlydesignphase.
Atthesub-systemlevel,anincreasingnumberofstudies(Lai, Lai,Huang,&Chao,2013;Beckel,Sadamori,Staake,&Santini,2014; Rollins&Banerjee,2014)focusedontheinvestigationson appli-anceusagepatterns.Chicco(2012)provided anoverviewofthe previousliteratureonclusteringanalysisforgroupingand min-ingelectricalloadpatterns,particularlyresidentialdemandprofiles (Rhodes,Cole,Upshaw,Edgar,&Webber,2014).Fewstudiesfurther focusedonmorespecificapplianceusagepatternssuchaslighting (MottaCabrera&Zareipour,2013).
Patternsextractedfrombuildinghistoricdataandsimulation dataprovideadeepinsightintothewaybuildingoccupants con-sumeenergy.Utilities,policy-makers,buildingowners,designers andoperatorsmaybenefitgreatlyfromtheseextractedknowledge. Mostexistingstudiesarestillfocusedongeneralpatternanalysis overalongtimelength,limitedbylowresolutiondataavailable.In ordertodiscoverin-depthknowledgeofbuildingperformance,the focusoffuturestudiesneedstobeplacedonmoredetailedpatterns basedonhighresolutiondatasuchas1-mindata.
3. Futuredirectionsandchallenges—Concludingremarks
Thispaperreviewstheadvancesinthefieldofbuilding engi-neeringand DMapplicationsover thelast severalyears. It has beenshownthatDMtechniquesareaneffectivetoolforextracting hiddenbutusefulknowledgefrombuilding-relateddata,thereby improvingbuildingperformance.However,someimportant prob-lemslimitingtheapplicationsstillneedtobeaddressedandmore
effort is needed. The following described theencountered and anticipatedchallenges,potentialfuturedevelopmentsand recom-mendationsfortheapplicationofDMtechniquestothebuilding field:
(1)Low-qualitydataleadstolow-qualitydataminingresults.A keychallengeishowtoimprovedataqualityintermsof relia-bility,completeness,consistencyandresolution.Therearetwo major reasonsfor thelack of high-qualitydata: thelack of widely-acceptedbenchmarksforidentifyingtheparametersto bemonitoredandthelackofadequatesensorsforcollecting reliableandhighresolutiondatathattendtobecostly. More-over,forresidentialbuildings,theenergyconsumptiondataare collectedfromthemonthlyorannualbillsfromutility compa-nies.Thesehighlysummarizeddataareinsufficienttoconduct detaileddataanalysis.Thefactthatend-usedataonhousehold applianceusageisseldommonitoredaddsgreatdifficultiesto in-depthdataanalysis.
(2)Wideanddiverseapplicationshavebeenfoundandvarious DMtechniqueshavebeenemployed.However,consideringthat DMisarelativelyyoungandfast-growingdisciplineandnew techniquesemerges,clearlythepotentialofDMtoextract use-fulknowledgefrombuilding-relateddatahasnot beenfully exploited.Inthisview,buildingresearchersandengineersneed tokeepaneyeonthecurrenttrendtowardsthedevelopmentof DMtechniques,suchasscalableandconstraint-basedmining methods,aswellastheirapplications.
(3)Previous studies indicate that theapplications have several importantlimitations.Forexample,althoughadeeper under-standing of the data can be gained, the process of model constructionandknowledgeextractionreliesheavilyonusers’ priordomainknowledge(e.g.settingathresholdforARMand determiningtheoptimalnumberofclusters).Also,itisnot intu-itiveonhowtoutilizethediscoveredknowledge.Moreeffortis neededfortheexplorationofimprovingtheefficiencyandease oftheminingprocessandresultanalysis.Onepossibledirection totakeistheintroductionof“actionmining”or“profit min-ing”thathasshownsomesuccessinthebusinessworld.Their DMresultsarenotjustabunchofrules,butsomesuggested actionstomaximizetheprofitorrevenue.Similarconceptsmay beapplicableinthebuildingfield.
(4)Theincreasingvolume,velocityandvarietyofbuilding-related data usually result in databases with sizesand complexity beyondtheabilityoftraditional statisticalanalysismethods to process them efficiently. Such databases can be consid-eredas“BigData”thatisacurrenttopicunderdiscussionin Information, Communicationand Technology(ICT) in build-ings’experts.Basedonthedefinitionandexistingapplication of DM to building-related data,DM is expected to play an importantroleinestablishingeffective“BigData”management approaches.Fromthisperspective,itishighlyrecommended thatfutureresearchpaymoreattentiontothefollowingpoints: • PassingtheinformationontheusefulnessofDMapplications inthebuildingfieldtopolicymakersandbuildingownersso thattheywill,inturn,besupportiveformoreapplications; • formulating acceptable benchmarks for identifying the
parameterstobemonitored;
• lowering the cost of monitoring and accessing building-relateddata;
• developingmethodologiesbasedonnewDMtechniquesfor diverseapplications;
• improvingthepresentationandvisualizationofDMresults; and
• investigating multiple building blocks and communities besidesindividualbuildings.
Acknowledgements
ThisworkwasfinanciallysupportedbytheNationalNatural Science Foundation of China (No. 51408205), the Fundamental ResearchFundsfortheCentralUniversities,HunanProvincial Sci-ence and Technology Department Major Project of China (No. 2011FJ1007)andwasalsofundedbyELS.
References
Ahmad,A.S.,Hassan,M.Y.,Abdullah,M.P.,Rahman,H.A.,Hussin,F.,Abdullah,H., etal.(2014).AreviewonapplicationsofANNandSVMforbuildingelectrical energyconsumptionforecasting.RenewableandSustainableEnergyReviews,33, 102–109.
Thispaperprovidesanoverviewofthebuildingelectricalenergyprediction methodusingSVMandANN.Itsuggeststhatcombiningmultipleclassification modelsensurebetterforecastingperformance.Theproposedapproachis promising.
Ahmed,A.,Korres,N.E.,Ploennigs,J.,Elhadi,H.,&Menzel,K.(2011).Mining buildingperformancedataforenergy-efficientoperation.Advanced EngineeringInformatics,25,341–354.
Basu,K.,Debusschere,V.,&Bacha,S.(2012).Applianceusagepredictionusinga timeseriesbasedclassificationapproach.InIECON2012—38thAnnual ConferenceonIEEEIndustrialElectronicsSocietyMontreal,Canada.
Basu,K.,Hawarah,L.,Arghira,N.,Joumaa,H.,&Ploix,S.(2013).Apredictionsystem forhomeapplianceusage.EnergyandBuildings,67,668–679.
Beckel,C.,Sadamori,L.,Staake,T.,&Santini,S.(2014).Revealinghousehold characteristicsfromsmartmeterdata.Energy,78,397–410.
Bonvini,M.,Sohn,M.D.,Granderson,J.,Wetter,M.,&Piette,M.A.(2014).Robust on-linefaultdetectiondiagnosisforHVACcomponentsbasedonnonlinear stateestimationtechniques.AppliedEnergy,124,156–166.
Cabena,P.,Hadjinian,P.,Stadler,R.,Verhees,J.,&Zanasi,A.(1998).Discoveringdata mining:Fromconcepttoimplementation.UpperSaddleRiver,NJ:PrenticeHall. Chicco,G.(2012).Overviewandperformanceassessmentoftheclustering
methodsforelectricalloadpatterngrouping.Energy,42,68–80. Chou,J.-S.,&Bui,D.-K.(2014).Modelingheatingandcoolingloadsbyartificial
intelligenceforenergy-efficientbuildingdesign.EnergyandBuildings,82, 437–446.
Chou,J.-S.,Hsu,Y.-C.,&Lin,L.-T.(2014).Smartmetermonitoringanddatamining techniquesforpredictingrefrigerationsystemperformance.ExpertSystems withApplications,41,2144–2156.
Congradac,V.,&Kulic,F.(2009).HVACsystemoptimizationwithCO2
concentrationcontrolusinggeneticalgorithms.EnergyandBuildings,41, 571–577.
D’Oca,S.,&Hong,T.(2014).Adata-miningapproachtodiscoverpatternsof windowopeningandclosingbehaviorinoffices.BuildingandEnvironment,82, 726–739.
Theauthorsfirstimplementedclusteringanalysistothemeasureddatain differentofficesforobtainingpatternsincludingmotivations,durationand numberofopeningandtypicaldegreeofopeningpositions.Basedontheobtained information,ARMwasthenconductedandconsequentlytheoccupantswere dividedintotwotypicalworkinguserprofiles:physicalenvironmentaldrivenand contextualdriven.
D’Oca,S.,&Hong,T.(2015).Occupancyscheduleslearningprocessthroughadata miningframework.EnergyandBuildings,88,395–408.
Du,Z.,Fan,B.,Jin,X.,&Chi,J.(2014).Faultdetectionanddiagnosisforbuildings andHVACsystemsusingcombinedneuralnetworksandsubtractiveclustering analysis.BuildingandEnvironment,73,1–11.
Fan,C.,Xiao,F.,&Wang,S.(2014).Developmentofpredictionmodelsfornext-day buildingenergyconsumptionandpeakpowerdemandusingdatamining techniques.AppliedEnergy,127,1–10.
Fan,C.,Xiao,F.,&Yan,C.(2015).Aframeworkforknowledgediscoveryinmassive buildingautomationdataanditsapplicationinbuildingdiagnostics. AutomationinConstruction,50,81–90.
Hussain,S.,Gabbar,H.A.,Bondarenko,D.,Musharavati,F.,&Pokharel,S.(2014). Comfort-basedfuzzycontroloptimizationforenergyconservationinHVAC systems.ControlEngineeringPractice,32,172–182.
Jia,WeiHan,Kamber,M.,&Pei,J.(2012).Dataminingconceptsandtechniques(3rd ed.).Waltham,US:MorganKaufmann.
Katipamula,S.,&Brambley,M.R.(2005).Methodsforfaultdetection,diagnostics, andprognosticsforbuildingsystems—Areview,partI.HVAC&RResearch,11, 3–25.
Khan,I.,Capozzoli,A.,Corgnati,S.P.,&Cerquitelli,T.(2013).Faultdetection analysisofbuildingenergyconsumptionusingdataminingtechniques.Energy Procedia,42,557–566.
Kim,H.,Stumpf,A.,&Kim,W.(2011).Analysisofanenergyefficientbuilding designthroughdataminingapproach.AutomationinConstruction,20,37–43. Küc¸üksille,E.U.,Selbas¸,R.,&S¸encan,A.(2011).Predictionofthermodynamic
propertiesofrefrigerantsusingdatamining.EnergyConversionand Management,52,836–848.
Kumar,R.,Aggarwal,R.K.,&Sharma,J.D.(2013).Energyanalysisofabuilding usingartificialneuralnetwork:Areview.EnergyandBuildings,65,352–358.
Lai,Y.-X.,Lai,C.-F.,Huang,Y.-M.,&Chao,H.-C.(2013).Multi-appliancerecognition systemwithhybridSVM/GMMclassifierinubiquitoussmarthome. InformationSciences,230,39–55.
Lange,B.,Rodriguez,N.,Puech,W.,&Vasques,X.(2013).Discoveringunexpected informationusingabuildingenergyvisualizationtool.InProc.SPIE8650, Three-DimensionalImageProcessing(3DIP)andApplications.
Lauro,F.,Moretti,F.,Capozzoli,A.,Khan,I.,Pizzuti,S.,Macas,M.,etal.(2014). BuildingFanCoilElectricConsumptionAnalysiswithFuzzyApproachesfor FaultDetectionandDiagnosis.EnergyProcedia,62,411–420.
Li,Q.,Ren,P.,&Meng,Q.(2010).Predictionmodelofannualenergyconsumption ofresidentialbuildings.InProceedingsof2010internationalconferenceon advancesinenergyengineering.
Li,M.,Miao,L.,&Shi,J.(2014).Analyzingheatingequipment’soperationsbasedon measureddata.EnergyandBuildings,82,47–56.
Magoulès,F.,Zhao,H.-x.,&Elizondo,D.(2013).DevelopmentofanRDPneural networkforbuildingenergyconsumptionfaultdetectionanddiagnosis.Energy andBuildings,62,133–138.
Theauthorssimulatedthedailyelectricityconsumptionofdifferentelectric equipmentinanofficebuildingoverayear.Theparametersoftheequipmentare settodefaultandchangedvalues,indicatingnormalandfaultyconditions, respectively.Thegeneratednormalandfaultydatasetswerethenusedtotrainand testthefaultdetectionabilityofthedevelopedANNmodel.Experimentalresults demonstratethatthismodelishighlyaccurateinfaultdetection.
Miller,C.,Nagy,Z.,&Schlueter,A.(2015).Automateddailypatternfilteringof measuredbuildingperformancedata.AutomationinConstruction,49(PartA), 1–17.
Moon,J.W.,Lee,J.-H.,&Kim,S.(2014).Applicationofcontrollogicforoptimum indoorthermalenvironmentinbuildingswithdoubleskinenvelopesystems. EnergyandBuildings,85,59–71.
MottaCabrera,D.F.,&Zareipour,H.(2013).Dataassociationminingforidentifying lightingenergywastepatternsineducationalinstitutes.EnergyandBuildings, 62,210–216.
ThisstudyproposedanARM-basedmethodtoquantifyandunderstand lightingenergywastepatternsassociatedwithoccupancypatternsinclassrooms inpost-secondaryeducationalinstitutes.Simulationresultsshowthatuptoabout 40%savingscouldbeachievedbyimplementingthediscoveredrules.
Mulumba,T.,Afshari,A.,Yan,K.,Shen,W.,&Norford,L.K.(2015).Robust model-basedfaultdiagnosisforairhandlingunits.EnergyandBuildings,86, 698–707.
Muratori,M.,Roberts,M.C.,Sioshansi,R.,Marano,V.,&Rizzoni,G.(2013).Ahighly resolvedmodelingtechniquetosimulateresidentialpowerdemand.Applied Energy,107,465–473.
Rhodes,J.D.,Cole,W.J.,Upshaw,C.R.,Edgar,T.F.,&Webber,M.E.(2014). Clusteringanalysisofresidentialelectricitydemandprofiles.AppliedEnergy, 135,461–471.
Thisstudyclustered103homesinAustinbytheirhourlyelectricityuse patternsforeachseason.Theoptimalnumberofclusterswasdeterminedto representresidentialdemandprofiles.Theresultswillbenefitutilitiesfor formulatingpoliciesonloadshiftingandefficiencyupgraderebates.
Rollins,S.,&Banerjee,N.(2014).Usingruleminingtounderstandapplianceenergy consumptionpatterns.In2014IEEEInternationalConferenceonPervasive ComputingandCommunications(PerCom).
Stoppel,C.M.,&Leite,F.(2014).Integratingprobabilisticmethodsfordescribing occupantpresencewithbuildingenergysimulationmodels.Energyand Buildings,68(PartA),99–107.
Sun,K.,Yan,D.,Hong,T.,&Guo,S.(2014).Stochasticmodelingofovertime occupancyanditsapplicationinbuildingenergysimulationandcalibration. BuildingandEnvironment,79,1–12.
Usama,Fayyad,Gregory,Piatetsky-Shapiro,&Smyth,P.(1996).Fromdatamining toknowledgediscoveryindatabases.AIMagazine,17,37–54.
Xiao,F.,&Fan,C.(2014).Datamininginbuildingautomationsystemfor improvingbuildingoperationalperformance.EnergyandBuildings,75, 109–118.
Yan,K.,Shen,W.,Mulumba,T.,&Afshari,A.(2014).ARXmodelbasedfault detectionanddiagnosisforchillersusingsupportvectormachines.Energyand Buildings,81,287–295.
Yu,Z.,Haghighat,F.,Fung,B.C.M.,&Yoshino,H.(2010).Adecisiontreemethodfor buildingenergydemandmodeling.EnergyandBuildings,42,1637–1646. Yu,Z.,Haghighat,F.,Fung,B.C.M.,Morofsky,E.,&Yoshino,H.(2011).A
methodologyforidentifyingandimprovingoccupantbehaviorinresidential buildings.Energy,36,6596–6608.
Yu,Z.,Fung,B.C.M.,Haghighat,F.,Yoshino,H.,&Morofsky,E.(2011).Asystematic proceduretostudytheinfluenceofoccupantbehavioronbuildingenergy consumption.EnergyandBuildings,43,1409–1417.
Yu,Z.,Haghighat,F.,Fung,B.C.M.,&Zhou,L.(2012).Anovelmethodologyfor knowledgediscoverythroughminingassociationsbetweenbuilding operationaldata.EnergyandBuildings,47,430–440.
Yu,Z.,Fung,B.C.M.,&Haghighat,F.(2013).Extractingknowledgefrom building-relateddata-Adataminingframework.BuildingSimulation,6, 207–222.
Thispaperestablishedafour-componentframeworkconsistingofDM techniques/algorithms(includingclassificationanalysis,clusteranalysis,and ARM),potentialapplications,inputandoutput.Astep-by-stepdataanalysis processthatstartsfromproblemdefinitiontoknowledgediscoverywasalso proposed.Itdemonstratesthefeasibilityofapplyingtheprocessandframeworkin thebuildingfield.
Zhao,H.X.,&Magoulès,F.(2011).Newparallelsupportvectorregressionfor predictingbuildingenergyconsumption.InProceedingsof2011IEEE symposiumoncomputationalintelligenceinmulticriteriadecision-making. Zhao,H.-x.,&Magoulès,F.(2012).Areviewonthepredictionofbuildingenergy
consumption.RenewableandSustainableEnergyReviews,16,3586–3592.
Zhao,J.,Lasternas,B.,Lam,K.P.,Yun,R.,&Loftness,V.(2014).Occupantbehavior andschedulemodelingforbuildingenergysimulationthroughoffice appliancepowerconsumptiondatamining.EnergyandBuildings,82, 341–355.