• No results found

Advances and challenges in building engineering and data mining applications for energy-efficient communities

N/A
N/A
Protected

Academic year: 2021

Share "Advances and challenges in building engineering and data mining applications for energy-efficient communities"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

ContentslistsavailableatScienceDirect

Sustainable

Cities

and

Society

j o ur n a l ho me p a g e :w w w . e l s e v i e r . c o m / l o c a t e / s c s

Engineering

advance

Advances

and

challenges

in

building

engineering

and

data

mining

applications

for

energy-efficient

communities

Zhun

(Jerry)

Yu

a

,

Fariborz

Haghighat

b,∗

,

Benjamin

C.M.

Fung

c aCollegeofCivilEngineering,HunanUniversity,Changsha410082,Hunan,PRChina

bDepartmentofBuilding,CivilandEnvironmentalEngineering,ConcordiaUniversity,Montreal,QC,CanadaH3G1M8 cSchoolofInformationStudies,McGillUniversity,Montreal,QC,CanadaH3A1X1

a

r

t

i

c

l

e

i

n

f

o

Articlehistory:

Availableonline19December2015 Keywords:

DataMining BuildingEnergyUse OccupantBehavior BigData Review

a

b

s

t

r

a

c

t

Therapidlygrowingandgiganticbodyofstoreddatainthebuildingfield,coupledwiththeneedfordata analysis,hasgeneratedanurgentneedforpowerfultoolsthatcanextracthiddenbutusefulknowledge ofbuildingperformanceimprovementfromlargedatasets.Asanemergingsubfieldofcomputerscience, dataminingtechnologiessuitthisneedwellandhavebeenproposedforrelevantknowledgediscovery inthepastseveralyears.Aimedtohighlightrecentadvances,thispaperprovidesanoverviewofthe studiesundertakingthetwomaindataminingtasks(i.e.predictivetasksanddescriptivetasks)inthe buildingfield.Basedontheoverview,majorchallengesandfutureresearchtrendsarealsodiscussed.

©2015TheAuthors.PublishedbyElsevierLtd.ThisisanopenaccessarticleundertheCCBY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).

Contents

1. Introduction...33

2. DMapplicationsinthebuildingfield...34

2.1. Predictivetasks...34

2.1.1. Buildingenergydemandprediction...34

2.1.2. Buildingoccupancyandoccupantbehavior...34

2.1.3. Faultdetectiondiagnostics(FDD)forbuildingsystems...34

2.2. Descriptivetasks...35

2.2.1. Frameworkdevelopment...35

2.2.2. Occupantbehavior...35

2.2.3. Buildingmodelingandoptimalcontrol...35

2.2.4. Discoveringandunderstandingenergyusepatterns...36

3. Futuredirectionsandchallenges—Concludingremarks...36

Acknowledgements ... 37

References...37

1. Introduction

Currently theimportanceof improvingbuildingenergy per-formanceforsavingenergyandenhancingbuildingsustainability hasbeenwidelyrecognized.Oneeffectivewayofachievingthis objectiveistouncoverandextractusefulknowledgefrom build-ing operational data (e.g. temperature, flow rate, power and

夽 ThisisanEngineeringAdvancepaper. ∗ Correspondingauthor.

E-mailaddress:[email protected](F.Haghighat).

equipmentstates) that contains abundantvaluable information onactualbuildingperformance. Thewidespreaduseofbuilding automationsystems(BASs)enablesatremendousamountof build-ingoperationaldatatobestoredinbuildingdatabasesthatalso continue to expand. Thisrapidlygrowing and giganticbody of storeddata,coupledwiththeneedfor dataanalysis,has gener-atedanurgentneedforpowerfultoolsthatcanextracthiddenbut usefulknowledgefromlargebuildingdatabases.

Asanemergingandpromisingtechnology,datamining(DM) isapowerfulandversatiletooltoautomaticallyextractthe valu-able knowledge embeddedin huge amounts of data.It can be definedinmanydifferentways.AsdefinedbyCabena,Hadjinian,

http://dx.doi.org/10.1016/j.scs.2015.12.001

2210-6707/©2015TheAuthors.PublishedbyElsevierLtd.ThisisanopenaccessarticleundertheCCBY-NC-NDlicense(http://creativecommons.org/licenses/by-nc-nd/4. 0/).

(2)

Stadler,Verhees,and Zanasi(1998),DMis “aninterdisciplinary fieldbringingtogethertechniquesfrommachinelearning,pattern recognition,statistics,databases,andvisualizationtoaddressthe issueofinformationextractionfromlargedatabases.”Inthepast severaldecades,researchershavebeenvigorouslyandsuccessfully applyingDMinmanyscientific,medical,andapplicationdomains suchasbanking,bioinformaticsandnewmaterialsidentification. Recentlyithasalsobeenintroducedintothebuildingfieldthatisa well-fitapplicationareaforDM,sinceitgeneratesandcollectsvast amountsofdataonsystemoperation,occupantbehavior,power consumption,climaticconditionsandetc.

Ingeneral,DMincludessixcategoriesofwidelyacceptedand implementedtechniques(Usama,Gregory,&Smyth,1996):

• Dataclassification(e.g.thedecisiontreemethod,supportvector machine(SVM)andartificialneuralnetwork(ANN)),

• Clusteringanalysis,

• AssociationRuleMining(ARM), • Regression,

• Summarization,and • AnomalyDetection.

Readersmay refer to(Jia,Kamber, &Pei, 2012) for detailed informationofthesetechniques.Thesetechniquescanbebroadly categorizedintopredictivetasksanddescriptivetasks(Jiaetal., 2012).Thispaperreportsanoverviewoftherecentstudies under-takingthetwotasksinthebuildingfield.

2. DMapplicationsinthebuildingfield

2.1. Predictivetasks

2.1.1. Buildingenergydemandprediction

Thepredictionofbuildingenergydemandplaysanimportant rolein improvingbuildingperformance. Anaccurateprediction needstotakevarioussignificantinfluencing factors ofbuilding energydemand intoconsideration, suchasweather conditions, HVACequipment,buildingenvelopesandoccupantbehavior.The complexityanduncertaintyofthesefactorsfurtheradds difficul-tiestoimprovepredictionaccuracy.DMtechniquesmaymakea breakthroughindealingwiththiscomplexityanduncertainty.

Zhaoand Magoulès(2012)indicatedthatDMtechniquesare veryapplicabletobuildingenergydemandpredictionsincethey candeal withnon-linear problems. ANNand SVM arethe two most widely used DM techniques for this application (Ahmad etal.,2014).Kumar, Aggarwal,andSharma(2013)applied vari-ousANNmethods,includingbackpropagation,recurrentANN,auto associativeANNandgeneralregression ANN.TheadoptedANN architecturesignificantlyinfluencesthecoefficientofvariationthat rangesfrom2%to40%.TheyconcludedthatANNismoresuitable forthepredictionofalargesetofparametersthananystatistical techniques.Li,Ren,andMeng(2010)reportedthatinmanycases SVMshowshigherpredictionaccuracythanANN.However, train-ingSVMcanbeaveryslowprocessduetolargevolumeoftraining data.TheusageofparallelSVM(Zhao&Magoulès,2011)mightbe afeasiblealternative.

BothANNandSVMmodelsoperatelikea“blackbox”, mean-ingthatthemodelcanprovideapredictionbutcannotprovidea justificationforsupportingtheprediction.Inordertoovercome thislimitation,Yu,Haghighat,Fung,andYoshino(2010)developed abuildingenergy demandpredictivemodel basedonthe deci-siontreemethod.Itscompetitiveadvantageliesintheabilityto generateaccuratepredictivemodels(92%accuracyintheirstudy) withinterpretableflowchart-liketreestructuresthatenableusers toquicklyextractusefulinformation.However,thedecisiontree

methodisbasicallydevelopedforpredictingcategoricalvariables otherthanforpredictingnumericalvariables.

ConsideringthatdifferentDMtechniqueshavetheirown spe-cialties,strengthsandweaknesses,itisnoteasytofindthebest candidateforthebuildingenergydemandprediction.Therecent researchtrendis towardstheintegrationof differentDM tech-niquesformoreaccurateprediction.For example,ChouandBui

(2014) developed an ensemble model by combining ANN and

SVMtopredictcoolingand heatingdemand.Inordertopredict thenext-dayenergyconsumptionandpeakpowerdemand,Fan, Xiao,andWang(2014)developedensemblemodelsbycombining eightbaseDMmodelsandassigningeachDMmodalanoptimized weightbasedongeneticalgorithm(GA).Theresultsshowthatthe ensemblemodelsachievehigherpredictionaccuracythanthoseof individualbasemodels.

2.1.2. Buildingoccupancyandoccupantbehavior

Buildingoccupancyandoccupantbehavior arerecognizedas crucialfactorsinfluencingthediscrepancybetweenpracticaland simulatedbuildingenergyconsumption.However,itisdifficultto investigatethemanalyticallyandthentodevelopreliable predic-tionmodelsduetotheircomplicatedcharacteristicsandstochastic nature(Yu,Haghighat,Fung,Morofsky,&Yoshino,2011a).Tomeet thischallenge,differentstochasticmodels(e.g.usingprobabilistic methods(Sun,Yan,Hong,&Guo,2014;Stoppel&Leite,2014)and theMarkovChainmethod(Muratori,Roberts,Sioshansi,Marano,& Rizzoni,2013))havebeenproposed.However,theproposed mod-elsareseverelyrestrainedbythelimitation:Themodelingprocess tendstobecomplex and advanced mathematicalknowledge is required.

In order to remove the above limitation, researchers have attempted to establish DM-based models. For example, Basu, Hawarah,Arghira,Joumaa,&Ploix, 2013)developeda decision-treebasedmodelforpredictingoccupantbehaviorattheappliance levelinresidentialbuildings.D’OcaandHong(2015)proposeda DMmethodologytomodelofficeoccupancypatternsand work-inguserprofilesbasedonbigdatastreams.Theyfoundthatthe decisiontreemethodissuitableforpredictingtheoccupancy pres-ence,supportedbyZhao,Lasternas,Lam,Yun,andLoftness(2014), whobuilttheoccupantbehaviorpredictionmodelsbasedon appli-ancepowerconsumptiondatainamedium-sizeofficebuilding.The resultsofbothstudiesindicatethatthemodelingaccuracyisvery satisfactory.However,thedevelopeddecision-treebasedmodels arestaticmodelsthatarehardtosimulatethedynamicnatureof buildingoccupancyandoccupantbehavior.

Themainfocusoffutureresearchshouldbeplacedontesting andcomparingotherdynamicDM-basedmodelsandintegrating themintobuildingenergy modelingprogramslikeTRNSYSand EnergyPlus.Inaddition,moreresearchneedstobeconductedso thatarchitectsanddesignerswillbenefitfrombridgingthegap betweenactualandpredictedbuildingenergyperformance.

2.1.3. Faultdetectiondiagnostics(FDD)forbuildingsystems Automatingthe process of detectingequipmentand system malfunctionsandmakingaproperdiagnosiscanhelptoensure stableoroptimalbuildingoperation.Intermsoftheapproachto formulatingthediagnostics,FDDmethodscanbecategorizedas model-based methods, which are based onprior knowledge of underlyingsystemphysics,anddata-drivenmethods,whichare basedonhistoricaldata(Katipamula&Brambley,2005).

Therequirementof priorknowledge,togetherwithcomplex modeling processes and heavy computationalburden, imposes severeconstraintsontheapplicationofthemodel-based meth-ods.Comparatively,thedatadrivenmethodsaremucheasiertouse sincethemodelsarenormallyautomaticallygenerated.VariousDM techniqueshavebeenemployedasdatadrivenmethodsforFDD.

(3)

Forexample,Magoulès,Zhao,&Elizondo,2013)developedanANN modelbasedontherecursivedeterministicperceptron(RDP)ANN toimplementFDDatthewholebuildinglevel.Moreover,they pro-posedanewfaultdiagnosticprocedureforidentifyingandranking abnormalequipmentintheorderoffaultrisk.OtherDMtechniques havealsobeenemployedforFDDsuchasclusteranalysis(Khan, Capozzoli,Corgnati,&Cerquitelli,2013),fuzzylogic(Lauroetal., 2014)andSVM(Mulumba,Afshari,Yan,Shen,&Norford,2015).

In order to takeadvantage of different DM techniques and achieve more robust models, hybrid approaches have been proposedforFDDsuchasthecombinationoftheSVMand autore-gressivemodelwithexogenousinputs(ARX)(Yan,Shen,Mulumba, &Afshari,2014)andthecombinationofANNandsubtractive clus-tering analysis(Du,Fan, Jin, &Chi,2014).Experimental results suggestthesehybridapproacheshavehigherpredictionaccuracy andlowerfalsealarmratesthantheapplicationofsinglemodels. However,theaboveDM-basedmodels,bothindividualandhybrid, focusedonhistoricaldataandcannotbeusedforreal-timeFDD.

Recently,researchers haveshown interestin developing on-lineFDDtoolsthatcanprovidereal-timeornear-real-timeupdates onmodelparametersandpredictions(Bonvini,Sohn,Granderson, Wetter, & Piette, 2014). DM techniques, particularly the SVM, have been utilized in these studies and shown great potential forreal-timeFDDsincetheypreventperformingnumerous time-consumingsimulations(Yanetal.,2014).Futureresearchshould focusonverifyingpredictionofmodelsbasedonarealworlddata set.

2.2. Descriptivetasks

2.2.1. Frameworkdevelopment

ThediversityofDMtechniquesandtheirfunctionalities neces-sitatesestablishingandprovidingaframeworkforuserswhomay nothavemuchknowledgeandexperiencewiththeapplicationof DM.Basically,suchframeworkneedstospecifywhichDM tech-nique/algorithmcanbeusedforbuildingapplicationandwhich kindofpatterncanbeminedcorrespondingly.Theultimate objec-tiveistohelpuserstodevelopasetofdataanalysismethodologies sothattheycansolvetheproblemsbasedontheframework. More-over,casestudiesalsoneedtobeconductedinordertodemonstrate itseffectiveness.Theframework,togetherwiththecasestudies, enablesuserstoanalyzebuilding-relateddatamoreefficiently.

BothgeneralandspecificDMframeworkshavebeenproposedin thepastseveralyears(Yu,Fung,&Haghighat,2013).General frame-worksprovideaDMtechniquepoolanddifferentpatternswillbe foundgivendifferentDMtechniquesareselectedandcombined. Specificframeworksaretargetedatexploringa specificpattern suchas occupancypatternsand designated DMtechniquesare used.

Regarding general frameworks, Yu et al. (2013) estab-lished a four-component framework consisting of DM tech-niques/algorithms.Theyalsoproposedastep-by-stepdataanalysis processthatstartsfromproblemdefinitiontoknowledge discov-ery.Fan,Xiao,andYan(2015)developedaframeworkthatmainly includesfourphases:dataexploration,datapartitioning, knowl-edgediscoveryandpostmining.Regardingspecificframeworks,

D’OcaandHong(2015)developedaframeworktodiscover occu-pancy patterns and working user profiles in office spaces. The decision-treemethodandclusteranalysiswerecombinedto pro-videinsightsintopatternsofoccupancyschedules.However,these frameworksinvolvelimitedDMtechniquesandsofarrelatively veryfewproblemshasbeentackledwiththeinvolvedDM tech-niques.

DMframeworkscandeliverasetofdataanalysismethodologies topracticaloperation,andthusefficientlygeneralizingtheusage ofDMtechniquestoawiderpopulation.Theyprovideanoverview

andastandardizedprocessoftheusageofDMtechniques. There-fore,moreframeworks,particularlyspecificframeworks,andcase studieswithhighreplicationpotentialshouldbedevelopedand appliedfordifferentapplications.

2.2.2. Occupantbehavior

DMtechniqueshavealsobeenemployedtoconductdescriptive tasksforoccupantbehaviorinseveralstudies.Thesestudiesfocus oneitheronetypeofspecificbehaviorinofficebuildingsorfull patternsofbehaviorinresidentialbuildings.Theirmainobjectives aretodiscoverbehaviorpatternsandtoidentifyandimprovethe effectofoccupantbehavioronbuildingenergyconsumption.The obtainedknowledgecanbeintegratedintobuildingenergy simu-lationprogramsespeciallywhensettingupbehaviorschedules.

D’Oca and Hong (2014) combined clustering analysis with

ARM in order to discover occupants’ window opening/closing behavior patterns in a natural ventilatedoffice building. Based onclusteranalysis,Yu,Fung,Haghighat,Yoshino,andMorofsky

(2011b)developedanewmethodologyforexaminingthe

influ-encesofoccupantbehavioronbuildingenergyconsumptionwhile removingtheeffect ofother factorssuchasclimateand build-ing envelopes.Based on clusteranalysis, classification analysis, andARM, Yuetal.(2011a) developedanothermethodologyfor identifyingenergy-inefficientbehaviorinresidentialbuildingsand providingoccupantswithfeasiblerecommendationstoimprove them. It is worth mentioning that the effective usage of DM techniques,particularlyclusteranalysis,for describingoccupant behaviorisheavilydependentonthesizeofestablisheddatabases, whichnormallyarerestrictedbyvariousfactorssuchasbudgets andeffortinpractice.

Duetoitscomplexityandrandomness,theinteractionsbetween occupant behaviorand otherinfluencing factorsare stillpoorly understood.FurtherstudiesontheDM-basedmethodologiesare activelyencouraged,andthecombinationofclusteranalysiswith ARMdeservesextraattention.

2.2.3. Buildingmodelingandoptimalcontrol

Buildingoperationaldatacontainsvaluableinformationthatcan beextractedformodelingandidentifyingthehiddenrelationship betweendifferentvariables,therebyimprovingbuildingoperation. Also,couplingtheinformationintobuildingcontrolsystemscan assist in optimizingbuildingperformance. Anumber of studies havealreadybeenconductedonsuchinformationextraction.

Withthegoal ofimprovingbuildingperformance, DM tech-niques have been employed to model different performance variablessuchasfurnaces’cycleidletime(Li,Miao,&Shi,2014), thermodynamicproperties of refrigerants (Küc¸üksille,Selbas¸, & S¸encan,2011),applianceusage(Basu,Debusschere,&Bacha,2012) andcoefficientofperformanceforrefrigerationequipment(Chou, Hsu,&Lin,2014).Forexample,Ahmed,Korres,Ploennigs,Elhadi, and Menzel(2011)usedNaïveBayes,decision treeandSVM to simplifytheproceduretoestablishmodelsforpredictinginternal naturallightingandthermalcomfort.Theresultsshowthehigh predictionaccuracyandreliabilityofthesetechniques.Basedon theBASdatabaseofthetallest buildinginHongKong, Xiaoand Fan(2014)firstappliedclusteringanalysistoidentifythetypical buildingenergyconsumptionpatterns,andthenappliedARMto eachclusterfordiscoveringthehiddenassociationsamongpower consumptionsofmajorequipmentsuchaschillersandpumps.Asa result,deficitflowinthechilledwatersystemwasfoundand abnor-maloperationoftheprimaryandsecondarypumpswasdetected. DMtechniqueshavealsobeenemployedforHVACsystem opti-malcontrolwiththeaimofminimizingtheenergyconsumption while maintaining indoorcomfort levels. Generally these tech-niquesareintegratedintocontrollersinordertotakeadvantageof theirstrengths.Forexample,basedonthefuzzylogicmethod,ANN

(4)

andGeneticAlgorithm,fuzzylogiccontrollers(Hussain,Gabbar, Bondarenko, Musharavati, & Pokharel, 2014), ANN controllers (Moon, Lee,&Kim, 2014)andGA-based controllers(Congradac &Kulic, 2009)were developed. These controllers have already been widely applied in the buildingfield and a large number of studies can be found in literature that is not listed in this paper.

Existingstudieshavealreadydemonstratedthecapabilitiesof DMtechniquesasameanstoeffectivelyhelpimprovingand opti-mizingbuildingperformance.Potential futureresearchincludes developingmoreaccurateandrobustmodels,particularlya com-binationofminingtechniques,that canbereadilyusable. Also, morepractical applicationsofDM-based controllersneedtobe conductedinthebuildingfieldinordertoevaluatetheirusability.

2.2.4. Discoveringandunderstandingenergyusepatterns

Buildingenergyconsumptionpatterns,bothfrequentand infre-quent,usuallycanbeusedtohelpidentifyoperationabnormalities andfaults.Effectivepatternrecognitionandcorresponding knowl-edgeextractionbyminingbuildinghistoricaldataorsimulation datacanleadtoenergysavingsandperformance improvement. Relevantstudiesatboththewholebuildinglevelandsub-system levelhavebeenconductedinrecentyears.

Atthewholebuildinglevel,Miller,Nagy,andSchlueter(2015)

appliedclusteringanalysistoaggregatefrequentenergyuse pat-ternssoastogenerateprofilesthatmaybenefitfuturesimulation modelcalibration.Lange, Rodriguez,Puech,and Vasques(2013)

developeda3DvisualizationtoolthatcanbeintegratedintoBASs tovisualizedatainanunderstandableway.Theworkispartofthe RIDERprojectthatcomposedofasetofDMtechniquesaimedat gaininginsightintobuildingenergyusepatterns.BasedonARM,

Yu,Haghighat,Fung,andZhou(2012)developedamethodology for examining all associations and correlations between build-ingoperationaldata,therebypinpointingenergywasteaswellas equipmentfaults.Byminingsimulationdata,Kim,Stumpf,andKim (2011)developedanoveralldataanalysisprocesstofindimportant patternsinordertoimprovetheenergyefficiencyofbuildingdesign duringtheearlydesignphase.

Atthesub-systemlevel,anincreasingnumberofstudies(Lai, Lai,Huang,&Chao,2013;Beckel,Sadamori,Staake,&Santini,2014; Rollins&Banerjee,2014)focusedontheinvestigationson appli-anceusagepatterns.Chicco(2012)provided anoverviewofthe previousliteratureonclusteringanalysisforgroupingand min-ingelectricalloadpatterns,particularlyresidentialdemandprofiles (Rhodes,Cole,Upshaw,Edgar,&Webber,2014).Fewstudiesfurther focusedonmorespecificapplianceusagepatternssuchaslighting (MottaCabrera&Zareipour,2013).

Patternsextractedfrombuildinghistoricdataandsimulation dataprovideadeepinsightintothewaybuildingoccupants con-sumeenergy.Utilities,policy-makers,buildingowners,designers andoperatorsmaybenefitgreatlyfromtheseextractedknowledge. Mostexistingstudiesarestillfocusedongeneralpatternanalysis overalongtimelength,limitedbylowresolutiondataavailable.In ordertodiscoverin-depthknowledgeofbuildingperformance,the focusoffuturestudiesneedstobeplacedonmoredetailedpatterns basedonhighresolutiondatasuchas1-mindata.

3. Futuredirectionsandchallenges—Concludingremarks

Thispaperreviewstheadvancesinthefieldofbuilding engi-neeringand DMapplicationsover thelast severalyears. It has beenshownthatDMtechniquesareaneffectivetoolforextracting hiddenbutusefulknowledgefrombuilding-relateddata,thereby improvingbuildingperformance.However,someimportant prob-lemslimitingtheapplicationsstillneedtobeaddressedandmore

effort is needed. The following described theencountered and anticipatedchallenges,potentialfuturedevelopmentsand recom-mendationsfortheapplicationofDMtechniquestothebuilding field:

(1)Low-qualitydataleadstolow-qualitydataminingresults.A keychallengeishowtoimprovedataqualityintermsof relia-bility,completeness,consistencyandresolution.Therearetwo major reasonsfor thelack of high-qualitydata: thelack of widely-acceptedbenchmarksforidentifyingtheparametersto bemonitoredandthelackofadequatesensorsforcollecting reliableandhighresolutiondatathattendtobecostly. More-over,forresidentialbuildings,theenergyconsumptiondataare collectedfromthemonthlyorannualbillsfromutility compa-nies.Thesehighlysummarizeddataareinsufficienttoconduct detaileddataanalysis.Thefactthatend-usedataonhousehold applianceusageisseldommonitoredaddsgreatdifficultiesto in-depthdataanalysis.

(2)Wideanddiverseapplicationshavebeenfoundandvarious DMtechniqueshavebeenemployed.However,consideringthat DMisarelativelyyoungandfast-growingdisciplineandnew techniquesemerges,clearlythepotentialofDMtoextract use-fulknowledgefrombuilding-relateddatahasnot beenfully exploited.Inthisview,buildingresearchersandengineersneed tokeepaneyeonthecurrenttrendtowardsthedevelopmentof DMtechniques,suchasscalableandconstraint-basedmining methods,aswellastheirapplications.

(3)Previous studies indicate that theapplications have several importantlimitations.Forexample,althoughadeeper under-standing of the data can be gained, the process of model constructionandknowledgeextractionreliesheavilyonusers’ priordomainknowledge(e.g.settingathresholdforARMand determiningtheoptimalnumberofclusters).Also,itisnot intu-itiveonhowtoutilizethediscoveredknowledge.Moreeffortis neededfortheexplorationofimprovingtheefficiencyandease oftheminingprocessandresultanalysis.Onepossibledirection totakeistheintroductionof“actionmining”or“profit min-ing”thathasshownsomesuccessinthebusinessworld.Their DMresultsarenotjustabunchofrules,butsomesuggested actionstomaximizetheprofitorrevenue.Similarconceptsmay beapplicableinthebuildingfield.

(4)Theincreasingvolume,velocityandvarietyofbuilding-related data usually result in databases with sizesand complexity beyondtheabilityoftraditional statisticalanalysismethods to process them efficiently. Such databases can be consid-eredas“BigData”thatisacurrenttopicunderdiscussionin Information, Communicationand Technology(ICT) in build-ings’experts.Basedonthedefinitionandexistingapplication of DM to building-related data,DM is expected to play an importantroleinestablishingeffective“BigData”management approaches.Fromthisperspective,itishighlyrecommended thatfutureresearchpaymoreattentiontothefollowingpoints: • PassingtheinformationontheusefulnessofDMapplications inthebuildingfieldtopolicymakersandbuildingownersso thattheywill,inturn,besupportiveformoreapplications; • formulating acceptable benchmarks for identifying the

parameterstobemonitored;

• lowering the cost of monitoring and accessing building-relateddata;

• developingmethodologiesbasedonnewDMtechniquesfor diverseapplications;

• improvingthepresentationandvisualizationofDMresults; and

• investigating multiple building blocks and communities besidesindividualbuildings.

(5)

Acknowledgements

ThisworkwasfinanciallysupportedbytheNationalNatural Science Foundation of China (No. 51408205), the Fundamental ResearchFundsfortheCentralUniversities,HunanProvincial Sci-ence and Technology Department Major Project of China (No. 2011FJ1007)andwasalsofundedbyELS.

References

Ahmad,A.S.,Hassan,M.Y.,Abdullah,M.P.,Rahman,H.A.,Hussin,F.,Abdullah,H., etal.(2014).AreviewonapplicationsofANNandSVMforbuildingelectrical energyconsumptionforecasting.RenewableandSustainableEnergyReviews,33, 102–109.

Thispaperprovidesanoverviewofthebuildingelectricalenergyprediction methodusingSVMandANN.Itsuggeststhatcombiningmultipleclassification modelsensurebetterforecastingperformance.Theproposedapproachis promising.

Ahmed,A.,Korres,N.E.,Ploennigs,J.,Elhadi,H.,&Menzel,K.(2011).Mining buildingperformancedataforenergy-efficientoperation.Advanced EngineeringInformatics,25,341–354.

Basu,K.,Debusschere,V.,&Bacha,S.(2012).Applianceusagepredictionusinga timeseriesbasedclassificationapproach.InIECON2012—38thAnnual ConferenceonIEEEIndustrialElectronicsSocietyMontreal,Canada.

Basu,K.,Hawarah,L.,Arghira,N.,Joumaa,H.,&Ploix,S.(2013).Apredictionsystem forhomeapplianceusage.EnergyandBuildings,67,668–679.

Beckel,C.,Sadamori,L.,Staake,T.,&Santini,S.(2014).Revealinghousehold characteristicsfromsmartmeterdata.Energy,78,397–410.

Bonvini,M.,Sohn,M.D.,Granderson,J.,Wetter,M.,&Piette,M.A.(2014).Robust on-linefaultdetectiondiagnosisforHVACcomponentsbasedonnonlinear stateestimationtechniques.AppliedEnergy,124,156–166.

Cabena,P.,Hadjinian,P.,Stadler,R.,Verhees,J.,&Zanasi,A.(1998).Discoveringdata mining:Fromconcepttoimplementation.UpperSaddleRiver,NJ:PrenticeHall. Chicco,G.(2012).Overviewandperformanceassessmentoftheclustering

methodsforelectricalloadpatterngrouping.Energy,42,68–80. Chou,J.-S.,&Bui,D.-K.(2014).Modelingheatingandcoolingloadsbyartificial

intelligenceforenergy-efficientbuildingdesign.EnergyandBuildings,82, 437–446.

Chou,J.-S.,Hsu,Y.-C.,&Lin,L.-T.(2014).Smartmetermonitoringanddatamining techniquesforpredictingrefrigerationsystemperformance.ExpertSystems withApplications,41,2144–2156.

Congradac,V.,&Kulic,F.(2009).HVACsystemoptimizationwithCO2

concentrationcontrolusinggeneticalgorithms.EnergyandBuildings,41, 571–577.

D’Oca,S.,&Hong,T.(2014).Adata-miningapproachtodiscoverpatternsof windowopeningandclosingbehaviorinoffices.BuildingandEnvironment,82, 726–739.

Theauthorsfirstimplementedclusteringanalysistothemeasureddatain differentofficesforobtainingpatternsincludingmotivations,durationand numberofopeningandtypicaldegreeofopeningpositions.Basedontheobtained information,ARMwasthenconductedandconsequentlytheoccupantswere dividedintotwotypicalworkinguserprofiles:physicalenvironmentaldrivenand contextualdriven.

D’Oca,S.,&Hong,T.(2015).Occupancyscheduleslearningprocessthroughadata miningframework.EnergyandBuildings,88,395–408.

Du,Z.,Fan,B.,Jin,X.,&Chi,J.(2014).Faultdetectionanddiagnosisforbuildings andHVACsystemsusingcombinedneuralnetworksandsubtractiveclustering analysis.BuildingandEnvironment,73,1–11.

Fan,C.,Xiao,F.,&Wang,S.(2014).Developmentofpredictionmodelsfornext-day buildingenergyconsumptionandpeakpowerdemandusingdatamining techniques.AppliedEnergy,127,1–10.

Fan,C.,Xiao,F.,&Yan,C.(2015).Aframeworkforknowledgediscoveryinmassive buildingautomationdataanditsapplicationinbuildingdiagnostics. AutomationinConstruction,50,81–90.

Hussain,S.,Gabbar,H.A.,Bondarenko,D.,Musharavati,F.,&Pokharel,S.(2014). Comfort-basedfuzzycontroloptimizationforenergyconservationinHVAC systems.ControlEngineeringPractice,32,172–182.

Jia,WeiHan,Kamber,M.,&Pei,J.(2012).Dataminingconceptsandtechniques(3rd ed.).Waltham,US:MorganKaufmann.

Katipamula,S.,&Brambley,M.R.(2005).Methodsforfaultdetection,diagnostics, andprognosticsforbuildingsystems—Areview,partI.HVAC&RResearch,11, 3–25.

Khan,I.,Capozzoli,A.,Corgnati,S.P.,&Cerquitelli,T.(2013).Faultdetection analysisofbuildingenergyconsumptionusingdataminingtechniques.Energy Procedia,42,557–566.

Kim,H.,Stumpf,A.,&Kim,W.(2011).Analysisofanenergyefficientbuilding designthroughdataminingapproach.AutomationinConstruction,20,37–43. Küc¸üksille,E.U.,Selbas¸,R.,&S¸encan,A.(2011).Predictionofthermodynamic

propertiesofrefrigerantsusingdatamining.EnergyConversionand Management,52,836–848.

Kumar,R.,Aggarwal,R.K.,&Sharma,J.D.(2013).Energyanalysisofabuilding usingartificialneuralnetwork:Areview.EnergyandBuildings,65,352–358.

Lai,Y.-X.,Lai,C.-F.,Huang,Y.-M.,&Chao,H.-C.(2013).Multi-appliancerecognition systemwithhybridSVM/GMMclassifierinubiquitoussmarthome. InformationSciences,230,39–55.

Lange,B.,Rodriguez,N.,Puech,W.,&Vasques,X.(2013).Discoveringunexpected informationusingabuildingenergyvisualizationtool.InProc.SPIE8650, Three-DimensionalImageProcessing(3DIP)andApplications.

Lauro,F.,Moretti,F.,Capozzoli,A.,Khan,I.,Pizzuti,S.,Macas,M.,etal.(2014). BuildingFanCoilElectricConsumptionAnalysiswithFuzzyApproachesfor FaultDetectionandDiagnosis.EnergyProcedia,62,411–420.

Li,Q.,Ren,P.,&Meng,Q.(2010).Predictionmodelofannualenergyconsumption ofresidentialbuildings.InProceedingsof2010internationalconferenceon advancesinenergyengineering.

Li,M.,Miao,L.,&Shi,J.(2014).Analyzingheatingequipment’soperationsbasedon measureddata.EnergyandBuildings,82,47–56.

Magoulès,F.,Zhao,H.-x.,&Elizondo,D.(2013).DevelopmentofanRDPneural networkforbuildingenergyconsumptionfaultdetectionanddiagnosis.Energy andBuildings,62,133–138.

Theauthorssimulatedthedailyelectricityconsumptionofdifferentelectric equipmentinanofficebuildingoverayear.Theparametersoftheequipmentare settodefaultandchangedvalues,indicatingnormalandfaultyconditions, respectively.Thegeneratednormalandfaultydatasetswerethenusedtotrainand testthefaultdetectionabilityofthedevelopedANNmodel.Experimentalresults demonstratethatthismodelishighlyaccurateinfaultdetection.

Miller,C.,Nagy,Z.,&Schlueter,A.(2015).Automateddailypatternfilteringof measuredbuildingperformancedata.AutomationinConstruction,49(PartA), 1–17.

Moon,J.W.,Lee,J.-H.,&Kim,S.(2014).Applicationofcontrollogicforoptimum indoorthermalenvironmentinbuildingswithdoubleskinenvelopesystems. EnergyandBuildings,85,59–71.

MottaCabrera,D.F.,&Zareipour,H.(2013).Dataassociationminingforidentifying lightingenergywastepatternsineducationalinstitutes.EnergyandBuildings, 62,210–216.

ThisstudyproposedanARM-basedmethodtoquantifyandunderstand lightingenergywastepatternsassociatedwithoccupancypatternsinclassrooms inpost-secondaryeducationalinstitutes.Simulationresultsshowthatuptoabout 40%savingscouldbeachievedbyimplementingthediscoveredrules.

Mulumba,T.,Afshari,A.,Yan,K.,Shen,W.,&Norford,L.K.(2015).Robust model-basedfaultdiagnosisforairhandlingunits.EnergyandBuildings,86, 698–707.

Muratori,M.,Roberts,M.C.,Sioshansi,R.,Marano,V.,&Rizzoni,G.(2013).Ahighly resolvedmodelingtechniquetosimulateresidentialpowerdemand.Applied Energy,107,465–473.

Rhodes,J.D.,Cole,W.J.,Upshaw,C.R.,Edgar,T.F.,&Webber,M.E.(2014). Clusteringanalysisofresidentialelectricitydemandprofiles.AppliedEnergy, 135,461–471.

Thisstudyclustered103homesinAustinbytheirhourlyelectricityuse patternsforeachseason.Theoptimalnumberofclusterswasdeterminedto representresidentialdemandprofiles.Theresultswillbenefitutilitiesfor formulatingpoliciesonloadshiftingandefficiencyupgraderebates.

Rollins,S.,&Banerjee,N.(2014).Usingruleminingtounderstandapplianceenergy consumptionpatterns.In2014IEEEInternationalConferenceonPervasive ComputingandCommunications(PerCom).

Stoppel,C.M.,&Leite,F.(2014).Integratingprobabilisticmethodsfordescribing occupantpresencewithbuildingenergysimulationmodels.Energyand Buildings,68(PartA),99–107.

Sun,K.,Yan,D.,Hong,T.,&Guo,S.(2014).Stochasticmodelingofovertime occupancyanditsapplicationinbuildingenergysimulationandcalibration. BuildingandEnvironment,79,1–12.

Usama,Fayyad,Gregory,Piatetsky-Shapiro,&Smyth,P.(1996).Fromdatamining toknowledgediscoveryindatabases.AIMagazine,17,37–54.

Xiao,F.,&Fan,C.(2014).Datamininginbuildingautomationsystemfor improvingbuildingoperationalperformance.EnergyandBuildings,75, 109–118.

Yan,K.,Shen,W.,Mulumba,T.,&Afshari,A.(2014).ARXmodelbasedfault detectionanddiagnosisforchillersusingsupportvectormachines.Energyand Buildings,81,287–295.

Yu,Z.,Haghighat,F.,Fung,B.C.M.,&Yoshino,H.(2010).Adecisiontreemethodfor buildingenergydemandmodeling.EnergyandBuildings,42,1637–1646. Yu,Z.,Haghighat,F.,Fung,B.C.M.,Morofsky,E.,&Yoshino,H.(2011).A

methodologyforidentifyingandimprovingoccupantbehaviorinresidential buildings.Energy,36,6596–6608.

Yu,Z.,Fung,B.C.M.,Haghighat,F.,Yoshino,H.,&Morofsky,E.(2011).Asystematic proceduretostudytheinfluenceofoccupantbehavioronbuildingenergy consumption.EnergyandBuildings,43,1409–1417.

Yu,Z.,Haghighat,F.,Fung,B.C.M.,&Zhou,L.(2012).Anovelmethodologyfor knowledgediscoverythroughminingassociationsbetweenbuilding operationaldata.EnergyandBuildings,47,430–440.

Yu,Z.,Fung,B.C.M.,&Haghighat,F.(2013).Extractingknowledgefrom building-relateddata-Adataminingframework.BuildingSimulation,6, 207–222.

Thispaperestablishedafour-componentframeworkconsistingofDM techniques/algorithms(includingclassificationanalysis,clusteranalysis,and ARM),potentialapplications,inputandoutput.Astep-by-stepdataanalysis processthatstartsfromproblemdefinitiontoknowledgediscoverywasalso proposed.Itdemonstratesthefeasibilityofapplyingtheprocessandframeworkin thebuildingfield.

(6)

Zhao,H.X.,&Magoulès,F.(2011).Newparallelsupportvectorregressionfor predictingbuildingenergyconsumption.InProceedingsof2011IEEE symposiumoncomputationalintelligenceinmulticriteriadecision-making. Zhao,H.-x.,&Magoulès,F.(2012).Areviewonthepredictionofbuildingenergy

consumption.RenewableandSustainableEnergyReviews,16,3586–3592.

Zhao,J.,Lasternas,B.,Lam,K.P.,Yun,R.,&Loftness,V.(2014).Occupantbehavior andschedulemodelingforbuildingenergysimulationthroughoffice appliancepowerconsumptiondatamining.EnergyandBuildings,82, 341–355.

References

Related documents

It was decided that with the presence of such significant red flag signs that she should undergo advanced imaging, in this case an MRI, that revealed an underlying malignancy, which

Twenty-five percent of our respondents listed unilateral hearing loss as an indication for BAHA im- plantation, and only 17% routinely offered this treatment to children with

Self help Groups have been playing considerable role in training of Swarozgaris, infrastructure development, marketing and technology support, communication level

Use of PPMC was also signi fi cantly associated with place of PNC access (X 2 = 784.6, P < 0.001) More women (27.6%) who accessed PNC in private health facilities used PPMC

19% serve a county. Fourteen per cent of the centers provide service for adjoining states in addition to the states in which they are located; usually these adjoining states have

Standardization of herbal raw drugs include passport data of raw plant drugs, botanical authentification, microscopic & molecular examination, identification of

The degree of resistance exhibited after 1, 10 and 20 subcultures in broth in the absence of strepto- mycin was tested by comparing the number of colonies which grew from the