• No results found

Developing and evaluating a machine learning based algorithm to predict the need of pediatric intensive care unit transfer for newly hospitalized children

N/A
N/A
Protected

Academic year: 2021

Share "Developing and evaluating a machine learning based algorithm to predict the need of pediatric intensive care unit transfer for newly hospitalized children"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

ContentslistsavailableatScienceDirect

Resuscitation

jo u r n al hom ep age:w w w . e l s e v i e r . c o m / l o c a t e / r e s u s c i t a t i o n

Developing

and

evaluating

a

machine

learning

based

algorithm

to

predict

the

need

of

pediatric

intensive

care

unit

transfer

for

newly

hospitalized

children

Haijun

Zhai

a

,

Patrick

Brady

b,d

,

Qi

Li

a

,

Todd

Lingren

a

,

Yizhao

Ni

a

,

Derek

S.

Wheeler

c

,

Imre

Solti

a,d,∗

aDivisionofBiomedicalInformatics,CincinnatiChildren’sHospitalMedicalCenter,Cincinnati,OH,USA bDivisionofHospitalMedicine,CincinnatiChildren’sHospitalMedicalCenter,Cincinnati,OH,USA cDivisionofCriticalCareMedicine,CincinnatiChildren’sHospitalMedicalCenter,Cincinnati,OH,USA

dJamesM.AndersonCenterforHealthSystemsExcellence,CincinnatiChildren’sHospitalMedicalCenter,Cincinnati,OH,USA

a

r

t

i

c

l

e

i

n

f

o

Articlehistory:

Received27September2013

Receivedinrevisedform20February2014 Accepted8April2014

Keywords:

Clinicalstatusdeterioration Clinicalcare PICU PEWS Machinelearning EHR

a

b

s

t

r

a

c

t

Background:Earlywarningscores(EWS)aredesignedtoidentifyearlyclinicaldeteriorationbycombining physiologicand/orlaboratorymeasurestogenerateaquantifiedscore.CurrentEWSleverageonlya smallfractionofElectronicHealthRecord(EHR)content.Theplannedwidespreadimplementationof EHRsbringsthepromiseofabundantdataresourcesforpredictionpurposes.Thethreespecificaimsof ourresearchare:(1)todevelopanEHR-basedautomatedalgorithmtopredicttheneedforPediatric IntensiveCareUnit(PICU)transferinthefirst24hofadmission;(2)toevaluatetheperformanceofthe newalgorithmonaheld-outtestdataset;and(3)tocomparetheeffectivenessofthenewalgorithm’s withthoseoftwopublishedPediatricEarlyWarningScores(PEWS).

Methods:Thecaseswerecomprisedof526encounterswith24-hPediatricIntensiveCareUnit(PICU) transfer.Inadditiontothecases,werandomlyselected6772controlencountersfrom62516inpatient admissionsthatwerenevertransferredtothePICU.Weused29variablesinalogisticregressionand comparedouralgorithmagainsttwopublishedPEWSonaheld-outtestdataset.

Results:Thelogisticregressionalgorithmachieved0.849(95%CI0.753–0.945)sensitivity,0.859(95%CI 0.850–0.868)specificityand0.912(95%CI0.905–0.919)areaunderthecurve(AUC)inthetestset.Our algorithm’sAUCwassignificantlyhigher,by11.8and22.6%inthetestset,thantwopublishedPEWS. Conclusion:Thenovelalgorithmachievedhighersensitivity,specificity,andAUCthanthetwoPEWS reportedintheliterature.

©2014TheAuthors.PublishedbyElsevierIrelandLtd.ThisisanopenaccessarticleundertheCC BY-NC-SAlicense(http://creativecommons.org/licenses/by-nc-sa/3.0/).

1. Introduction

Failuretorescuehospitalizedpatientsfromcomplicationsof diseaseortreatmentis thesourceofsubstantialmorbidity and death.1,2Acardiopulmonaryarrestorcodeoutsidetheintensive careunit(ICU)isaprofoundconsequenceoffailuretorescuethat isassociatedwitha poorprognosisinhospitalizedchildren and adults.3Asclinicalantecedentsarepresentbeforemostcodes,rapid responsesystems (RRS)have beendesigned,tested, and imple-mentedtodetectdeteriorationearlyandtorapidlyintervene.4,5

Correspondingauthorat:CincinnatiChildren’sHospitalMedicalCenter,Division

ofBiomedicalInformatics,3333BurnetAvenue,MLC7024,Cincinnati,OH 45229-3039,USA.

E-mailaddress:[email protected](I.Solti).

OnechallengewithRRSisfailuretoactivateortriggertheafferent limb.6Earlywarningscores(EWS)aredesignedtoaddressthis chal-lengebycombiningphysiologicand/orlaboratorymeasuresintoa quantifiedscorethatcanthenbelinkedtoclear,expectedaction suchasincreasednursingassessmentsor activationofRRS.7–18 ThemostcommonlyusedPediatricEWS(PEWS)combinescoresin 3–7sub-scalestogenerateascorebetween0and26.12,15,16Initial developmentandvalidationofthesescores,whicharedesignedto betabulatedbyhandbynurses,occurredbeforewidespread imple-mentationofelectronichealthrecords(EHR)andthereforeleverage onlyasmallfractionoftheEHRcontent.

The predictive validity of two commonly used PEWS scores12,15,16 has been examined using the outcome of sub-sequent transfer to the PICU. The Bedside PEWS is the most extensively validated to date and includes seven components: heartrate,systolicbloodpressure,capillaryrefilltime,respiratory http://dx.doi.org/10.1016/j.resuscitation.2014.04.009

0300-9572/© 2014 The Authors. Published by Elsevier Ireland Ltd. This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).

(2)

Fig.1.Stepstogeneratecasesandcontrols.

rate, respiratory effort, transcutaneous oxygen saturation, and oxygentherapy.15Ascoreof0,1,2,or4isgeneratedfromeach categoryandaggregatedtoatotalscore,whichhasanareaunder the receiving operating characteristics curve (AUC) of 0.91 in itsderivationcohortand AUCof0.87 and 0.73in twoseparate validationcohorts.12,15,17

TheMonaghan’sPEWSusedinourinstitutioncombines sub-scoresinbehavior,cardiovascular,andrespiratorydomains,with addedpointsfornebulizers¼hourlyorvomitingfollowingsurgery tocreatea0–9overallscore.Whilelessextensivelyvalidated,this scorehadAUCof0.89whenprospectivelyevaluated.16Sincean EWSwillonlysucceedinpreventingdeteriorationwhenitistied toclearaction,eachscorehascutpointswhereassociated algo-rithmscallforspecificactionstobetaken.TheBedsidePEWShas mostcommonlybeenstudiedusingacutpointof8,whilethe Mon-aghan’sPEWScommonlyusesascore>2forincreasednurseand physicianevaluation.15,16

TheplannedwidespreadimplementationofEHRsbringsthe promise of abundant data resources for research purposes via secondaryuseof EHRdata,includingbetterprediction of clini-caldeterioration.19 Asnoted,EHRsandEHR-basedresearchcan transformhealthcaredeliverythroughadvancedclinicaldecision support.20However,manyofthegrandchallengesindeveloping clinicaldecisionsupportarestillbarelyaddressed.21Oneofthese challengesistominelargeclinicaldatasetstodevelopnew clini-caldecisionsupportsystemstoimproveclinicaloutcomes.Inour studyweaimtocontributetoachievingthisexactgoalbyusing thedatacollectedintheEHRduringroutineclinicalcaretoderive andevaluateapredictionalgorithmforPICUtransferforchildren inacutecarewardswithinthefirst24hofadmission.

2. Methods

2.1. Definitionofcasesandcontrols

CincinnatiChildren’sHospitalMedicalCenter’s(CCHMC) Insti-tutionalReviewBoardapprovedtheprotocolforourretrospective study. We extracted EHR data that were generated by clinical providersbetweenJanuary1,2010andAugust31,2012.During thisperiod,CCHMChad71,752admissionstoitsinpatientwards. Ofthese,1438admissionswerelatertransferredfromthegeneral wardstothePICU. Our unitof analysiswastheencounter and not the patient. For each inpatient encounter, we defined the first24hofadmissionasthestudyperiodforthreereasons.First, we attempted to determine which patients might need more attentionandresourcesatthestartoftheirinpatientstay.Second, aspresentedbelow,thePICUtransfersthatoccurredinthisscope covered a largepercentage of total PICU transfers (i.e.,36.6%). Third,thealgorithmdevelopedinthisscopecouldbegeneralized

andtestedinotherscopes.Weidentified526caseand6772control encounters(Fig.1).

Casesandcontrolsweresplitintotwoexperimentaldatasets,a trainingsetwith90%ofcases(including473casesand473controls) andatestsetwith10%ofcases(consistingof53casesand6299 con-trols).The119:1ratioof“no-PICUtransfer”:“24-hPICUtransfer” wasmaintainedinthetestsettopreservethegeneralizabilityof thestudy’sfindings.

2.2. Identificationandselectionofpredictiveclinicalelementsfor themachinelearningalgorithm

We collected over 300,000,000 data points from all 71,752 encountersthatoccurredbetweenJanuary1,2010andAugust31, 2012.Thedatasetincluded7587uniqueclinicalelementsas can-didatepredictors.Throughasix-stepprocess(Fig.2),weselected thepredictiveclinicalelementsfromthisdataset.

Inthefirststep,wesortedtheclinicalelementsbytheir fre-quency.Inthenextstepwefilteredouttheelementsthatwere mea-suredinlessthan20%ofclinicalencountersandretainedthetop 400mostfrequentelements.Inthethirdstep,apediatrichospitalist manuallyreviewedthe400clinicalelementsandgeneratedalist of16candidateclinicalelementswithpredictivepotential.To cre-ateindependentvariables,wecollectedallmeasurementsforthe 16clinicalelementsrecordedintheEHRuntil1hbeforethe trans-fereventforcasesandmeasurementsrecordedinthefirst24hfor

Top 400 most frequent elements extracted from electronic records

Discretization and categorization

Measurement selection based on chi-square test

36 measurements (155 variables) used for machine learning algorithm

16 elements selected by expert

36 measurements extracted

Fig.2. Identificationandselectionprocedureofclinicalelementsformachine learn-ingalgorithm.

(3)

Table1

Predictiveclinicalelements.

Allpredictiveclinical elements

Clinicalelement Type Availability Categorization

Temperature Continuous 924(97.7%) C1 ≤36◦C C2 >36◦Cand <38◦C C3 ≥38◦C Systolicbloodpressure Continuous 912(96.4%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >60and<80 >80and<100 >90and<110 >90and<120 >100and <130 C2 ≥80or≤60 ≥100or≤80 ≥110or≤90 ≥120or≤90 ≥130or ≤100 C3 ≥100or≤50 ≥120or≤70 ≥125or≤75 ≥140or≤80 ≥150or≤85 C4 ≥130or≤45 ≥150or≤60 ≥160or≤65 ≥170or≤70 ≥190or≤75

Oxygensaturation Continuous 836(88.4%) C1

>94

C2

91–94

C3 ≤90

Heartrate Continuous 930(98.3%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >110and <150 >100and <150 >90and<120 >70and<110 >60and<100 C2 ≥150or ≤110 ≥150or ≤100 ≥120or≤90 ≥110or≤70 ≥100or≤60 C3 ≥180or≤90 ≥170or≤80 ≥150or≤70 ≥130or≤60 ≥120or≤50 C4 ≥190or≤80 ≥180or≤70 ≥170or≤60 ≥150or≤50 ≥140or≤40

Respiratoryrate Continuous 929(98.2%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >29and<61 >24and<51 >19and<41 >19and<31 >11and<17 C2 ≥61or≤29 ≥51or≤24 ≥41or≤19 ≥31or≤19 ≥17or≤11 C3 ≥81or≤19 ≥71or≤19 ≥61or≤15 ≥41or≤14 ≥23or≤10 C4 ≥91or≤15 ≥81or≤15 ≥71or≤12 ≥51or≤10 ≥30or≤9 Levelofconsciousness Narrativetext 666(70.4%) C1

Normal C2 Unclear C3 Highrisk Ispatientexperiencing pain? Nominal 920(97.2%) C1 Yes C2 No Cardiac Nominal 881(93.1%) C1 WNL C2 X Respiratory Nominal 883(93.3%) C1 WNL C2 X Neurologic Nominal 875(92.5%) C1 WNL C2 X Urinary Nominal 872(92.2%) C1 WNL C2 X Tissueperfusionand

oxygenation Nominal 858(90.7%) C1 Excellent C2 Compromised C3 Adequate C4 Extremely compro-mised Development appropriate Nominal 813(85.9%) C1 Yes C2 No

AcuityLevel Nominal 674(71.2%) C1

Level1 C2 Level2 C3 Level3 C4 Level4 C5 Level5 Workofbreathing Narrativetext 758(80.1%) C1

Normal

C2

Unclear

C3

Highrisk

Perfusioncaprefill Nominal 398(42.1%) C1

<3s

C2 ≥3s

Levelofconsciousness Workofbreathing

Categorydefinitionsforthe“levelofconsciousness”and“workofbreathing”clinicalelements Normal Alert,active,quiet,arousable,WNL(withinnormallimits),

responds,oriented,awake,answersquestions,follows commands

Easy

Unclear Sleeping,drowsy,emerging,jittery Othercomments

Highrisk Lethargic,sedated,combative,unresponsive,seizure, agitated,seizing

Retracting,mild,nasalflaring,grunting,headbobbing,bobbinghead, moderate,reportsdyspnea,prolongedexpiratoryphase,gasping,severe Note1:Thefirstfiveclinicalelementswerevitalsigndataenteredintotheelectronichealthrecordasnumericvaluesbyanurseornursingassistant.Theremaining10 clinicalelementswereenteredbyanurseatleastevery4haspartofthebodysystemsassessment.Forthisassessmentthenursechosefromalistofpre-existingdescriptors ofthebodysystemorselected“other”andinputfreetext.Note2:Thesecondcolumninfirstsectionoftablenamed“Availability”showedthetotalnumberandpercentage ofencountershavingmeasurementforeachclinicalelementinthetrainingset;Note3:Ciwasusedtoindicatedifferentcategoriesundereachelement;Note4:WNLand

Xdenoted“WithinNormalLimits”and“ExceptionstoWNL”;Note5:For“Acuitylevel”,alocallydevelopedindicator-basedtoolwasusedforassessingpatientacuityand dependencyregardingnurseresources.

controls.Thatis,tobeconservativeinouralgorithmdevelopment, weusedonlyamaximumofthefirst23hours’datapointsforthe cases.Table1presentsthedetailsofthe16clinicalelements,which includefivecontinuous,twonarrative,andninenominalelements.

Inthefourthstep,foreachencounter,weextractedthemost recentmeasurementinthestudytime-window(accountingforthe 1hcutoffthresholdforcases)foreachclinicalelementtodevelop a machine learning algorithm. For the five continuous clinical

(4)

Table2

PerformancecomparisonoflogisticregressionagainstBedsidePEWSandMonaghan’sPEWS.

Sensitivity(95%CI) Specificity(95%CI) PPV(95%CI) AUC(95%CI) Performance Trainingset BedsidePEWS 0.715(0.672–0.755) 0.708(0.665–0.749) 0.710(0.669–0.751) 0.785(0.757–0.819) Monaghan’sPEWS 0.741(0.675–0.800) 0.830(0.775–0.877) 0.797(0.739–0.854) 0.814(0.774–0.850) Logisticregression 0.827(0.793–0.861) 0.873(0.843–0.903) 0.867(0.836–0.898) 0.919(0.899–0.935) TestSet BedsidePEWS 0.736(0.597–0.847) 0.717(0.706–0.728) 0.021(0.015–0.028) 0.816(0.806–0.826) Monaghan’sPEWS 0.684(0.434–0.874) 0.816(0.802–0.829) 0.023(0.010–0.035) 0.744(0.728–0.759) Logisticregression 0.849(0.753–0.945) 0.859(0.850–0.868) 0.048(0.035–0.062) 0.912(0.905–0.919) Sensitivity (improvement, P-value) Specificity (improvement, P-value) PPV (improvement, P-value) AUC (improvement, P-value) Performancecomparison

Trainingset LogisticregressionagainstBedsidePEWS 15.7%,P<0.001 23.3%,P<0.001 22.1%,P<0.001 17.1%,P<0.001 LogisticregressionagainstMonaghan’sPEWS 11.6%,P=0.01 5.2%,P=0.13 8.8%,P=0.03 12.9%,P<0.001 Testset LogisticregressionagainstBedsidePEWS 15.4%,P=0.15 19.8%,P<0.001 125.4%,P<0.001 11.8%,P<0.001 LogisticregressionagainstMonaghan’sPEWS 24.1%,P=0.22 5.3%,P<0.001 114.3%,P=0.02 22.6%,P<0.001

elements,wecreatedfouradditionalmeasurements,includingthe oldest,maximum,minimumandmeaninthestudytime-window. With these four measurements, we intended to represent the dynamicnature ofthepatients’ clinical conditions.In total, we collected36candidatemeasurements.

Inthefifthstep,wecategorizedeachofthese36measurements. Formeasurementsfromtheninenominalelements,weusedthe originallabelstoindicatethecategories.Forthefivecontinuous elements, we performeda categorization step based on defini-tionsofcut-offpointsidentifiedfromthepublishedwork15 and guidedbytwophysicians.Wecategorizedthefinaltwonarrative elementsbasedonkeywordsandsynonymsprovidedbythe physi-cian.Table1showsthecategorizationofthe16clinicalelements.In ourdataset,thestudyvariablesrarelyhadmissingvalues. Conse-quently,insteadofattemptingtoimputevaluesfortheoccasionally missingclinicalelements,weaddedanewcategory“NotAvailable” (“N/A”)foreachclinicalelementandhandledthe“N/A”category asothernaturallyoccurringcategoriesofthedata.22The catego-rizationofthese36measurementsresultedin155dichotomous variables.

Finally,weusedChi-squarecalculationstotestthesignificance ofeachmeasurementinthetrainingset.All36measurementswere selectedandusedtodevelopamachinelearningalgorithmat0.05

P-value.

2.3. Algorithmdevelopment

WeselectedlogisticregressionastheMachineLearning(ML) algorithmandusedWeka3.6.8asourexperimentalplatform.We basedthechoiceonlogisticregression’swideusageinclinical deci-sionsystemsandtherelativeeaseofinterpretingitsoutput.Inthis study,insteadofcalculatingaPEWSforeachcategory,weused binary-valuedvariablestoindicatethepresence/absence of cat-egories.Weapplieda forwardstepwiseapproachwithAkaike’s InformationCriterion(AIC)toselectthebestmodel.23

2.4. Evaluationmetrics

Tomeasurethealgorithm’spredictiveperformance,we calcu-latedthesensitivity,specificity,positivepredictivevalue(PPV)and AUC.24Apredictedpositivewasanycombinationofpredictor vari-ablesthathadanoutputof>0.5fromthelogisticregressionmodel.

2.5. Experimentalcomparison

InordertoevaluatethePICUtransferpredictionmethods,we implementedastandard10-foldcrossvalidationstep.Toassessthe performanceofthemodelsonpractical,realworlddata,we evalu-atedthemodelsonthetestingset,whichrepresentedtherealworld ratioof24-htransferandnon-transferencounters.For compari-son,wealsoevaluatedParshuram’sBedsidePEWSandMonaghan’s PEWS.

WhenevaluatingParshuram’sBedsidePEWS,wefollowed Rob-son’swork17andincludedonlyfiveclinicalelementsavailablein ourdatasetinstead oforiginal seven (heartrate,systolic blood pressure,capillaryrefilltime,respiratoryrate,respiratoryeffort, transcutaneousoxygensaturation,andoxygentherapy).Allofthe elementsexceptrespiratoryeffortandoxygentherapywerealso usedinourmachinelearningalgorithm.

Monaghan’sPEWSiscalculatedfornearlyalladmittedpatients atCCHMC.Forthecases,weextractedthehighestPEWSvalues recordedbeforethetransferevent;forcontrols,weextractedthe highestPEWSvaluesrecordedinthefirst24hofhospitalization. Fourhundredtwentyfiveencountersinthetrainingset (consist-ingof201casesand224controls)includedanon-missingvalue forMonaghan’sPEWSand3080encountersinthetestset (includ-ing19casesand3061controls)hadMonaghan’sPEWS.Forafair comparison,thepredictionresultsforMonaghan’sPEWSwere cal-culatedonlyontheencountersthathadanon-missingMonaghan’s PEWSvalue.

2.6. Timestampexperiment

In order to assess the robustness of our logistic regression model,weevaluateditusingavailableclinicalmeasurementsat differentpointsofthefirst24-hafteradmission.Specifically,we ranthemodel24timesusingcumulativeclinicalmeasurementsin eachhourofthefirst24-hafteradmissiononthetrainingset(i.e., 946samples).

3. Results

3.1. Predictionresultsusingbaselines

Table2showstheresultsofBedsidePEWSatscore7and Mon-aghan’sPEWSatscore 2.These cut-pointsweredeterminedby

(5)

Table3

Variablesusedinthefinallogisticregressionmodel.

Variable Oddsratio P-value

Oldesttemperature≤36 0.78 0.222

Mostrecenttemperature≥38b 1.52 <0.001

Minimumtemperature≤36b 0.41 <0.001

OldestsystolicbloodpressureC4a 72.91 0.207

MostrecentsystolicbloodpressureC1a 0.67 0.070

MaximumsystolicbloodpressureC1a,b 1.57 <0.001

MinimumsystolicbloodpressureC4a,b 25.47 <0.001

MeansystolicbloodpressureC1a,b 0.47 <0.001

Mostrecentoxygensaturation>94b 0.43 <0.001

Maximumoxygensaturation>94 8.42 0.105 Minimumoxygensaturation≤90b 1.80 0.021

MostrecentheartrateC1a 0.61 <0.001

MostrecentheartrateC3a,b 3.07 <0.001

MostrecentheartrateC4a,b 22.78 0.031

MaximumheartrateC3a,b 0.58 <0.001

MinimumheartrateC4a,b 24.22 0.002

MeanheartrateC4a,b 2.68 0.039

MinimumrespiratoryrateC4a,b 3.10 <0.001

MeanrespiratoryrateC4a,b 4.55 <0.001

LevelofconsciousnessHighriskb 7.16 0.005

Isthepatientexperiencingpain?Yesb 1.97 <0.001

Cardiacwithinnormallimitsb 0.28 <0.001

Neurologicwithinnormallimitsb 0.26 <0.001

Tissueperfusionandoxygenationadequateb 89.04 <0.001

Tissueperfusionandoxygenationextremely compromised

28.65 0.246

AcuitylevelLevel3 1.52 0.611 AcuitylevelLevel4b 1.26 <0.001

WorkofbreathingHighriskb 8.60 0.049

Perfusioncaprefill<3sb 2.10 0.031

aSeeTable1.

bIndicatesvariablessignificantlyassociatedwithPICUtransfer.

logisticregressionwiththreshold0.5toguaranteeafair compari-son.

3.2. Predictionresultsusinglogisticregression

Weincluded29variables associatedwith13ofthe16 clini-calelements,inthefinalmodelbytheforwardstepwiseapproach (Table3).Ofthe29variables,23weresignificantlyassociatedwith PICUtransfer(P<0.05).TheresultsarepresentedinTable2.

3.3. Resultscomparison

Table2 presentstheperformance ofthedifferentpredictive methods. In comparison tothe BedsidePEWS and Monaghan’s PEWSinthetestset,ourmodel’sAUCandspecificitywere statis-ticallysignificantlyimproved.Ourmodel’ssensitivitynumerically improvedby15.4and24.1%inthetestset,buttheseimprovements werenotstatisticallysignificant.Wehypothesizethattherelatively smallnumberofcasesinthetestsethinderedthedetectionof sta-tisticalsignificance.ThePPVofourmodelwas8.1%,overtwicethe valueofthatfoundinourdatasetofthetwoPEWS.Asaprevious study25described,thebigdecreaseofPPVinthetestsetwascaused bythedominationofcontrols.

3.4. Timestampexperiment

Fig.3 (top)showstheperformanceofourlogisticregression modelondifferentpoints(from1to24h)afteradmission.Wecan seethatAUCincreasessignificantlyinthefirst6handreachesthe plateauafter7h,whichmeansthatourmodelcanperformagood predictionusingtheclinicalmeasurementsoffirst7hafter admis-sion.ThebottomofFig.3displaysthePICUtransferdistribution forthe473casesinthetrainingset.TheaverageandmedianPICU

transfertimesare11.7and11hrespectively,and79%admissions aretransferredtoPICUafter7h.

4. Discussion

WeusedexistingclinicaldataintheEHRandmachine learn-ingtodevelopandvalidateapredictionalgorithmforPICUtransfer ofhospitalizedpatientsinthefirst24h.Throughaprocessusing expertclinicianopinion,categorizationandmachinelearning,we builtamodelconsistingof29variablesforpredictingPICU trans-fer.Ouralgorithmachieveda0.912(95%CI0.905–0.919)AUCin thetestset.Thisresultwasstatisticallysignificantlyhigherthan applicationoftwoexistingPEWSinourtestdataset.Unlike previ-ousPEWSwhichusedanumberofsub-scorestocreateanoverall scorewithvariouscut-points,weusedlogisticregressionsothat theoutputwasapercentagelikelihoodofPICUtransfer.Withthis approachwewereabletoachieve0.849sensitivityand0.859 speci-ficity.

Our predictionalgorithmperformedsignificantly betterthan twopublishedPEWSthatwerebasedondynamicclinicalelements, suchasvitalsigns.Onereasonforthisfindingisthatweused29 variablesfrom16clinicalelementsascomparedto3–7variablesin PEWSwithwhichwecompared.Ourvariablesincludedvitalsigns, whichbothotherscoresemploy.Wealsoincludedlevelof con-sciousness,painassessments,andworkofbreathingthateachmet twoimportantcriteria:(1)facevalidityinassociationwith wors-eningpatientstatusthatmightprecedePICUtransfer,and(2)were obtainedbyournursesinthecourseoftheirusualclinical assess-ments.Withtheexceptionofonevariable(capillaryrefill)eachof ourvariableswasavailablein>70%ofencounters,withthe major-itybeingpresentin>90%oftheencounters.Atourcenter,these datadidnotrequireanextrareportingstructure,additional clini-calassessments,orresearchnurses.EachwaspresentintheEHR forclinicalcare,butwebelieveeachwaspoorlyleveragedinthe courseofcareinidentifyingandpredictingpatientrisk.

Thetimestampexperimentshowedthatclinicalmeasurements takeninthefirst7hweresufficientforourpredictions.Wefound arelativelylowPPVastransfertothePICUinthefirst24hisan uncommonevent.Aswebelievethecostofafalsenegativeis con-siderablyhigherthanafalsepositive,relativelylowPPVmaybea tolerabletrade-off.

Our prediction algorithm can be integrated into our rapid response system to identify patients at elevated risk for PICU transfer. Current mechanisms to trigger or activate the rapid response systemhave limitations.6 Earlywarningscores repre-sentaquantitativeandthemostextensivelyvalidatedmechanism toactivate,butotheractivationmechanismsincludingsubjective nurse,family,orphysicianconcernsarealsoused.These subjec-tivemechanismshavelimitationsregardingvalidityandreliability, andtodate,PEWSarelimitedbythemodestnumberofelements from which a score canbe generated. Our methodof machine learning/logisticregressionallowsanoutputofthepercentage like-lihood of PICU transferto becalculated for analmost limitless numberofclinicalelements.Whilethebestwaytousethis out-putwillneedtobedeterminedprospectively,webelievearapid responsesystemcouldhavemultiplethresholdsbasedonthe per-centage likelihood. For example,ifthe likelihoodwere >50%of PICUtransferwithin24h,this maypromptanautomaticcallof themedicalemergencyteamformultidisciplinaryassessment.A scoreof>25%mighttriggerabedsideevaluationbytheprimary medicalteamandarecalculationofpredictionwithin2h.An out-putof>95%mightputinmotion, throughclinicaldecisionaids, aprocessthatmakesimmediatePICUtransferthedefaultaction andaphysicianwouldneedtotakeactiveactiontoavoidsucha result.

(6)

0 5 10 15 20 25 30 35 40 1h 2h 3h 4h 5h 6h 7h 8h 9h 10h 11h 12h 13h 14h 15h 16h 17h 18h 19h 20h 21h 22h 23h 24h

Number of PICU transfers

PICU transfer me aer admission (in hours)

PICU transfer me distribuon

0 0.2 0.4 0.6 0.8 1 1h 2h 3h 4h 5h 6h 7h 8h 9h 10h 11h 12h 13h 14h 15h 16h 17h 18h 19h 20h 21h 22h 23h 24h Performance

Time aer admission (in hours)

PICU transfer classificaon performance over me

Sensivity Specificity AUC PPV

Fig.3.Timestampexperimentresults.

4.1. Limitations

Missingdatawasamajorcauseofincorrectpredictionandwe needtodevelopaproperimputationmethodology.Inthecurrent study,weusedaverysimplemethodtoaddressthechallengesof missingdata.Wewillimplementmorecompleximputation meth-odsinourfuturework.Imputationwillbeespeciallyimportant becauseaswewilladdmorevariablestothemodel,theadditional variableswillincludemissingvaluesmorefrequently.

TransfertothePICU,whileaclinicallyimportantevent,does havesomelimitationsasitmaybedriveninpartbynon-patient factorssuchasPICUbedavailability.Infuturestudies,insteadof focusingexclusivelyontheneedforPICUtransferasadependent variable,wewillpredictthedeteriorationofhospitalizedchildren’s clinicalstatusandwillincludeothervariables(e.g.callingamedical emergencyteam,nurseorphysicianidentificationashighrisk)as dependentvariables.

Inthisstudy,alltheclinicalelementshadamuchhigher per-centageofavailabilitythaninpreviousstudies.Thehighpercentage of available clinical measures provided the bases for applying machinelearningtodetect24-hPICUtransfer.Thismayreduce thegeneralizabilityofourfindingsincenterswithmorefrequently missingdata.Infuturestudies,wewillpartnerwithotheracademic andnon-academicchildren’shospitalstovalidateouralgorithm ina diversesetof institutionsona prospectivesetof patients. Althoughour algorithm was created in thefirst 24h, applying itafter24hisquitestraightforward.Similarlytothetimestamp experiment,weneedtoregeneratethevalueofeachvariableevery fewhours(e.g.,1,2,4)andusethemodeltocalculatethe probabil-ity.However,theeffectivenessofthisapproachneedstobeverified onprospectivedata.Thedeteriorationofhospitalizedchildrenisa complicatedphenomenonwithavarietyofclinicalantecedentsand causes.AprolongedseizurethatresultsinPICUtransferhasa differ-entpathophysiologyandlikelydifferentclinicalantecedentsthan worseningrespiratorydistressinpneumonia.Inourcurrentstudy, weonlytested16clinicalelementsforprediction.Weincluded

just one static element (presence of developmentaldelay) and 15dynamic elements. Whilewe believedynamic elementswill likelybemoreusefulinpredictingPICUtransferintheshortterm, wesuspectfurtheradditionofstaticelementssuchasaneedfor medicaltechnologywillimprovepredictionaccuracy.Wedidnot includelabtestresults,andanearlierworkhasexploredthe pre-dictivepotentialof labtests andmedications for patientstatus deterioration.26Infutureworks,wewillincludemanymoredata pointsforthepatientssuchaslabtests,medications,diagnostic history,socialhistory,andfamilyhistory.

5. Conclusion

Onaretrospectivedatasetwesuccessfullydevelopedalogistic regressionalgorithmthatutilizedtheEHRcontenttopredictPICU transferforpediatricpatients’firstdayofhospitalization.Thenovel algorithmachievedhighersensitivity,specificityandAUCthantwo ofthecurrentPEWSreportedintheliterature.

Authorcontributions

IS,PBandHZdesignedthestudy.HZrantheexperiments, ana-lyzedtheresults,createdthetablesandfigures,andcontributed tothedraftandfinalmanuscript.QL,TL,DWandYNcontributed ideasforalgorithmdevelopment.TheprojectwassupervisedbyIS. ThefirstdraftofthemanuscriptwaspreparedbyHZ,IS,andPB withadditionalcontributionsbyallauthors.Allauthorsreadand approvedthefinalmanuscript.

Funding

TheauthorsweresupportedbyinternalfundsfromCincinnati Children’sHospitalMedicalCenter.HZ,QL,TL,YN,andISwere par-tiallysupportedbygrants5R00LM010227-05,1R21HD072883-01, and1U01HG006828-01.

(7)

Conflictofintereststatement

Noconflictofinterestforanyoftheauthors. References

1.GhaferiAA,BirkmeyerJD,DimickJB.Variationinhospitalmortalityassociated withinpatientsurgery.NEnglJMed2009;361:1368–75.

2.SilberJH,KaestnerR,Even-ShoshanO,WangY,BresslerLJ.Aggressivetreatment styleandsurgicaloutcomes.HealthServRes2010;45:1872–92.

3.NadkarniVM,LarkinGL,PeberdyMA,etal.Firstdocumentedrhythmandclinical outcomefromin-hospitalcardiacarrestamongchildrenandadults.JAmMed Assoc2006;295:50–7.

4.ScheinRM,HazdayN,PenaM,RubenBH,SprungCL.Clinicalantecedentsto in-hospitalcardiopulmonaryarrest.Chest1990;98:1388–92.

5.Jones DA, DeVita MA, Bellomo R. Rapid-response teams. N Engl J Med 2011;365:139–46.

6.DeVitaMA,SmithGB,AdamSK,etal.“Identifyingthehospitalisedpatientin crisis”–aconsensusconferenceontheafferentlimbofrapidresponsesystems. Resuscitation2010;81:375–82.

7.Bradman K, Borland M, Pascoe E. Predicting patient disposition in a paediatric emergency department. J Paediatr Child Health 2012,

http://dx.doi.org/10.1111/jpc.12011.

8.Petrillo-AlbaranoT,StockwellJ,LeongT,HebbarK.Theuseofamodified pedi-atricearlywarningscoretoassessstabilityofpediatricpatientsduringtransport. PediatrEmergCare2012;28:878–82.

9.SweneyJS,PossWB,GrissomCK,KeenanHT.Comparisonofseverityofillness scorestophysicianclinicaljudgmentforpotentialuseinpediatriccriticalcare triage.DisasterMedPublicHealthPrep2012;6:126–30.

10.ParshuramCS,BaylissA,ReimerJ,MiddaughK,BlanchardN.Implementingthe BedsidePaediatricEarlyWarningSysteminacommunityhospital:aprospective observationalstudy.PaediatrChildHealth2011;16:e18–22.

11.SkaletzkySM,RaszynskiA,TotapallyBR.Validationofamodifiedpediatricearly warningsystemscore:aretrospectivecase-controlstudy.ClinPediatr(Phila) 2012;51:431–5.

12.ParshuramCS,DuncanHP,JoffeAR,etal.Multicentrevalidationofthe bed-sidepaediatricearlywarningsystemscore:aseverityofillnessscoretodetect evolvingcriticalillnessinhospitalisedchildren.CritCare2011;15:R184.

13.AkreM,FinkelsteinM,EricksonM,LiuM,VanderbiltL,BillmanG.Sensitivity ofthepediatricearlywarningscoretoidentifypatientdeterioration.Pediatrics 2010;125:e763–9.

14.DemmelKM,WilliamsL,FleschL.Implementationofthepediatricearlywarning scoringsystemonapediatrichematology/oncologyunit.JPediatrOncolNurs 2010;27:229–40.

15.Parshuram CS, Hutchison J, Middaugh K. Development and initial vali-dation of the Bedside Paediatric Early Warning System score. Crit Care 2009;13:R135.

16.TuckerKM,BrewerTL,BakerRB,DemerittB,VossmeyerMT.Prospective evalu-ationofapediatricinpatientearlywarningscoringsystem.JSpecPediatrNurs 2009;14:79–85.

17.RobsonMA,CooperCL,MedicusLA,QuinteroMJ,ZunigaSA.Comparisonof threeacutecarepediatricearlywarningscoringtools.JSpecPediatrNurs2012,

http://dx.doi.org/10.1016/j.pedn.2012.12.002.

18.EdwardsED,PowellCV,MasonBW,OliverA.Prospectivecohortstudytotest thepredictabilityoftheCardiffandValepaediatricearlywarningsystem.Arch DisChild2009;94:602–6.

19.DasS,EisenbergLD,HouseJW,etal.Meaningfuluseofelectronichealthrecords inotolaryngology:recommendationsfromtheAmericanAcademyof Otolaryn-gology–HeadandNeckSurgeryMedicalInformaticsCommittee.Otolaryngol HeadNeckSurg2011;144:135–41.

20.JhaAK.Thepromiseofelectronicrecordsaroundthecornerordowntheroad? JAmMedAssoc2011;306:880–1.

21.SittigDF,WrightA,OsheroffJA,etal.Grandchallengesinclinicaldecision sup-port.JBiomedInform2008;41:387–92.

22.Grzymala-BusseW,HuM.Acomparisonofseveral approachestomissing attributevaluesindatamining.In:Proceedingsofthesecondinternational con-ferenceonroughsetsandcurrenttrendsincomputingRSCTC’2000.2000.p. 340–7.

23.VenablesWN,RipleyBD.ModernappliedstatisticsstatisticswithS-PLUS.4hed. NewYork:Springer;2002.

24.MetzCE.BasicprinciplesofORCanalysis.SeminNuclMed1978;8:283–98.

25.LalkhenG,McCluskeyA.Clinicaltests:sensitivityandspecificity.CritCarePain 2008;8:221–3.

26.BonafideCP,HolmesJH,NadkarniVM,LinR,LandisJR,KerenR.Development ofascoretopredictclinicaldeteriorationinhospitalizedchildren.JHospMed 2012;7:345–9.

Figure

Fig. 2. Identification and selection procedure of clinical elements for machine learn- learn-ing algorithm.
Fig. 3. Timestamp experiment results.

References

Related documents

Studies were conducted to evaluate the influence of trap colour on the captures of honeybees, Apis mellifera L., codling moth, Cydia pomonella L., and non target muscoid flies

The values of the afferent weights provide appropriate lateral weights since the objective of pre-integration lateral inhibition is to allow an active node with a strong synaptic

Transforming all samples to the same reference space allows normalizing the information from brains imaged under different conditions or image modalities, and anatomical regions

Transmission quality control charts developed in this investigation are very useful in synthesizing general motion laws for single roller- to- cam double- crank mechanisms. The

Collaborative , Durham NH October 2006. Using fluorescence to measure carbon quality at watershed scales and implications for nitrogen retention in a rapidly suburbanizing

Strictly within a biological context, the natural origin of morality is central to teaching the basics of evolution as a whole.. The Challenges of Biological Determinism and

It is not only the size of the sugar value chain but also the limited alternative economic opportunities that make reducing NCDs through SSB taxes contradict the goals of

Thus, this study was designed to investigate the associ- ation of components of MetSyn with the presence of polymorphisms in VDR gene [VDR 1544410 A &gt; G (BsmI); VDR 2228570 C &gt;