Developing and evaluating a machine learning based algorithm to predict the need of pediatric intensive care unit transfer for newly hospitalized children

(1)

ContentslistsavailableatScienceDirect

Resuscitation

jo u r n al hom ep age:w w w . e l s e v i e r . c o m / l o c a t e / r e s u s c i t a t i o n

Developing

and

evaluating

a

machine

learning

based

algorithm

to

predict

the

need

of

pediatric

intensive

care

unit

transfer

for

newly

hospitalized

children

Haijun

Zhai

a

_,

_Patrick

_Brady

b,d

_,

_Qi

_Li

a

_,

_Todd

_Lingren

a

_,

_Yizhao

_Ni

a

_,

Derek

S. Wheeler

c

_,

_Imre

_Solti

a,d,∗

a_Division_of_Biomedical_Informatics,_Cincinnati_Children’s_Hospital_Medical_Center,_Cincinnati,_OH,_USA b_Division_of_Hospital_Medicine,_Cincinnati_Children’s_Hospital_Medical_Center,_Cincinnati,_OH,_USA c_Division_of_Critical_Care_Medicine,_Cincinnati_Children’s_Hospital_Medical_Center,_Cincinnati,_OH,_USA

d_James_M._Anderson_Center_for_Health_Systems_Excellence,_Cincinnati_Children’s_Hospital_Medical_Center,_Cincinnati,_OH,_USA

a

r

t

i

c

l

e

i

n

f

o

Articlehistory:

Received27September2013

Receivedinrevisedform20February2014 Accepted8April2014

Keywords:

Clinicalstatusdeterioration Clinicalcare PICU PEWS Machinelearning EHR

a

b

s

t

r

a

c

t

Background:Earlywarningscores(EWS)aredesignedtoidentifyearlyclinicaldeteriorationbycombining physiologicand/orlaboratorymeasurestogenerateaquantifiedscore.CurrentEWSleverageonlya smallfractionofElectronicHealthRecord(EHR)content.Theplannedwidespreadimplementationof EHRsbringsthepromiseofabundantdataresourcesforpredictionpurposes.Thethreespecificaimsof ourresearchare:(1)todevelopanEHR-basedautomatedalgorithmtopredicttheneedforPediatric IntensiveCareUnit(PICU)transferinthefirst24hofadmission;(2)toevaluatetheperformanceofthe newalgorithmonaheld-outtestdataset;and(3)tocomparetheeffectivenessofthenewalgorithm’s withthoseoftwopublishedPediatricEarlyWarningScores(PEWS).

Methods:Thecaseswerecomprisedof526encounterswith24-hPediatricIntensiveCareUnit(PICU) transfer.Inadditiontothecases,werandomlyselected6772controlencountersfrom62516inpatient admissionsthatwerenevertransferredtothePICU.Weused29variablesinalogisticregressionand comparedouralgorithmagainsttwopublishedPEWSonaheld-outtestdataset.

Results:Thelogisticregressionalgorithmachieved0.849(95%CI0.753–0.945)sensitivity,0.859(95%CI 0.850–0.868)specificityand0.912(95%CI0.905–0.919)areaunderthecurve(AUC)inthetestset.Our algorithm’sAUCwassignificantlyhigher,by11.8and22.6%inthetestset,thantwopublishedPEWS. Conclusion:Thenovelalgorithmachievedhighersensitivity,specificity,andAUCthanthetwoPEWS reportedintheliterature.

1. Introduction

Failuretorescuehospitalizedpatientsfromcomplicationsof diseaseortreatmentis thesourceofsubstantialmorbidity and death.1,2_A_{cardiopulmonary}_arrest_or_code_outside_the_intensive careunit(ICU)isaprofoundconsequenceoffailuretorescuethat isassociatedwitha poorprognosisinhospitalizedchildren and adults.3_As_clinical_antecedents_are_present_before_most_codes,_rapid responsesystems (RRS)have beendesigned,tested, and imple-mentedtodetectdeteriorationearlyandtorapidlyintervene.4,5

∗_{Corresponding}_author_at:_Cincinnati_Children’s_Hospital_Medical_Center,_Division

ofBiomedicalInformatics,3333BurnetAvenue,MLC7024,Cincinnati,OH 45229-3039,USA.

E-mailaddress:[email protected](I.Solti).

OnechallengewithRRSisfailuretoactivateortriggertheafferent limb.6_Early_warning_scores_(EWS)_are_designed_to_address_this chal-lengebycombiningphysiologicand/orlaboratorymeasuresintoa quantiﬁedscorethatcanthenbelinkedtoclear,expectedaction suchasincreasednursingassessmentsor activationofRRS.7–18 ThemostcommonlyusedPediatricEWS(PEWS)combinescoresin 3–7sub-scalestogenerateascorebetween0and26.12,15,16_Initial developmentandvalidationofthesescores,whicharedesignedto betabulatedbyhandbynurses,occurredbeforewidespread imple-mentationofelectronichealthrecords(EHR)andthereforeleverage onlyasmallfractionoftheEHRcontent.

The predictive validity of two commonly used PEWS scores12,15,16 _has _been _examined _using _the _outcome _of sub-sequent transfer to the PICU. The Bedside PEWS is the most extensively validated to date and includes seven components: heartrate,systolicbloodpressure,capillaryreﬁlltime,respiratory http://dx.doi.org/10.1016/j.resuscitation.2014.04.009

(2)

Fig.1.Stepstogeneratecasesandcontrols.

rate, respiratory effort, transcutaneous oxygen saturation, and oxygentherapy.15_A_score_of_0,_1,_2,_or₄_is_generated_from_each categoryandaggregatedtoatotalscore,whichhasanareaunder the receiving operating characteristics curve (AUC) of 0.91 in itsderivationcohortand AUCof0.87 and 0.73in twoseparate validationcohorts.12,15,17

TheMonaghan’sPEWSusedinourinstitutioncombines sub-scoresinbehavior,cardiovascular,andrespiratorydomains,with addedpointsfornebulizers¼hourlyorvomitingfollowingsurgery tocreatea0–9overallscore.Whilelessextensivelyvalidated,this scorehadAUCof0.89whenprospectivelyevaluated.16_Since_an EWSwillonlysucceedinpreventingdeteriorationwhenitistied toclearaction,eachscorehascutpointswhereassociated algo-rithmscallforspeciﬁcactionstobetaken.TheBedsidePEWShas mostcommonlybeenstudiedusingacutpointof8,whilethe Mon-aghan’sPEWScommonlyusesascore>2forincreasednurseand physicianevaluation.15,16

TheplannedwidespreadimplementationofEHRsbringsthe promise of abundant data resources for research purposes via secondaryuseof EHRdata,includingbetterprediction of clini-caldeterioration.19 _As_noted,_EHRs_and_EHR-based_research_can transformhealthcaredeliverythroughadvancedclinicaldecision support.20_However,_many_of_the_grand_challenges_in_developing clinicaldecisionsupportarestillbarelyaddressed.21_One_of_these challengesistominelargeclinicaldatasetstodevelopnew clini-caldecisionsupportsystemstoimproveclinicaloutcomes.Inour studyweaimtocontributetoachievingthisexactgoalbyusing thedatacollectedintheEHRduringroutineclinicalcaretoderive andevaluateapredictionalgorithmforPICUtransferforchildren inacutecarewardswithintheﬁrst24hofadmission.

2. Methods

2.1. Deﬁnitionofcasesandcontrols

CincinnatiChildren’sHospitalMedicalCenter’s(CCHMC) Insti-tutionalReviewBoardapprovedtheprotocolforourretrospective study. We extracted EHR data that were generated by clinical providersbetweenJanuary1,2010andAugust31,2012.During thisperiod,CCHMChad71,752admissionstoitsinpatientwards. Ofthese,1438admissionswerelatertransferredfromthegeneral wardstothePICU. Our unitof analysiswastheencounter and not the patient. For each inpatient encounter, we deﬁned the ﬁrst24hofadmissionasthestudyperiodforthreereasons.First, we attempted to determine which patients might need more attentionandresourcesatthestartoftheirinpatientstay.Second, aspresentedbelow,thePICUtransfersthatoccurredinthisscope covered a largepercentage of total PICU transfers (i.e.,36.6%). Third,thealgorithmdevelopedinthisscopecouldbegeneralized

andtestedinotherscopes.Weidentiﬁed526caseand6772control encounters(Fig.1).

Casesandcontrolsweresplitintotwoexperimentaldatasets,a trainingsetwith90%ofcases(including473casesand473controls) andatestsetwith10%ofcases(consistingof53casesand6299 con-trols).The119:1ratioof“no-PICUtransfer”:“24-hPICUtransfer” wasmaintainedinthetestsettopreservethegeneralizabilityof thestudy’sﬁndings.

2.2. Identiﬁcationandselectionofpredictiveclinicalelementsfor themachinelearningalgorithm

We collected over 300,000,000 data points from all 71,752 encountersthatoccurredbetweenJanuary1,2010andAugust31, 2012.Thedatasetincluded7587uniqueclinicalelementsas can-didatepredictors.Throughasix-stepprocess(Fig.2),weselected thepredictiveclinicalelementsfromthisdataset.

Inthefirststep,wesortedtheclinicalelementsbytheir fre-quency.Inthenextstepwefilteredouttheelementsthatwere mea-suredinlessthan20%ofclinicalencountersandretainedthetop 400mostfrequentelements.Inthethirdstep,apediatrichospitalist manuallyreviewedthe400clinicalelementsandgeneratedalist of16candidateclinicalelementswithpredictivepotential.To cre-ateindependentvariables,wecollectedallmeasurementsforthe 16clinicalelementsrecordedintheEHRuntil1hbeforethe trans-fereventforcasesandmeasurementsrecordedinthefirst24hfor

Top 400 most frequent elements extracted from electronic records

Discretization and categorization

Measurement selection based on chi-square test

36 measurements (155 variables) used for machine learning algorithm

16 elements selected by expert

36 measurements extracted

Fig.2. Identiﬁcationandselectionprocedureofclinicalelementsformachine learn-ingalgorithm.

(3)

Table1

Predictiveclinicalelements.

Allpredictiveclinical elements

Clinicalelement Type Availability Categorization

Temperature Continuous 924(97.7%) C1 ≤36◦_C C2 >36◦_C_and <38◦_C C3 ≥38◦_C Systolicbloodpressure Continuous 912(96.4%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >60and<80 >80and<100 >90and<110 >90and<120 >100and <130 C2 ≥80or≤60 ≥100or≤80 ≥110or≤90 ≥120or≤90 ≥130or ≤100 C3 ≥100or≤50 ≥120or≤70 ≥125or≤75 ≥140or≤80 ≥150or≤85 C4 ≥130or≤45 ≥150or≤60 ≥160or≤65 ≥170or≤70 ≥190or≤75

Oxygensaturation Continuous 836(88.4%) C1

>94

C2

91–94

C3 ≤90

Heartrate Continuous 930(98.3%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >110and <150 >100and <150 >90and<120 >70and<110 >60and<100 C2 ≥150or ≤110 ≥150or ≤100 ≥120or≤90 ≥110or≤70 ≥100or≤60 C3 ≥180or≤90 ≥170or≤80 ≥150or≤70 ≥130or≤60 ≥120or≤50 C4 ≥190or≤80 ≥180or≤70 ≥170or≤60 ≥150or≤50 ≥140or≤40

Respiratoryrate Continuous 929(98.2%) Agegroup

0–3months 3–12months 1–4years 4–12years >12years C1 >29and<61 >24and<51 >19and<41 >19and<31 >11and<17 C2 ≥61or≤29 ≥51or≤24 ≥41or≤19 ≥31or≤19 ≥17or≤11 C3 ≥81or≤19 ≥71or≤19 ≥61or≤15 ≥41or≤14 ≥23or≤10 C4 ≥91or≤15 ≥81or≤15 ≥71or≤12 ≥51or≤10 ≥30or≤9 Levelofconsciousness Narrativetext 666(70.4%) C1

Normal C2 Unclear C3 Highrisk Ispatientexperiencing pain? Nominal 920(97.2%) C1 Yes C2 No Cardiac Nominal 881(93.1%) C1 WNL C2 X Respiratory Nominal 883(93.3%) C1 WNL C2 X Neurologic Nominal 875(92.5%) C1 WNL C2 X Urinary Nominal 872(92.2%) C1 WNL C2 X Tissueperfusionand

oxygenation Nominal 858(90.7%) C1 Excellent C2 Compromised C3 Adequate C4 Extremely compro-mised Development appropriate Nominal 813(85.9%) C1 Yes C2 No

AcuityLevel Nominal 674(71.2%) C1

Level1 C2 Level2 C3 Level3 C4 Level4 C5 Level5 Workofbreathing Narrativetext 758(80.1%) C1

Normal

C2

Unclear

C3

Highrisk

Perfusioncapreﬁll Nominal 398(42.1%) C1

<3s

C2 ≥3s

Levelofconsciousness Workofbreathing

Categorydeﬁnitionsforthe“levelofconsciousness”and“workofbreathing”clinicalelements Normal Alert,active,quiet,arousable,WNL(withinnormallimits),

responds,oriented,awake,answersquestions,follows commands

Easy

Unclear Sleeping,drowsy,emerging,jittery Othercomments

Highrisk Lethargic,sedated,combative,unresponsive,seizure, agitated,seizing

Retracting,mild,nasalflaring,grunting,headbobbing,bobbinghead, moderate,reportsdyspnea,prolongedexpiratoryphase,gasping,severe Note1:Thefirstfiveclinicalelementswerevitalsigndataenteredintotheelectronichealthrecordasnumericvaluesbyanurseornursingassistant.Theremaining10 clinicalelementswereenteredbyanurseatleastevery4haspartofthebodysystemsassessment.Forthisassessmentthenursechosefromalistofpre-existingdescriptors ofthebodysystemorselected“other”andinputfreetext.Note2:Thesecondcolumninfirstsectionoftablenamed“Availability”showedthetotalnumberandpercentage ofencountershavingmeasurementforeachclinicalelementinthetrainingset;Note3:Ciwasusedtoindicatedifferentcategoriesundereachelement;Note4:WNLand

Xdenoted“WithinNormalLimits”and“ExceptionstoWNL”;Note5:For“Acuitylevel”,alocallydevelopedindicator-basedtoolwasusedforassessingpatientacuityand dependencyregardingnurseresources.

controls.Thatis,tobeconservativeinouralgorithmdevelopment, weusedonlyamaximumoftheﬁrst23hours’datapointsforthe cases.Table1presentsthedetailsofthe16clinicalelements,which includeﬁvecontinuous,twonarrative,andninenominalelements.

Inthefourthstep,foreachencounter,weextractedthemost recentmeasurementinthestudytime-window(accountingforthe 1hcutoffthresholdforcases)foreachclinicalelementtodevelop a machine learning algorithm. For the ﬁve continuous clinical

(4)

Table2

PerformancecomparisonoflogisticregressionagainstBedsidePEWSandMonaghan’sPEWS.

Sensitivity(95%CI) Speciﬁcity(95%CI) PPV(95%CI) AUC(95%CI) Performance Trainingset BedsidePEWS 0.715(0.672–0.755) 0.708(0.665–0.749) 0.710(0.669–0.751) 0.785(0.757–0.819) Monaghan’sPEWS 0.741(0.675–0.800) 0.830(0.775–0.877) 0.797(0.739–0.854) 0.814(0.774–0.850) Logisticregression 0.827(0.793–0.861) 0.873(0.843–0.903) 0.867(0.836–0.898) 0.919(0.899–0.935) TestSet BedsidePEWS 0.736(0.597–0.847) 0.717(0.706–0.728) 0.021(0.015–0.028) 0.816(0.806–0.826) Monaghan’sPEWS 0.684(0.434–0.874) 0.816(0.802–0.829) 0.023(0.010–0.035) 0.744(0.728–0.759) Logisticregression 0.849(0.753–0.945) 0.859(0.850–0.868) 0.048(0.035–0.062) 0.912(0.905–0.919) Sensitivity (improvement, P-value) Speciﬁcity (improvement, P-value) PPV (improvement, P-value) AUC (improvement, P-value) Performancecomparison

Trainingset LogisticregressionagainstBedsidePEWS 15.7%,P<0.001 23.3%,P<0.001 22.1%,P<0.001 17.1%,P<0.001 LogisticregressionagainstMonaghan’sPEWS 11.6%,P=0.01 5.2%,P=0.13 8.8%,P=0.03 12.9%,P<0.001 Testset LogisticregressionagainstBedsidePEWS 15.4%,P=0.15 19.8%,P<0.001 125.4%,_P_<_0.001 11.8%,P<0.001 LogisticregressionagainstMonaghan’sPEWS 24.1%,P=0.22 5.3%,P<0.001 114.3%,P=0.02 22.6%,P<0.001

elements,wecreatedfouradditionalmeasurements,includingthe oldest,maximum,minimumandmeaninthestudytime-window. With these four measurements, we intended to represent the dynamicnature ofthepatients’ clinical conditions.In total, we collected36candidatemeasurements.

Inthefifthstep,wecategorizedeachofthese36measurements. Formeasurementsfromtheninenominalelements,weusedthe originallabelstoindicatethecategories.Forthefivecontinuous elements, we performeda categorization step based on defini-tionsofcut-offpointsidentifiedfromthepublishedwork15 _and guidedbytwophysicians.Wecategorizedthefinaltwonarrative elementsbasedonkeywordsandsynonymsprovidedbythe physi-cian.Table1showsthecategorizationofthe16clinicalelements.In ourdataset,thestudyvariablesrarelyhadmissingvalues. Conse-quently,insteadofattemptingtoimputevaluesfortheoccasionally missingclinicalelements,weaddedanewcategory“NotAvailable” (“N/A”)foreachclinicalelementandhandledthe“N/A”category asothernaturallyoccurringcategoriesofthedata.22_The catego-rizationofthese36measurementsresultedin155dichotomous variables.

Finally,weusedChi-squarecalculationstotestthesigniﬁcance ofeachmeasurementinthetrainingset.All36measurementswere selectedandusedtodevelopamachinelearningalgorithmat0.05

P-value.

2.3. Algorithmdevelopment

WeselectedlogisticregressionastheMachineLearning(ML) algorithmandusedWeka3.6.8asourexperimentalplatform.We basedthechoiceonlogisticregression’swideusageinclinical deci-sionsystemsandtherelativeeaseofinterpretingitsoutput.Inthis study,insteadofcalculatingaPEWSforeachcategory,weused binary-valuedvariablestoindicatethepresence/absence of cat-egories.Weapplieda forwardstepwiseapproachwithAkaike’s InformationCriterion(AIC)toselectthebestmodel.23

2.4. Evaluationmetrics

Tomeasurethealgorithm’spredictiveperformance,we calcu-latedthesensitivity,speciﬁcity,positivepredictivevalue(PPV)and AUC.24_A_predicted_positive_was_any_combination_of_predictor vari-ablesthathadanoutputof>0.5fromthelogisticregressionmodel.

2.5. Experimentalcomparison

InordertoevaluatethePICUtransferpredictionmethods,we implementedastandard10-foldcrossvalidationstep.Toassessthe performanceofthemodelsonpractical,realworlddata,we evalu-atedthemodelsonthetestingset,whichrepresentedtherealworld ratioof24-htransferandnon-transferencounters.For compari-son,wealsoevaluatedParshuram’sBedsidePEWSandMonaghan’s PEWS.

WhenevaluatingParshuram’sBedsidePEWS,wefollowed Rob-son’swork17_and_included_only_ﬁve_clinical_elements_available_in ourdatasetinstead oforiginal seven (heartrate,systolic blood pressure,capillaryreﬁlltime,respiratoryrate,respiratoryeffort, transcutaneousoxygensaturation,andoxygentherapy).Allofthe elementsexceptrespiratoryeffortandoxygentherapywerealso usedinourmachinelearningalgorithm.

Monaghan’sPEWSiscalculatedfornearlyalladmittedpatients atCCHMC.Forthecases,weextractedthehighestPEWSvalues recordedbeforethetransferevent;forcontrols,weextractedthe highestPEWSvaluesrecordedintheﬁrst24hofhospitalization. Fourhundredtwentyﬁveencountersinthetrainingset (consist-ingof201casesand224controls)includedanon-missingvalue forMonaghan’sPEWSand3080encountersinthetestset (includ-ing19casesand3061controls)hadMonaghan’sPEWS.Forafair comparison,thepredictionresultsforMonaghan’sPEWSwere cal-culatedonlyontheencountersthathadanon-missingMonaghan’s PEWSvalue.

2.6. Timestampexperiment

In order to assess the robustness of our logistic regression model,weevaluateditusingavailableclinicalmeasurementsat differentpointsofthefirst24-hafteradmission.Specifically,we ranthemodel24timesusingcumulativeclinicalmeasurementsin eachhourofthefirst24-hafteradmissiononthetrainingset(i.e., 946samples).

3. Results

3.1. Predictionresultsusingbaselines

Table2showstheresultsofBedsidePEWSatscore7and Mon-aghan’sPEWSatscore 2.These cut-pointsweredeterminedby

(5)

Table3

Variablesusedintheﬁnallogisticregressionmodel.

Variable Oddsratio P-value

Oldesttemperature≤36 0.78 0.222

Mostrecenttemperature≥38b _1.52 _<0.001

Minimumtemperature≤36b _0.41 _<0.001

OldestsystolicbloodpressureC4a 72.91 0.207

MostrecentsystolicbloodpressureC1a 0.67 0.070

MaximumsystolicbloodpressureC1a,b 1.57 <0.001

MinimumsystolicbloodpressureC4a,b 25.47 <0.001

MeansystolicbloodpressureC1a,b 0.47 <0.001

Mostrecentoxygensaturation>94b _0.43 _<0.001

Maximumoxygensaturation>94 8.42 0.105 Minimumoxygensaturation≤90b _1.80 _0.021

MostrecentheartrateC1a 0.61 <0.001

MostrecentheartrateC3a,b 3.07 <0.001

MostrecentheartrateC4a,b 22.78 0.031

MaximumheartrateC3a,b 0.58 <0.001

MinimumheartrateC4a,b 24.22 0.002

MeanheartrateC4a,b 2.68 0.039

MinimumrespiratoryrateC4a,b 3.10 <0.001

MeanrespiratoryrateC4a,b 4.55 <0.001

LevelofconsciousnessHighriskb _7.16 _0.005

Isthepatientexperiencingpain?Yesb _1.97 _<0.001

Cardiacwithinnormallimitsb _0.28 _<0.001

Neurologicwithinnormallimitsb _0.26 _<0.001

Tissueperfusionandoxygenationadequateb _89.04 _<0.001

Tissueperfusionandoxygenationextremely compromised

28.65 0.246

AcuitylevelLevel3 1.52 0.611 AcuitylevelLevel4b _1.26 _<0.001

WorkofbreathingHighriskb _8.60 _0.049

Perfusioncapreﬁll<3sb _2.10 _0.031

a_See_Table₁_.

b_Indicates_variables_{signiﬁcantly}_associated_with_PICU_transfer.

logisticregressionwiththreshold0.5toguaranteeafair compari-son.

3.2. Predictionresultsusinglogisticregression

Weincluded29variables associatedwith13ofthe16 clini-calelements,intheﬁnalmodelbytheforwardstepwiseapproach (Table3).Ofthe29variables,23weresigniﬁcantlyassociatedwith PICUtransfer(P<0.05).TheresultsarepresentedinTable2.

3.3. Resultscomparison

Table2 presentstheperformance ofthedifferentpredictive methods. In comparison tothe BedsidePEWS and Monaghan’s PEWSinthetestset,ourmodel’sAUCandspecificitywere statis-ticallysignificantlyimproved.Ourmodel’ssensitivitynumerically improvedby15.4and24.1%inthetestset,buttheseimprovements werenotstatisticallysignificant.Wehypothesizethattherelatively smallnumberofcasesinthetestsethinderedthedetectionof sta-tisticalsignificance.ThePPVofourmodelwas8.1%,overtwicethe valueofthatfoundinourdatasetofthetwoPEWS.Asaprevious study25_described,_the_big_decrease_of_PPV_in_the_test_set_was_caused bythedominationofcontrols.

3.4. Timestampexperiment

Fig.3 (top)showstheperformanceofourlogisticregression modelondifferentpoints(from1to24h)afteradmission.Wecan seethatAUCincreasessignificantlyinthefirst6handreachesthe plateauafter7h,whichmeansthatourmodelcanperformagood predictionusingtheclinicalmeasurementsoffirst7hafter admis-sion.ThebottomofFig.3displaysthePICUtransferdistribution forthe473casesinthetrainingset.TheaverageandmedianPICU

transfertimesare11.7and11hrespectively,and79%admissions aretransferredtoPICUafter7h.

4. Discussion

WeusedexistingclinicaldataintheEHRandmachine learn-ingtodevelopandvalidateapredictionalgorithmforPICUtransfer ofhospitalizedpatientsinthefirst24h.Throughaprocessusing expertclinicianopinion,categorizationandmachinelearning,we builtamodelconsistingof29variablesforpredictingPICU trans-fer.Ouralgorithmachieveda0.912(95%CI0.905–0.919)AUCin thetestset.Thisresultwasstatisticallysignificantlyhigherthan applicationoftwoexistingPEWSinourtestdataset.Unlike previ-ousPEWSwhichusedanumberofsub-scorestocreateanoverall scorewithvariouscut-points,weusedlogisticregressionsothat theoutputwasapercentagelikelihoodofPICUtransfer.Withthis approachwewereabletoachieve0.849sensitivityand0.859 speci-ficity.

Our predictionalgorithmperformedsignificantly betterthan twopublishedPEWSthatwerebasedondynamicclinicalelements, suchasvitalsigns.Onereasonforthisfindingisthatweused29 variablesfrom16clinicalelementsascomparedto3–7variablesin PEWSwithwhichwecompared.Ourvariablesincludedvitalsigns, whichbothotherscoresemploy.Wealsoincludedlevelof con-sciousness,painassessments,andworkofbreathingthateachmet twoimportantcriteria:(1)facevalidityinassociationwith wors-eningpatientstatusthatmightprecedePICUtransfer,and(2)were obtainedbyournursesinthecourseoftheirusualclinical assess-ments.Withtheexceptionofonevariable(capillaryrefill)eachof ourvariableswasavailablein>70%ofencounters,withthe major-itybeingpresentin>90%oftheencounters.Atourcenter,these datadidnotrequireanextrareportingstructure,additional clini-calassessments,orresearchnurses.EachwaspresentintheEHR forclinicalcare,butwebelieveeachwaspoorlyleveragedinthe courseofcareinidentifyingandpredictingpatientrisk.

Thetimestampexperimentshowedthatclinicalmeasurements takeninthefirst7hweresufficientforourpredictions.Wefound arelativelylowPPVastransfertothePICUinthefirst24hisan uncommonevent.Aswebelievethecostofafalsenegativeis con-siderablyhigherthanafalsepositive,relativelylowPPVmaybea tolerabletrade-off.

Our prediction algorithm can be integrated into our rapid response system to identify patients at elevated risk for PICU transfer. Current mechanisms to trigger or activate the rapid response systemhave limitations.6 _Early_warning_scores repre-sentaquantitativeandthemostextensivelyvalidatedmechanism toactivate,butotheractivationmechanismsincludingsubjective nurse,family,orphysicianconcernsarealsoused.These subjec-tivemechanismshavelimitationsregardingvalidityandreliability, andtodate,PEWSarelimitedbythemodestnumberofelements from which a score canbe generated. Our methodof machine learning/logisticregressionallowsanoutputofthepercentage like-lihood of PICU transferto becalculated for analmost limitless numberofclinicalelements.Whilethebestwaytousethis out-putwillneedtobedeterminedprospectively,webelievearapid responsesystemcouldhavemultiplethresholdsbasedonthe per-centage likelihood. For example,ifthe likelihoodwere >50%of PICUtransferwithin24h,this maypromptanautomaticcallof themedicalemergencyteamformultidisciplinaryassessment.A scoreof>25%mighttriggerabedsideevaluationbytheprimary medicalteamandarecalculationofpredictionwithin2h.An out-putof>95%mightputinmotion, throughclinicaldecisionaids, aprocessthatmakesimmediatePICUtransferthedefaultaction andaphysicianwouldneedtotakeactiveactiontoavoidsucha result.

(6)

0 5 10 15 20 25 30 35 40 1h 2h 3h 4h 5h 6h 7h 8h 9h _10h _11h _12h _13h _14h _15h _16h _17h _18h _19h _20h _21h _22h _23h _24h

Number of PICU transfers

PICU transfer me aer admission (in hours)

PICU transfer me distribuon

0 0.2 0.4 0.6 0.8 1 1h 2h 3h 4h 5h 6h 7h 8h 9h _10h _11h _12h _13h _14h _15h _16h _17h _18h _19h _20h _21h _22h _23h _24h Performance

Time aer admission (in hours)

PICU transfer classiﬁcaon performance over me

Sensivity Speciﬁcity AUC PPV

Fig.3.Timestampexperimentresults.

4.1. Limitations

Missingdatawasamajorcauseofincorrectpredictionandwe needtodevelopaproperimputationmethodology.Inthecurrent study,weusedaverysimplemethodtoaddressthechallengesof missingdata.Wewillimplementmorecompleximputation meth-odsinourfuturework.Imputationwillbeespeciallyimportant becauseaswewilladdmorevariablestothemodel,theadditional variableswillincludemissingvaluesmorefrequently.

TransfertothePICU,whileaclinicallyimportantevent,does havesomelimitationsasitmaybedriveninpartbynon-patient factorssuchasPICUbedavailability.Infuturestudies,insteadof focusingexclusivelyontheneedforPICUtransferasadependent variable,wewillpredictthedeteriorationofhospitalizedchildren’s clinicalstatusandwillincludeothervariables(e.g.callingamedical emergencyteam,nurseorphysicianidentiﬁcationashighrisk)as dependentvariables.

Inthisstudy,alltheclinicalelementshadamuchhigher per-centageofavailabilitythaninpreviousstudies.Thehighpercentage of available clinical measures provided the bases for applying machinelearningtodetect24-hPICUtransfer.Thismayreduce thegeneralizabilityofourfindingsincenterswithmorefrequently missingdata.Infuturestudies,wewillpartnerwithotheracademic andnon-academicchildren’shospitalstovalidateouralgorithm ina diversesetof institutionsona prospectivesetof patients. Althoughour algorithm was created in thefirst 24h, applying itafter24hisquitestraightforward.Similarlytothetimestamp experiment,weneedtoregeneratethevalueofeachvariableevery fewhours(e.g.,1,2,4)andusethemodeltocalculatethe probabil-ity.However,theeffectivenessofthisapproachneedstobeverified onprospectivedata.Thedeteriorationofhospitalizedchildrenisa complicatedphenomenonwithavarietyofclinicalantecedentsand causes.AprolongedseizurethatresultsinPICUtransferhasa differ-entpathophysiologyandlikelydifferentclinicalantecedentsthan worseningrespiratorydistressinpneumonia.Inourcurrentstudy, weonlytested16clinicalelementsforprediction.Weincluded

just one static element (presence of developmentaldelay) and 15dynamic elements. Whilewe believedynamic elementswill likelybemoreusefulinpredictingPICUtransferintheshortterm, wesuspectfurtheradditionofstaticelementssuchasaneedfor medicaltechnologywillimprovepredictionaccuracy.Wedidnot includelabtestresults,andanearlierworkhasexploredthe pre-dictivepotentialof labtests andmedications for patientstatus deterioration.26_In_future_works,_we_will_include_many_more_data pointsforthepatientssuchaslabtests,medications,diagnostic history,socialhistory,andfamilyhistory.

5. Conclusion

Onaretrospectivedatasetwesuccessfullydevelopedalogistic regressionalgorithmthatutilizedtheEHRcontenttopredictPICU transferforpediatricpatients’ﬁrstdayofhospitalization.Thenovel algorithmachievedhighersensitivity,speciﬁcityandAUCthantwo ofthecurrentPEWSreportedintheliterature.

Authorcontributions

IS,PBandHZdesignedthestudy.HZrantheexperiments, ana-lyzedtheresults,createdthetablesandfigures,andcontributed tothedraftandfinalmanuscript.QL,TL,DWandYNcontributed ideasforalgorithmdevelopment.TheprojectwassupervisedbyIS. ThefirstdraftofthemanuscriptwaspreparedbyHZ,IS,andPB withadditionalcontributionsbyallauthors.Allauthorsreadand approvedthefinalmanuscript.

Funding

TheauthorsweresupportedbyinternalfundsfromCincinnati Children’sHospitalMedicalCenter.HZ,QL,TL,YN,andISwere par-tiallysupportedbygrants5R00LM010227-05,1R21HD072883-01, and1U01HG006828-01.

(7)

Conﬂictofintereststatement

Noconﬂictofinterestforanyoftheauthors. References

1.GhaferiAA,BirkmeyerJD,DimickJB.Variationinhospitalmortalityassociated withinpatientsurgery.NEnglJMed2009;361:1368–75.

2.SilberJH,KaestnerR,Even-ShoshanO,WangY,BresslerLJ.Aggressivetreatment styleandsurgicaloutcomes.HealthServRes2010;45:1872–92.

3.NadkarniVM,LarkinGL,PeberdyMA,etal.Firstdocumentedrhythmandclinical outcomefromin-hospitalcardiacarrestamongchildrenandadults.JAmMed Assoc2006;295:50–7.

4.ScheinRM,HazdayN,PenaM,RubenBH,SprungCL.Clinicalantecedentsto in-hospitalcardiopulmonaryarrest.Chest1990;98:1388–92.

5.Jones DA, DeVita MA, Bellomo R. Rapid-response teams. N Engl J Med 2011;365:139–46.

6.DeVitaMA,SmithGB,AdamSK,etal.“Identifyingthehospitalisedpatientin crisis”–aconsensusconferenceontheafferentlimbofrapidresponsesystems. Resuscitation2010;81:375–82.

7.Bradman K, Borland M, Pascoe E. Predicting patient disposition in a paediatric emergency department. J Paediatr Child Health 2012,

http://dx.doi.org/10.1111/jpc.12011.

8.Petrillo-AlbaranoT,StockwellJ,LeongT,HebbarK.Theuseofamodiﬁed pedi-atricearlywarningscoretoassessstabilityofpediatricpatientsduringtransport. PediatrEmergCare2012;28:878–82.

9.SweneyJS,PossWB,GrissomCK,KeenanHT.Comparisonofseverityofillness scorestophysicianclinicaljudgmentforpotentialuseinpediatriccriticalcare triage.DisasterMedPublicHealthPrep2012;6:126–30.

10.ParshuramCS,BaylissA,ReimerJ,MiddaughK,BlanchardN.Implementingthe BedsidePaediatricEarlyWarningSysteminacommunityhospital:aprospective observationalstudy.PaediatrChildHealth2011;16:e18–22.

11.SkaletzkySM,RaszynskiA,TotapallyBR.Validationofamodiﬁedpediatricearly warningsystemscore:aretrospectivecase-controlstudy.ClinPediatr(Phila) 2012;51:431–5.

12.ParshuramCS,DuncanHP,JoffeAR,etal.Multicentrevalidationofthe bed-sidepaediatricearlywarningsystemscore:aseverityofillnessscoretodetect evolvingcriticalillnessinhospitalisedchildren.CritCare2011;15:R184.

13.AkreM,FinkelsteinM,EricksonM,LiuM,VanderbiltL,BillmanG.Sensitivity ofthepediatricearlywarningscoretoidentifypatientdeterioration.Pediatrics 2010;125:e763–9.

14.DemmelKM,WilliamsL,FleschL.Implementationofthepediatricearlywarning scoringsystemonapediatrichematology/oncologyunit.JPediatrOncolNurs 2010;27:229–40.

15.Parshuram CS, Hutchison J, Middaugh K. Development and initial vali-dation of the Bedside Paediatric Early Warning System score. Crit Care 2009;13:R135.

16.TuckerKM,BrewerTL,BakerRB,DemerittB,VossmeyerMT.Prospective evalu-ationofapediatricinpatientearlywarningscoringsystem.JSpecPediatrNurs 2009;14:79–85.

17.RobsonMA,CooperCL,MedicusLA,QuinteroMJ,ZunigaSA.Comparisonof threeacutecarepediatricearlywarningscoringtools.JSpecPediatrNurs2012,

http://dx.doi.org/10.1016/j.pedn.2012.12.002.

18.EdwardsED,PowellCV,MasonBW,OliverA.Prospectivecohortstudytotest thepredictabilityoftheCardiffandValepaediatricearlywarningsystem.Arch DisChild2009;94:602–6.

19.DasS,EisenbergLD,HouseJW,etal.Meaningfuluseofelectronichealthrecords inotolaryngology:recommendationsfromtheAmericanAcademyof Otolaryn-gology–HeadandNeckSurgeryMedicalInformaticsCommittee.Otolaryngol HeadNeckSurg2011;144:135–41.

20.JhaAK.Thepromiseofelectronicrecordsaroundthecornerordowntheroad? JAmMedAssoc2011;306:880–1.

21.SittigDF,WrightA,OsheroffJA,etal.Grandchallengesinclinicaldecision sup-port.JBiomedInform2008;41:387–92.

22.Grzymala-BusseW,HuM.Acomparisonofseveral approachestomissing attributevaluesindatamining.In:Proceedingsofthesecondinternational con-ferenceonroughsetsandcurrenttrendsincomputingRSCTC’2000.2000.p. 340–7.

23.VenablesWN,RipleyBD.ModernappliedstatisticsstatisticswithS-PLUS.4hed. NewYork:Springer;2002.

24.MetzCE.BasicprinciplesofORCanalysis.SeminNuclMed1978;8:283–98.

25.LalkhenG,McCluskeyA.Clinicaltests:sensitivityandspeciﬁcity.CritCarePain 2008;8:221–3.

26.BonaﬁdeCP,HolmesJH,NadkarniVM,LinR,LandisJR,KerenR.Development ofascoretopredictclinicaldeteriorationinhospitalizedchildren.JHospMed 2012;7:345–9.