5 Model design
5.1 Prediction model
5.1.1
Causal forecasting: predicting walk-in demand
The model constructed to predict walk-in patient demand in the plaster room eight weeks in advance consists of 7 steps and can be subdivided in the categories input, output and validation and evaluation. The different steps are visualized in Figure 20.
Figure 20: Overview of forecasting model
Step 1 to 3 focusses on creating the correct input parameters and distributions, where step 5 to 7 are based on the generated predictions. Step 4 of this process consist of the causal forecasting calculations, using convolution to create one output distribution. These queries are currently used by Rhythm to predict the demand for the radiology department. We create an adapted version for the plaster room for this research and future use. Collaborating with Rhythm on building this model ensures consistency for the plaster room department. In the following sections we describe each step of the forecasting model.
5.1.1.1 Input
Step 1: Data preparation
To create the input distributions, preparation of multiple data sets is performed. For the preparation of these data sets, we use Pentaho Data Integration (Kettle). These datasets are:
β’ Outpatient clinic schedule β’ Outpatient activities β’ Plaster room activities
Page 52 of 76
All data sets are filtered on the following criteria: location Nijmegen, specialty: orthopedics, outpatient, date between January 1st, 2016 and the 31st of December 2017. In the plaster room activity data set
we also exclude all appointment-based treatments. The historical demand is divided in three groups.
a. Patients with a treatment in the plaster room on the same day as an outpatient clinic visit b. Patients with a treatment in the plaster room on the same day as an unscheduled meeting
with a doctor
c. Patients without any other treatment in the SMK on the same day
The model assumes that patients visit the doctor and the plaster room on the same part of the day. The care demand of the plaster room is therefore planned at the daypart on which the patient had an outpatient clinic visit.
Step 2: Create input distributions
We use the historical demand to generate probability distributions per session category by aggregating the historical demand. The outpatient clinic sessions can be categorized in multiple ways and is aggregated over the data originating from group a. The following distributions are created:
- Outflow per physician per daypart [e.g., doctor A morning]
- Outflow per physician without daypart differentiation [e.g., doctor A]
- Outflow per orthopedic specialty per daypart [e.g., specialty foot morning]
- Outflow per outpatient clinic session without physician or orthopedic specialty differentiation
[e.g., generic]
The generic outflow per outpatient clinic session can be used to predict outflow of outpatient clinic sessions of physicians that are unknown in the historical data set.
The data of group b and c is used to create a distribution that aggregates the data per daypart (Monday morning) since those could not be linked to outpatient clinic sessions.
Step 3: Session selection
We combine the obtained distributions in four different input sessions. The sessions created all include the distribution created from group b and c. The distinction between the sessions regard the classification of outpatient clinic sessions (physician, specialty, daypart).
Page 53 of 76
This results in the following combined input distributions:
Step 4: Causal forecasting
In step 4 of the process, we alter the causal forecasting model. The forecasting model is implemented in the radiology department of Sint Maartenskliniek. The model is adapted to make it applicable to use in the plaster room. The scientific research underlying this model is described in Section 4.2. The outpatient clinic schedule contains the scheduled physicians, which is linked to their historical outflow distribution. By using discrete convolution all outflow distributions are be combined per daypart, resulting in one probability distribution per daypart. The expected demand is calculated for January, February and March 2018. Both the input and the output are treatment time in minutes.
5.1.1.2 Output
Step 5: Prediction output
The output provided by model produce an output containing the following variables:
Figure 21 shows an example of the output variables over multiple time periods. The output consists of a mean value and two confidence intervals: a 25%-75% confidence interval and a 2%-98% confidence interval. Suppose that physician A and physician B have an outpatient clinic session scheduled on Monday morning, then we expect their mean outflow to the plaster room to be around 365 minutes. Considering all dayparts where the outpatient clinic schedule is equal, we expect that the realized mean walk-in demand of these dayparts falls within the boundaries of the 25%-75% interval with a certainty of 50%.
Page 54 of 76 Figure 21: Example of visualization of the causal forecasting output
5.1.1.3 Validation and evaluation
Step 6: Validation and evaluation.
The forecasting distributions are validated and evaluated with the data from January and February 2018. The evaluation methods are:
π΄π£πππππ πΈππππ (π΄πΈ) = βπππ¦ππππ‘π π· β π βπππ¦ππππ‘π ππππ π΄ππ πππ’π‘π πππππππ‘πππ πΈππππ (ππ΄ππΈ) = β | π β π· π | πππ¦ππππ‘π β πππ¦ππππ‘π β 100 πππππππ‘ ππ π΄πππ’ππππ¦ (πππ΄) = ββπππ¦π π· πππ¦ *100% π΄π£πππππ π΄πππ’ππππ¦ (π΄π΄) = βπππ¦ππππ‘π ( |π· β π | < 90 ππππ’π‘ππ ) β πππ¦ππππ‘π
To gain additional insights in how often the forecasting distribution over- or underestimates the walk- in demand, the number of times the realization is between π·2 and π·25, π·25 and π· , etc. to get insight are counted.
Step 7: Forecasting selection
Based on the evaluation measures in step 6, the most accurate and promising forecasting distribution is selected. This prediction is used in the second part of the model as if the plaster room were to use it when making patient planning decisions. Possible inaccuracies or deviations from realization are analyzed to determine errors in the causal forecasting method or give recommendations on how to implement the selected forecasting distribution.
Page 55 of 76