Process Mining
The influence of big data (and the internet
of things) on the supply chain
Wil van der Aalst
www.vdaalst.com @wvdaalst
www.processmining.org
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
htt
p:/
/w
w
w
.en
gi
ne
ersj
ou
rnal
.i
e/
fac
tor
y
-of
-t
he
-future
-w
il
l-see
-m
ergi
ng
-of
-v
irt
ua
l-and
-real
-w
or
lds
/
My personal view:
1. Unprecedented amounts of data
about machines, products,
people, organizations, …
2. The ability to analyze such data
(scale, types of analysis,
My personal view:
1. Unprecedented amounts of data
about machines, products,
people, organizations, …
2. The ability to analyze such data
(scale, types of analysis,
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
1) Data: Internet of Events
Internet of
Content
Internet of
People
“social”
Internet of
Things
Internet of
Places
“cloud”
“mobility”
Internet of Events
“Big
Data”
2) Data Science: Generating value from data
•
Spectacular progress in:
−
Analytics (various
types of mining)
−
Databases (NoSQL,
In-Memory)
−
Distribution
(MapReduce,
Hadoop)
•
New discipline that is
data
mining
process
mining
visualization
data
science
behavioral/
social
sciences
domain
knowledge
machine
learning
large scale
statistics
industrial
engineering
databases
stochastics
privacy
algorithms
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
A new discipline …
A new discipline …
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
A new discipline …
mathematics
mathematics
computer scie
nce
computer scie
nce
data sc
ience
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
DSC/e:
Competences and Research Programs
28 groups involved
Context: Why are we using data science, does it have the intended effect, and will
people accept it? Analysis: How to turn data into real value
(models, answers/decisions, and visualizations/insights)?
Enabling technologies: How to get the data and deal with computational/ infrastructural challenges (big data and
hard questions)?
Probability and Statistics
Stochastic Networks Data Mining Process Mining Visualization Large-Scale Distributed Systems Data-Intensive Algorithms Data-Driven Operations Management Data-Driven Innovation and
Business
Human and Social Analytics Privacy, Security, Ethics, and
Governance Internet of Things
sy
st
em
s
infr
ast
ruc
tur
es
cit
ies
o
rg
an
iza
tio
n
s
people
[RP1] Process Analytics: Improving Service While Cutting Costs [RP2] Customer Journey:Correlating Events to Learn and Influence Customer Behavior
[RP3] Smart Maintenance & Diagnostics: Safeguarding Availability
[RP4] Quantified Self:
Improving Performance and Well-Being
[RP5] Data Value and Privacy: Economic and Legal Aspects of Data Science
[RP6] Smart Cities:
Ensuring Safety and Convenience for Citizens [RP7] Smart Grids:
Data Intensive Infrastructures
[RP1] Process Analytics:
Improving Service While Cutting Costs
[RP2] Customer Journey:
Correlating Events to Learn and Influence Customer Behavior
[RP3] Smart Maintenance & Diagnostics:
Safeguarding Availability
[RP4] Quantified Self:
Improving Performance and Well-Being
[RP5] Data Value and Privacy:
Economic and Legal Aspects of Data Science
[RP6] Smart Cities:
Ensuring Safety and Convenience for Citizens
[RP7] Smart Grids:
Data Science Flagship (Philips & DSC/e)
•
4 Strategic topics
•
4 TU/e departments
•
16 PhD students
•
30 Data science specialists
1. Data Driven Value Propositions
2. Healthcare Smart Maintenance
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: On the interface
between process science and
data science
Spreadsheet: Killer App for early computers
•
VisiCalc
(killer
app for Apple II,
Oct. 1979)
•
Lotus 1-2-3
(killer
app for IBM PC
1983)
•
Microsoft Excel
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Spreadsheet: Static data
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Spreadsheet: Static data
31 items
sold
total
value
average
distribution
Spreadsheet: Static data
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
•
Input: events (“things that
have happened”)
•
Mandatory per event:
−
case identifier
−
activity name
−
timestamp/date
•
Optional
−
resource
−
transaction type
−
costs
−
…
case
identifier
activity
name
timestamp
resource
row = event
Process Mining: Spreadsheet for behavior
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
batching for activities
“opstellen eindnota” and
Loesje van
der Aalst
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
process discovery
NO
modeling
Process Mining: Spreadsheet for behavior
process discovery
NO
modeling
needed!
74 act.
11 act.
3 act.
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
event data
process
model
desire line
very safe
system
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
conformance checking
Process Mining: Spreadsheet for behavior
conformance checking
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
conformance checking
final inspection is
skipped 40 times
Process Mining: Spreadsheet for behavior
conformance checking
move on model
(something should have
happened, but did not)
move on log
(something happened that
should not happen)
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
performance analysis
average
flowtime is
1.92 months
bottleneck
NO
modeling
needed!
Process Mining: Spreadsheet for behavior
performance analysis
waiting time of
15.74 days
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: Spreadsheet for behavior
animating reality
real cases
NO
modeling
Process Mining: Spreadsheet for behavior
16 cases are
queueing
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Process Mining: The missing link
•
Process-centric
and
data-centric
!
•
To answer
conformance
and
performance
questions!
•
Offline
and
online
.
•
Multiple
perspectives:
−
control-flow
−
resources
−
time
−
data
−
…
process
mining
data-oriented analysis
(data mining, machine learning, business intelligence)
process model analysis
(simulation, verification, optimization, gaming, etc.)
performance-oriented
questions,
problems and
solutions
compliance-oriented
questions,
problems and
solutions
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Replay
register travel
request (a)
get detailed
motivation
letter (c)
get support
from local
manager (b)
check budget
by finance (d)
decide (e)
accept
request (g)
reject
request (h)
reinitiate
request (f)
start
end
a c d e g
Replay
register travel
request (a)
get detailed
motivation
letter (c)
get support
from local
manager (b)
decide (e)
accept
request (g)
a c
check budget (d)
is missing!
e g
?
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Replay
register travel
request (a)
get detailed
motivation
letter (c)
get support
from local
manager (b)
check budget
by finance (d)
decide (e)
accept
request (g)
reject
request (h)
reinitiate
request (f)
start
end
a c d e g
h
reject request (h) is
impossible
?
Replay with timestamps
register travel
request (a)
get detailed
motivation
letter (c)
get support
from local
manager (b)
check budget
by finance (d)
decide (e)
accept
request (g)
reject
request (h)
start
end
a
9.15
c
9.20
d
9.35
e
10.15
g
11.30
9.15
9.20
10.15
11.30
5
55
20
40
75
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Replay with timestamps
register travel
request (a)
get detailed
motivation
letter (c)
get support
from local
manager (b)
check budget
by finance (d)
decide (e)
accept
request (g)
reject
request (h)
reinitiate
request (f)
start
end
5
55
20
40
75
15
20
60
45
65
20
25
45
55
50
5
55
20
40
75
15
20
60
45
65
20
25
45
55
50
5
55
20
40
75
15
20
60
45
65
20
25
45
55
50
Deviations
Where?
Why?
time
costs
…
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Supply Chain / Industry 4.0
•
SAP process mining projects (cf.
ProcessGold, Celonis, etc.)
•
EDImine project
(http://edimine.ec.tuwien.ac.at/)
•
CORE (Consistently Optimized
Resilient Secure Global Supply Chains)
project (http://www.coreproject.eu/)
•
Looking for interesting supply chain
applications (data first) …
How to get started?
•
Event Data
•
Process Mining Tools
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Starting point for process mining:
Event data
student name
course name
exam date
mark
Peter Jones
Business Information systems
16-1-2014
8
Sandy Scott
Business Information systems
16-1-2014
5
Bridget White
Business Information systems
16-1-2014
9
John Anderson
Business Information systems
16-1-2014
8
Sandy Scott
BPM Systems
17-1-2014
7
Bridget White
BPM Systems
17-1-2014
8
Sandy Scott
Process Mining
20-1-2014
5
Bridget White
Process Mining
20-1-2014
9
John Anderson
Process Mining
20-1-2014
8
…
…
…
…
case id
activity name
timestamp
other data
every row is an event
(here: an exam attempt)
Another event log: patient treatment
patient
activity
timestamp
doctor
age
cost
5781
make X-ray
[email protected]
Dr. Jones
45
70.00
5541
blood test
[email protected]
Dr. Scott
61
40.00
5833
blood test
[email protected]
Dr. Scott
24
40.00
5781
blood test
[email protected]
Dr. Scott
45
40.00
5781
CT scan
[email protected]
Dr. Fox
45
1200.00
5833
surgery
[email protected]
Dr. Scott
24
2300.00
5781
handle payment
[email protected]
Carol Hope
45
0.00
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Another event log: order handling
order
number
activity
timestamp
user
product
quantity
9901
register order
[email protected]
Sara Jones
iPhone5S
1
9902
register order
[email protected]
Sara Jones
iPhone5S
2
9903
register order
[email protected]
Sara Jones
iPhone4S
1
9901
check stock
[email protected]
Pete Scott
iPhone5S
1
9901
ship order
[email protected]
Sue Fox
iPhone5S
1
9903
check stock
[email protected]
Pete Scott
iPhone4S
1
9901
handle payment
[email protected]
Carol Hope
iPhone5S
1
9902
check stock
[email protected]
Pete Scott
iPhone5S
2
9902
cancel order
[email protected]
Carol Hope
iPhone5S
2
…
…
…
…
…
…
How to get started?
•
Event Data
•
Process Mining Tools
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
900+ plug-ins available covering the
whole process mining spectrum
©Wil van der Aalst & TU/e (use only with permission & acknowledgements) ©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
How to get started?
•
Event Data
•
Process Mining Tools
Process Mining
Data Science in Action
43.000+25.000 people joined!
Starts again on October 7
th
2015!
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)