COMP310
MultiAgent Systems
What are Multi-Agent Systems?
Environment
sphere of influence
KEY
interaction
organisational relationship
What are Multi-Agent Systems?
•
Thus a multiagent system contains a number of agents that:
•
interact through communication;
•
are able to act in an environment;
•
have different “spheres of influence” (which may coincide); and
•
will be linked by other (organisational) relationships.
•
Here we will look at how agents decide how to interact in
Utilities and Preferences
•
Assume we have just two agents:
Ag = {i, j}
.
•
Agents are assumed to be
self-interested
:
•
they have preferences over how the environment is
.
•
Assume
Ω
= {
ω
1
,
ω
2
, . . .}
is the set of “outcomes” that agents
have preferences over.
•
We capture preferences by
utility functions
, represented as real
numbers:
•
Utility functions lead to
preference orderings
over outcomes
•
For example, if
ω
and
ω
'
are both possible outcomes of
Ω
:
ω
≽
i
ω′
means
u
i
(
ω
)
≥
u
i
(
ω′
)
u
i
:
Ω
→
IR
What is Utility
•
Typical relationship between utility & money:
•
Utility is
not
money. Just a way to encode preferences.
utility
Multiagent Encounters
•
We need a model of the environment in which these
agents will act...
•
agents simultaneously choose an action to perform, and as a
result of the actions they select, an outcome in
Ω
will result;
•
the
actual
outcome depends on the
combination
of actions;
•
assume each agent has just two possible actions that it can
perform
C
(“cooperate”) and “
D
” (“defect”).
•
Environment behaviour given by
state transformer
function
(introduced in Chapter 2):
τ
:
!"#$
Ac
agent
i
’s action
×
!"#$
Ac
agent
j
’s action
Multiagent Encounters
•
Here is a state transformer function:
τ
(D,D) =
ω
1
τ
(D,C) =
ω
2
τ
(C,D) =
ω
3
τ
(C,C) =
ω
4
(This environment is sensitive to actions of both agents.)
•
Here is another:
τ
(D,D) =
ω
1
τ
(D,C) =
ω
1
τ
(C,D) =
ω
1
τ
(C,C) =
ω
1
(Neither agent has any influence in this environment.)
•
And here is another:
τ
(D,D) =
ω
1
τ
(D,C) =
ω
2
τ
(C,D) =
ω
1
τ
(C,C) =
ω
2
Rational Action
•
Suppose we have the case where both agents can influence the
outcome, and they have utility functions as follows:
u
i
(
ω
1
)=1 u
i
(
ω
2
)=1 u
i
(
ω
3
)=4 u
i
(
ω
4
)=4
u
j
(
ω
1
)=1 u
j
(
ω
2
)=4 u
j
(
ω
3
)=1 u
j
(
ω
4
)=4
•
With a bit of abuse of notation:
u
i
(D,D) = 1 u
i
(D,C) = 1 u
i
(C,D) = 4 u
i
(C,C) = 4
u
j
(D,D) = 1 u
j
(D,C) = 4 u
j
(C,D) = 1 u
j
(C,C) = 4
•
Then agent
i
’s preferences are:
C, C
≽
i
C, D
≻
i
D, C
≽
i
D, D
•
In this case, what should
i
do?
•
i
prefers all outcomes that arise through C over all outcomes that
arise through D.