CSE 573: Introduction to Artificial Intelligence

(1)

Hanna Hajishirzi Search

(Un-informed, Informed Search)

slides adapted from

Dan Klein, Pieter Abbeel ai.berkeley.edu And Dan Weld, Luke Zettelmoyer

(2)

Recap: Search

o _{Search problem:}

o States (configurations of the world)

o Actions and costs

o Successor function (world dynamics)

o _{Start state and goal test}

o _{Search tree:}

o Nodes: represent plans for reaching states

o _{Search algorithm:}

o Systematically builds a search tree

o _{Chooses an ordering of the fringe (unexplored nodes)} o Optimal: finds least-cost plans

(3)

Example: Pancake Problem

(4)

(5)

Example: Pancake Problem 3 2 4 3 3 2 2 2 4

State space graph with costs as weights

3

4

3 4

(6)

General Tree Search

Action: flip top two Cost: 2

Action: flip all four Cost: 4

Path to reach goal: Flip four, flip three

(7)

Uniform Cost Issues

o _{Remember: UCS explores increasing cost} contours

o _{The good: UCS is complete and optimal!}

o _{The bad:}

o Explores options in every “direction”

o No information about goal location

o _{We’ll fix that soon!}

Start _Goal …

c ≤ 3 c ≤ 2 c ≤ 1

(8)

Up next: Informed Search o _{Uninformed Search} o _DFS o _BFS o _UCS ▪ Informed Search ▪ Heuristics ▪ Greedy Search ▪ A* Search ▪ Graph Search

(9)

Search Heuristics ▪ A heuristic is:

▪ A function that estimates how close a state is to a goal

▪ Designed for a particular search problem

▪ Pathing?

▪ _{Examples: Manhattan distance, Euclidean distance for}

pathing

10

5

(10)

Example: Heuristic Function

(11)

Example: Heuristic Function

Heuristic: the number of the largest pancake that is still out of place

4 3 0 2 3 3 3 4 4 3 4 4 4 h(x)

(12)

(13)

Greedy Search

o _{Expand the node that seems closest…}

o _{Is it optimal?}

(14)

Greedy Search

o _{Strategy: expand a node that you think is}

closest to a goal state

o _{Heuristic: estimate of distance to nearest goal}

for each state

o _{A common case:}

o _{Best-first takes you straight to the (wrong) goal}

o _{Worst-case: like a badly-guided DFS}

… b

(15)

(16)

(17)

(18)

A* Search

UCS Greedy

(19)

Combining UCS and Greedy

o Uniform-cost orders by path cost, or backward cost g(n)

o _Greedy_{orders by goal proximity, or forward cost h(n)}

o A* Search orders by the sum: f(n) = g(n) + h(n)

S a d b G h=5 h=6 h=2 1 8 1 1 2 h=6 h=0 c h=7 3 e h=1 1

Example: Teg Grenager

S a b c e d d G G g = 0 h=6 g = 1 h=5 g = 2 h=6 g = 3 h=7 g = 4 h=2 g = 6 h=0 g = 9 h=1 g = 10 h=2 g = 12 h=0

(20)

When should A* terminate?

o _{Should we stop when we enqueue a goal?}

o _{No: only stop when we dequeue a goal}

S B A G 2 3 2 2 h = 1 h = 2 h = 0 h = 3 S 0 3 3 g h + S->A 2 2 4 S->B 2 1 3 S->B->G 5 0 5 S->A->G 4 0 4

(21)

Is A* Optimal?

o _{What went wrong?}

o _{Actual bad goal cost < estimated good goal cost} o _{We need estimates to be less than actual costs!}

A G S 1 3 h = 6 h = 0 5 h = 7 g h + S 0 7 7 S->A 1 6 7 S->G 5 0 5

(22)

Idea: Admissibility

Inadmissible (pessimistic) heuristics break optimality by trapping

good plans on the fringe

Admissible (optimistic) heuristics slow down bad plans but

(23)

Admissible Heuristics o _{A heuristic}h is admissible (optimistic) if:

where is the true cost to a nearest goal

o _Examples_:

o _{Coming up with admissible heuristics is most of what’s involved in}

using A* in practice.

(24)

Properties of A*

… b …

b

(25)

UCS vs A* Contours

o _{Uniform-cost expands equally in} all “directions”

o _{A* expands mainly toward the} goal, but does hedge its bets to ensure optimality

Start _Goal

(26)

Comparison

(27)

(28)

(29)

(30)

(31)

(32)

(33)

(34)

Optimality of A* Tree Search

Assume:

o _{A is an optimal goal node} o _{B is a suboptimal goal node} o _{h is admissible}

Claim:

o _{A will exit the fringe before B}

(35)

Optimality of A* Tree Search: Blocking

Proof:

o _{Imagine B is on the fringe}

o _{Some ancestor n of A is on the fringe,}

too (maybe A!)

o _{Claim: n will be expanded before B}

1. f(n) is less or equal to f(A)

Definition of f-cost Admissibility of h

…

(36)

Proof:

o _{Imagine B is on the fringe}

o _{Some ancestor n of A is on the}

fringe, too (maybe A!)

o _{Claim: n will be expanded before B}

2. f(A) is less than f(B)

B is suboptimal h = 0 at a goal

(37)

Proof:

o Imagine B is on the fringe

o Some ancestor n of A is on the fringe, too (maybe A!)

o Claim: n will be expanded before B

2. f(A) is less than f(B)

3. n expands before B

o All ancestors of A expand before B

o A expands before B

o A* search is optimal

(38)

(39)

A*: Summary

o _{A* uses both backward costs and (estimates of) forward} costs

o _{A* is optimal with admissible (optimistic) heuristics}

(40)

Video of Demo Empty Water Shallow/Deep – Guess Algorithm

(41)

(42)

Creating Admissible Heuristics

o _{Most of the work in solving hard search problems optimally is in coming up with} admissible heuristics

o _{Often, admissible heuristics are solutions to relaxed problems, where new actions are} available

o _{Inadmissible heuristics are often useful too}

15

(43)

Example: 8 Puzzle

o What are the states?

o How many states?

o What are the actions?

o How many successors from the start state?

o What should the costs be?

Start State Actions Goal State

Admissible heuristics?

(44)

8 Puzzle I o _{Heuristic: Number of tiles misplaced}

o _{Why is it admissible?} o _{h(start) =}

o _{This is a relaxed-problem heuristic}

8

Average nodes expanded when the optimal path has…

…4 steps …8 steps …12 steps

UCS 112 6,300 3.6 x 106

TILES 13 39 227

Start State _{Goal State}

(45)

8 Puzzle II

o _{What if we had an easier 8-puzzle}

where any tile could slide any direction at any time, ignoring other tiles?

o _{Total Manhattan distance} o _{Why is it admissible?}

o _{h(start) =}

3 + 1 + 2 + … = 18

Average nodes expanded when the optimal path has…

…4 steps …8 steps …12 steps

TILES 13 39 227

MANHATTAN 12 25 73

(46)

8 Puzzle III

o _{How about using the actual cost as a heuristic?}

o _{Would it be admissible?}

o _{Would we save on nodes expanded?} o _{What’s wrong with it?}

o _{With A*: a trade-off between quality of estimate and work per node}

o _{As heuristics get closer to the true cost, you will expand fewer nodes but}

(47)

Example: Pancake Problem

o _{Action: Flip over top n pancakes}

(48)

(49)

Trivial Heuristics, Dominance

o Dominance: h_a ≥ h_c if

o _{Heuristics form a semi-lattice:}

o Max of admissible heuristics is admissible

o _{Trivial heuristics}

o Bottom of lattice is the zero heuristic (what does this give us?)

(50)

(51)

Tree Search: Extra Work!

o _{Failure to detect repeated states can cause exponentially more work.}

Search Tree State Graph

(52)

Graph Search

o _{In BFS, for example, we shouldn’t bother expanding the circled nodes} (why?) S a b d p a c e p h f r q q c G a q e p h f r q q c G a

(53)

Graph Search

o Idea: never expand a state twice

o How to implement:

o Tree search + set of expanded states (“closed set”)

o _{Expand the search tree node-by-node, but…}

o Before expanding a node, check to make sure its state has never been expanded before

o If not new, skip it, if new add to closed set

o Important: store the closed set as a set, not a list

o _{Can graph search wreck completeness? Why/why not?} o How about optimality?

(54)

A* Graph Search Gone Wrong? S A B C G 1 1 1 2 3 h=2 h=1 h=4 h=1 h=0 S (0+2) A (1+4) B (1+1) C (2+1) G (5+0) C (3+1) G (6+0)

State space graph Search tree

(55)

Consistency of Heuristics

o _{Main idea: estimated heuristic costs ≤ actual costs} o Admissibility: heuristic cost ≤ actual cost to goal

h(A) ≤actual cost from A to G

o Consistency: heuristic “arc” cost ≤ actual cost for each arc h(A) – h(C)≤ cost(A to C)

o Consequences of consistency:

o The f value along a path never decreases h(A) ≤ cost(A to C) +h(C)

o A* graph search is optimal

3 A C G h=4 1 h=1 h=2

(56)

A* Graph Search o Sketch: consider what A* does with a

consistent heuristic:

o Fact 1: In tree search, A* expands nodes in increasing total f value (f-contours)

o Fact 2: For every state s, nodes that reach s optimally are expanded before nodes that reach s suboptimally

o Result: A* graph search is optimal

…

f ≤ 3 f ≤ 2 f ≤ 1

(57)

Optimality of A* Search

o _{With a admissible heuristic, Tree A* is optimal.} o _{With a consistent heuristic, Graph A* is optimal.}

(58)

(59)

A* Applications

o Video games

o Pathing / routing problems

o _{Resource planning problems}

o Robot motion planning

o Language analysis

o _{Machine translation}

o _{Speech recognition}

(60)

A* in Recent Literature

o _{Joint A* CCG Parsing and}

Semantic Role Labeling (EMLN’15)

o _Diagram

(61)

Search and Models

o _{Search operates over} models of the world

o _{The agent doesn’t}

actually try all the plans out in the real world!

o _{Planning is all “in}

simulation”

o _{Your search is only as}

(62)