• No results found

B. Stochastic Neural Networks

N/A
N/A
Protected

Academic year: 2021

Share "B. Stochastic Neural Networks"

Copied!
36
0
0

Loading.... (view fulltext now)

Full text

(1)

B.

Stochastic Neural Networks

(2)

9/29/08 2

(3)
(4)

9/29/08 4

(5)

Motivation

• Idea: with low probability, go against the local

field

– move up the energy surface

– make the “wrong” microdecision

• Potential value for optimization: escape from local

optima

• Potential value for associative memory: escape

from spurious states

(6)

9/29/08 6

The Stochastic Neuron

Deterministic neuron :  s i = sgn h

( )

i Pr  {s i = +1} =  h

( )

i Pr  {s i = 1} = 1  h

( )

i Stochastic neuron : Pr  {s i = +1} =  h

( )

i Pr  {s i = 1} = 1 h

( )

i Logistic sigmoid : ( )h = 1 1+ exp 2 h T( ) h (h)

(7)

Properties of Logistic Sigmoid

• As h  +, (h)  1 • As h  –, (h)  0 • (0) = 1/2 

( )

h = 1 1+ e2h T

(8)

9/29/08 8

Logistic Sigmoid

With Varying T

(9)

Logistic Sigmoid

T = 0.5

(10)

9/29/08 10

Logistic Sigmoid

(11)

Logistic Sigmoid

(12)

9/29/08 12

Logistic Sigmoid

(13)

Logistic Sigmoid

(14)

9/29/08 14

Logistic Sigmoid

(15)

Pseudo-Temperature

• Temperature = measure of thermal energy (heat) • Thermal energy = vibrational energy of molecules • A source of random motion

• Pseudo-temperature = a measure of nondirected (random) change

• Logistic sigmoid gives same equilibrium

(16)

9/29/08 16

Transition Probability

Recall, change in energy E = skhk

= 2skhk Pr 

{

s k = ±1sk = m1

}

= 

(

±hk

)

= 

(

skhk

)

Pr s

{

k  sk

}

= 1 1+ exp 2s

(

khk T

)

= 1 1+ exp E T

(

)

(17)

Stability

• Are stochastic Hopfield nets stable?

• Thermal noise prevents absolute stability • But with symmetric weights:

(18)

9/29/08 18

Does “Thermal Noise” Improve

Memory Performance?

• Experiments by Bar-Yam (pp. 316-20):

 n = 100  p = 8

• Random initial state

• To allow convergence, after 20 cycles set T = 0

• How often does it converge to an imprinted pattern?

(19)

Probability of Random State Converging

on Imprinted State (n=100, p=8)

(20)

9/29/08 20

Probability of Random State Converging

on Imprinted State (n=100, p=8)

(21)

Analysis of Stochastic Hopfield

Network

• Complete analysis by Daniel J. Amit & colleagues in mid-80s

• See D. J. Amit, Modeling Brain Function:

The World of Attractor Neural Networks,

Cambridge Univ. Press, 1989.

• The analysis is beyond the scope of this course

(22)

9/29/08 22

Phase Diagram

(fig. from Domany & al. 1991)

(A) imprinted = minima

(B) imprinted, but s.g. = min.

(C) spin-glass states

(23)

Conceptual Diagrams

of Energy Landscape

(24)

9/29/08 24

Phase Diagram Detail

(25)

Simulated Annealing

(26)

9/29/08 26

Dilemma

• In the early stages of search, we want a high temperature, so that we will explore the

space and find the basins of the global minimum

• In the later stages we want a low

temperature, so that we will relax into the global minimum and not wander away from it

• Solution: decrease the temperature gradually during search

(27)

Quenching vs. Annealing

• Quenching:

– rapid cooling of a hot material

– may result in defects & brittleness – local order but global disorder

– locally low-energy, globally frustrated

• Annealing:

– slow cooling (or alternate heating & cooling) – reaches equilibrium at each temperature

– allows global order to emerge – achieves global low-energy state

(28)

9/29/08 28

Multiple Domains

local coherence global incoherence

(29)
(30)

9/29/08 30

Effect of Moderate Temperature

(31)
(32)

9/29/08 32

Effect of Low Temperature

(33)

Annealing Schedule

• Controlled decrease of temperature • Should be sufficiently slow to allow

equilibrium to be reached at each temperature

• With sufficiently slow annealing, the global minimum will be found with probability 1 • Design of schedules is a topic of research

(34)

9/29/08 34

Typical Practical

Annealing Schedule

• Initial temperature T0 sufficiently high so all transitions allowed

• Exponential cooling: Tk+1 = Tk

 typical 0.8 <  < 0.99

 at least 10 accepted transitions at each temp.

• Final temperature: three successive

temperatures without required number of accepted transitions

(35)

Summary

• Non-directed change (random motion) permits escape from local optima and spurious states

• Pseudo-temperature can be controlled to adjust relative degree of exploration and exploitation

(36)

9/29/08 36

Additional Bibliography

1. Anderson, J.A. An Introduction to Neural

Networks, MIT, 1995.

2. Arbib, M. (ed.) Handbook of Brain Theory &

Neural Networks, MIT, 1995.

3. Hertz, J., Krogh, A., & Palmer, R. G.

Introduction to the Theory of Neural Computation, Addison-Wesley, 1991.

References

Related documents

As discussed at the November 7 meeting, both the Sheriff's JHIS and the Probation, Department's Probation Electronic Medical Records System (PEMRS) require code í upgrades for their

From 003 to 00 the OSOSS programme – conducted by the ICTU foundation by order of the Ministry of the Interior and Kingdom Rela- tions and the Ministry of Economic Affairs –

They include co-operative, community- driven, participatory, or self-managed housing, and can involve different property arrangements, including shared tenure, or the sharing

Considering the five Business Engineering dimensions we assess different maturity factors, i.e., process/structure maturity (extent to which SoMe processes are defined,

Mary asked about the course shell approval process by clarifying that instructors who have not had a Canvas shell approved yet or instructors that want to teach a new course in

Magnetoencephalographic (MEG) data from human participants to stationary and moving sounds identified responses elicited only from moving sounds and whose source

education and 1 year of professional records management experience •   Or, 1 year of professional records manager experience can be.. substituted for each year of college education

Although the council had provided the church with a clear statement of faith, this is no way signalled the end of Arianism as some of the later emperors after Constantine