Learning
Learning
•
A process that leads to an enduring change in
behavior or knowledge.
– Comes from our experiences.
•
For example: You can learn this material by
experiencing
it in class and at home.
– Anyone afraid of anything that they haven’t always
been?
Learning
• Many ways to learn information.
• Conditioning – The process of learning associations
between environmental events and behavioral responses.
– Basically: We learn how the environment and our behavior are
connected and act in predictable ways as a result.
– Applies to a major portion of how we learn throughout our
lives.
• Two types:
– Classical Conditioning
Classical Conditioning
•
Developed by Ivan Pavlov
•
Originally studying the role of saliva in digestion
in dogs.
•
“Accidentally” discovered that the dogs would
eventually salivate uncontrollably to the sound of
the feeding bell, before they were actually
presented with food.
•
This led to the development of
Classical
Classical Conditioning
•
Classical Conditioning
– Learning through repeatedly pairing a neutral
Classical Conditioning
•
Components of Classical Conditioning:
– Stimuli:
• Neutral (NCS)
• Unconditioned (UCS) • Conditioned (CS)
– Response:
Classical Conditioning
•
Stimuli:
– Neutral (NS): Stimulus that does not cause any
response before it is paired with the unconditioned stimulus.
– Unconditioned (UCS): Stimulus that occurs naturally • CAUSES THE UCR
• (e.g. Thunder)
– Conditioned (CS): Stimulus that causes the
“conditioned response”.
Classical Conditioning
– Response:
• Unconditioned (UCR): A response that occurs naturally
– CAUSED BY UCS
– (e.g. Salivating, Jumping when startled, etc…)
• Conditioned (CR): The learned response to the
Pavlov’s Experiment
• NS – Bell • UCS – Food
• CS – Bell after pairing bell + food repeatedly
• UCR – Salivating at food • CR – Salivating at bell
• The goal is to turn the UR into a CR.
Another Example
•
http://www.youtube.com/watch?v=Eo7jcI8fAuI
•
NS?
•
UCS?
•
CS?
Another Example
•
NS: “That was easy”
•
UCS: Being shot with a toy gun (painful)
•
CS: “That was easy”
•
UCR: Jumping at the feeling of pain.
•
CR: Jumping in anticipation of pain after
Factors That Affect Conditioning
•
Timing:
– VERY IMPORTANT
– Conditioning most effective when the neutral
stimulus (the intended CS) is presented immediately before the UCS.
• Bell THEN food.
• If food is given before the bell the dogs will never learn
that the bell means they will get food.
Factors that Affect Conditioning
•
Stimulus Generalization
– Occurs when stimuli that are similar to the original
conditioned stimulus also elicit the conditioned response, even though they have never peen paired with the unconditioned stimulus.
– Basically, things that are similar to the CS can also
cause a CR unintentionally.
Factors that Affect Conditioning
•
Stimulus Discrimination
– Occurs when you only exhibit a CR to a specific CS,
and not other similar stimuli.
– Generally indicative of good and specific
conditioning.
– For example: Being afraid of fluffy white rats, but
Remember!
•
Steps of Classical Conditioning:
•
1. US->UR
Classical Conditioning in the Real World
•
Problem:
– Coyotes prey on sheep belonging to farmers
•
How could you use classical conditioning
Classical Conditioning in the Real World
•
Solution:
– Gustavson and Gustavson (1985)
• Took sheep meat (CS)
• Sprinkled meat with chemical (US) to cause upset
stomach (UR)
• After becoming sick from the altered meat the coyotes
The Birth of Behaviorism
• As a result of Pavlov’s experiments, Behaviorism became a large
focus in psychology.
– Only cares about observable, measurable behaviors – not our thoughts.
• Founded by John Watson.
– Believed that virtually all human behavior is a result of conditioning and learning.
– “Give me a dozen healthy infants, well-formed, and my own specified
world to bring them up in and I’ll guarantee to take any one at random and train him to become any time of specialist I might select – doctor, lawyer, artist, merchant-chief, and yes, even beggar-man and thief,
regardless of his talents, penchants, tendencies, abilities, vocations, and race of his ancestors”.
Operant Conditioning
•
Developed by B.F. Skinner
•
Operant – Any behavior that generates
consequences.
– The nature of the consequences are unimportant,
only that they occur.
•
Operant Conditioning
– Learning based on associating one’s own voluntary
Operant Conditioning
•
Different from Classical Conditioning
– Classical Conditioning
• Focused on association between stimuli
– Operant Conditioning
Thorndike and the Law of Effect
• Thorndike’s Law of Effect
– If a response in a particular situation is followed by a
satisfying consequence, it will be strengthened
– If followed by an unsatisfying consequence it will be
weakened
• Thorndike’s Puzzle Box
–
OC: Discriminative Stimulus
•
Discriminative Stimulus
– Stimulus situation that sets the occasion for a
response to be followed by a reinforcement or punishment
– “Sets the occasion” for a response to be rewarded
• Being in class (Discriminative Stimulus) sets the
occasion for question-asking to be rewarded.
• A ringing phone (Discriminative Stimulus) sets the
OC: The Nature of Reinforcement
• Reinforcement – consequences that increase the
likelihood of a particular response happening again
– Positive – an event that increases the likelihood of a
response
• Example: getting a candy bar for asking a question in class
– Negative – an event that, when removed, increases the
likelihood a response
• Example: seat belt alarm in a car that only stops when the seat
OC: The Nature of Punishment
• Punishment – Consequences that decrease the
likelihood of responding in a similar way again
– Positive punishment - an event that decreases the
likelihood of a response when presented after the response
• Example: Scolding a child for running in the street
– Negative punishment - an event that, when removed,
decreases the likelihood a response
OC: The Use of Punishment
• Punishment may not always be appropriate – Only teaches what NOT to do
– Doesn’t teach the appropriate behavior
• Also, what we may think is punishment may not be – Crying child who wants attention
• We punish the child but in turn are giving him what he wants – attention. This can actually reinforce the unwanted behavior.
• Learned Helplessness
– Exposure to inescapable and uncontrollable aversive events produces
passive behavior.
• When we can’t escape or control punishment then we tend to become passive
Schedules of Reinforcement
•
Partial reinforcement
– Reinforcement is only given sometimes after the
response
– Four types:
Partial Reinforcement Schedules
•
Fixed-Ratio
– A fixed number of responses is required for
reinforcement
– Example: one piece of candy for every 8 correct
answers
– Elicits steady, consistent responding because the
Partial Reinforcement Schedules
•
Variable-Ratio
– A certain number of responses is required for
reinforcement, but the number changes
– Example: one piece of candy after one correct
answer, then every 3 answers, then 2, then 5…..
– Elicits high rates of responding because do not
know when next reward will occur.
Partial Reinforcement Schedules
•
Fixed-Interval
– Reinforcement is given for responses that occur
after a fixed amount of time
– Example: 5 minutes after asking questions, a
correct answer gets one piece of candy
– Elicits low response rates – no reason to respond
Partial Reinforcement Schedules
• Variable-Interval
– Reinforcement is given for responses that occur after a
certain amount of time, and the time changes
– Example: 5 minutes after questions, an answer gets one
piece of candy; 2 minutes later, 10 seconds….
– Example: calling someone, getting busy signal – you need
to call back, but not clear how long needs to pass
– Elicits high response rates because not predictable when
Shaping
•
How do you train a response that never occurs
in the first place?
•
Shaping – Reinforcement is delivered for
Shaping: Example
• Reinforcing a mouse to push a lever:
– 1. Simply turning toward the lever will be reinforced
– 2. Only stepping toward the lever will be reinforced
– 3. Only moving to within a specified distance from the lever will be reinforced
– 4. Only touching the lever with any part of the body, such as the nose, will be reinforced
– 5. Only touching the lever with a specified paw will be reinforced
– 6. Only depressing the lever partially with the specified paw will be reinforced
Biological Constraints on Learning
•
Can’t teach just any response in any situation
– Biological constraints limit responses that can be
taught.
•
Raccoons and coins
– Can use learning principles to teach a raccoon to
pick up a coin but it’s biological responses will lead it to rub and dunk the coin – never put it in the
Biological Constraints on Learning
•
Instinctual Drift
– Behaviors that go against animals biological drives
can be learned
– However, over time these behaviors will erode
OC: The Nature of Reinforcement
•
Primary Reinforcers:
– A stimulus that acts as a natural reinforcer and
requires no prior learning experiences.
•
Conditioned Reinforcers (Secondary
Reinforcer):
– A stimulus that acts as a reinforcer because of
prior learning experiences
What Does Classical and Operant
Conditioning Boil Down To?
• Behavior Modification
– The primary use of operant and classical conditioning is behavior modification.
– However; we learn many original behaviors without purposefully engaging in conditioning.
• If we learn behaviors from conditioning, and modify
behaviors from conditioning…do we actually have any active role in our lives or are they
predetermined based on the stimuli we encounter?
Practical Uses: Classical Conditioning
•
Classical Conditioning:
– http://www.youtube.com/watch?v=nE8pFWP5QD
CC
•
Neutral Stimulus – Sound of “Windows”
shutting down.
•
UCS – Offer Altoid
•
UCR – Realize needs an Altoid.
•
NS+UCS
•
NS = “Windows” shutting down makes Dwight
Practical Uses: Operant Conditioning
•
Operant Conditioning:
– http://www.youtube.com/watch?v=euINCrDbbD4
Learning from Others: Observational
Learning
• It would be extremely dangerous to only learn about the
consequences our behavior through simple trial and error
• Observational Learning - Learning by observing the
experience of others
– Has great adaptive value
• Chimpanzees in the wild learn how to use stone tools to crack
open nuts by observing older chimps eating
• Also, cats can to wash their food from watching other cats
Learning From Others: Modeling
•
Modeling – Tendency to imitate the behavior
of significant others
•
Vicarious reinforcement – When the model is
reinforced for an action, the viewers
Modeling: Bobo
•
Bandura and the Bobo doll
– http://www.youtube.com/watch?v=vdh7MngntnI
•
Children modeled their behavior after the
adult
Properties of Learning
•
Extinction and Spontaneous Recovery.
– Extinction – Weakening and disappearance of the
conditioned behavior.
• In Classical Conditioning: Presenting the neutral
stimulus without following it with the UCS.
• In Operant Conditioning: Providing inconsistent
consequences for the same behaviors.
– Spontaneous Recovery – The reappearance of a