The Principles of Horsemanship: Part 3 – Pavlov’s Principle

pavlovGraphic

with Andrew McLean

Relaxation and attentiveness can only occur if the horse responds to light signals. During training the horse learns to respond from light aids if they are offered just before and overlapping stronger motivating pressure. When training a new signal, the new signal should happen just before and during the light aid.
Pavlov’s principle is all about training the horse to operate from light aids, including seat, weight and positional cues. There is a definite science about training horses and other animals to operate from previously neutral signals, and Ivan Pavlov (1849 – 1936) was the first to describe it.

He was a Russian scientist and in fact wasn’t searching for any behavioural principles when he discovered and revealed the process of acquiring signals. This process is now known as Classical or Pavlovian conditioning. Pavlov was doing some rather gory experiments on a dog with a glass plate sewn over a large hole cut right through its ribcage to its stomach so he could see the digestive process first hand. He gave the dog a meat extract so he could see what goes on in the animal’s stomach during salivation and digestion. During the course of the experiments however, he soon noticed that the dog began to salivate earlier on in each session – the dog was beginning to ‘anticipate’ the food.

Now most of us would be happy to explain the dog’s anticipation as simply that, but Pavlov was determined to delve deeper. He wanted to know what ‘anticipation’ actually was – to uncover the mechanistic principle behind anticipation. In humans and perhaps in some animals as well it can involve some kind of mental visualisation of the anticipated event, but is this always the case? And does it need to be so? Pavlov found that there were some strict rules that govern the way an animal learns to associate a previously neutral stimulus with a particular response. He found that the timing of when the light signal appears in relation to the already known stimulus is critical.

TIMING OF ASSOCIATIONS

Pavlov showed that new cues must be given just before or during an inborn or previously learned signal or aid. For example, if the rider wishes to train the voice aid “whoa”, the word should occur just before and overlapping the application of the rein signal for ‘stop’. The further the word occurs after the rein aid, the less the horse learns the voice aid. The rider must also remember that as the horse does not actually understand what “whoa” means, but rather learns it as a cue, it must always be delivered with the same tone and pitch. Shouting “whoa” won’t make the horse stop more quickly, it is only the learned version of the voice aid that is meaningful to the horse.

In dressage voice aids are not used in competition. Instead, as the horse learns to respond from lighter rein and/or leg aids, the horse becomes increasingly aware of the associations of seat and weight that naturally occur with rein and/or leg aids if the rider’s position is correct. These responses are learned through classical conditioning as the seat and weight aids occur just before and during the rein and /or leg aids. Similarly during training the horse to lunge, the horse learns to respond to voice aids. However the fact that the trainer needs to keep the lunge whip in his hand is testimony to the fact that classical conditioning is but a thin veneer in training. Voice, seat, weight and position associations are easily forgotten and need reminding with the primary aids of rein and/or leg and in the case of lunging, the whip.

Pavlov’s work suggested that Classical conditioning doesn’t need any kind of visualisation or comprehension in the animal of an anticipated event in order to work – it simply needs the correct order of presentation. That’s why Classical conditioning can be seen in worms and houseflies as well as horses and humans. If the sound of running water makes you want to go to the toilet, that’s classical conditioning. As you respond to traffic lights and other road signals yet you aren’t consciously aware of what you are doing, you just do it – that’s Classical conditioning. The life of all animals is full of associations that have formed and will form between various signals and events. Acquiring signals that predict events makes life predictable and controllable for animals. It evolved to improve the efficiency of the animal’s interaction with its environment. Any environmental event that coincides with a known signal will quickly become incorporated as a new cue. So, the sound of rustling bushes which precedes the appearance of a predator – and triggers the flight response in the horse – quickly becomes a cue for running away.

PREDICTABILITY

Horse training relates to this notion of predictability in a very definite way. The reason we want to place all of the horse’s trained responses under the control of light aids isn’t just for our convenience and laziness – it is for the horse’s mental well being. The horse needs unobtrusive, pain free signals for all its movements in hand and under saddle. Good coaches and trainers have long known the importance of lightness of the aids as well as the fact that if the horse needs stronger pressure to motivate him to do something, then these pressures must always be preceded by light aids. But it was the writings of a man called Piet Wiepkema, a Dutch cognitive scientist, which first brought me to understand the relationship between an animal’s well-being and the nature and consistency of the signals that it encounters in its life.

Take a moment to contemplate this – any animal’s existence involves giving and receiving signals and/or ‘pressures’ from its environment. The signals/’pressures’ it issues to its environment in order to procure benefits for itself are either inborn ones or learned ones, and the ones it receives from its environment are either from the physical or the behavioural world (including signals from its rider or handler). Animals have evolved the ability to offer and learn to respond to mild unobtrusive signals so that they don’t have to endure a life of painful or unpredictable events such as the sudden attack by another horse during a squabble over resources such as food or mates as well as to predict predatory attacks. For example, horses soon learn that before another horse attacks, it lays its ears back.

CONSISTENCY

Animals are thus able to learn signals that surround all events that not only predict nasty things but also nice ones too, such as the arrival of the person carrying a bucket – it means food. Wiepkema showed that how often a particular signal consistently predicts a particular response is directly proportional to the amount stress in an animal. If the signal always leads to the same response, the animal is relaxed in its response to the signal. Think of your own life. What makes you calm (or not) as an adult is that you have (or have not) found ways, generally using language, to control your response to others and to control the behaviour of others. All organisms need to be able to make their behavioural world predictable. The less predictable and controllable their world is, the more stress and tension they show.

In horse training, predictability comes through the horse learning to respond in the same way from a light aid for each response (stop, go, turn and leg-yield). Responding in the same way means responding: immediately to the light aid, in a self-maintained rhythm and tempo, with self-maintained straightness, with self-maintained contact and outline, with impulsion, with all of the above properties everywhere and every time.

So when you look at the above qualities of the rein and leg aids, you will appreciate that each response requires the development of many properties. These properties need to be trained one by one as you will see in a later article.

This is why correct horse training has always focussed on producing a consistent set of responses each time an aid is given rather than a random assortment of various incorrect responses. The German training scale is the best known of mankind’s attempts to train consistency of outcome in horse training. What is not well known in any equitation discipline is that problem horses are a result of defects in consistency of outcome from the aids. Instead, horsepeople describe a horse’s training in terms of its ‘will to please’ rather than its reaction to the aids. Horses are frequently described as ‘naughty’, ‘dirty’, ‘dumb’ or ‘hot’ rather than using terms that describe what the horse’s legs do or do not do in response to the aids.

SIGNAL PRIORITY

It is important in horse training to recognise that there is a priority in training signals. At the very earliest stages in horse training, the horse learns to respond from pressures, such as lead rein pressure for leading, and under saddle pressure from both legs means go forward while pressure from the reins means slow. However, good trainers ensure that at the very beginning of each rein or leg pressure, that there is a light version of that particular pressure. This has been known for centuries and is described at length in classical training literature. This light aid therefore is the very first cue that the horse learns in-hand and under-saddle for go, stop, turn and leg-yield. The horse learns through Pavlov’s principle, (Classical conditioning) to respond from those light versions of the pressures.

Furthermore, during this training the horse also learns, again through Pavlov’s principle to respond from associated seat, weight and position cues because they occur just before and during the light rein and leg aids. Horses learn these aids readily and unfortunately sometimes well before the horse has thoroughly learned about the pressures that actually enforce responses. Therefore, learning to respond from pressures and light aids should always precede any reliance on seat and position cues. Too many people rely on seat and position aids and forget to either establish or maintain the underlying foundations of rein and leg signals

. When the response to the seat and position aids begin to fail or even take too long to work and the horse no longer responds as he should, he soon forgets these aids entirely, as Pavlov predicted. Pavlov found that a conditioned response will be repressed if the stimulus proves “wrong” too often. If the seat does not produce a reliable response, the horse will stop responding to it. The same is true for the leg and rein aids – if they don’t work, the stronger motivating pressure should follow. It’s as if the horse is saying: “Please don’t use a force 6 pressure, I’ll do it from the light aid.’’ Using the right amount of pressure is a vital skill in horse training – not too much and not too little. Problems also occur if the rider maintains the mild pressure of the light aid when the horse has already responded. The horse desensitises to the light aid.

LOSSES OF PREDICTABILITY

However the problem with the horse that has become desensitised to the aids for whatever reason is not just that he loses his response to light signals. There is a big price that is often paid for this and it is called conflict behaviour. Conflict behaviour incudes flight response behaviours (i.e. fast ones) such as shying, bolting, bucking, rearing and leaping. It also accompanies associated health and welfare issues that include worsening colic attacks, immune suppression, hormonal changes and poor and ‘stringy’ body condition. Conflict behaviour arises from the stress that occurs due to losses, from the horse’s viewpoint, of predictability and controllability of its behavioural world. The horse is trying to run away from the stressful situation.

Conflict behaviour may also arise when the trainer does not regularly target a consistent set of responses from an aid. When all of the properties of each response (rhythm, straightness etc) are automatic from each light aid, the horse becomes relaxed in its body because the aid predicts a precise response. The horse’s world is now predictable. The horse’s general life is calmer too – things like separation anxiety disappear as well as other nervous tendencies. This is because, unlike before when responses were more random and less precise, the horse is now able to ‘read’ humans. More than anything else it is responsible for establishing what are known as rapport and trust between horses and humans. The horse is no longer insecure and whinnying for its mates shouting: “Help, I can’t read humans, I can only read horses, is there anybody out there?” Think about this. Does a small squeeze of the legs result in the right response but a bigger kick result in the opposite response, i.e. slowing or showing ‘piggy’ behaviour?

Obedient, well trained horses are like obedient, well trained dogs – they just don’t call out or become controlled by their environment. They are ‘on the aids’. Of course young horses going out for the first few times can be expected to be more nervous, but after five or so outings they should become unfazed by new surroundings if they are on the aids and their responses are consistent.

POSITIVE REINFORCEMENT

There are many associations that we train into the horse’s repertoire of signals. One of the most important is the acquisition of verbal praise such as ‘Good Boy’ for reward. Few people consider how the horse acquires this as a positive reinforcer, and consequently few horses adequately respond to it. We assume the horse knows what we mean, as if the horse has some kind of pre-programmed English vocab in its head. Because verbal commands have to be learned, they are called secondary reinforcers. To be effectively learned they have to be associated with a primary reinforcer, such as food or scratching/caressing the horse at the base of the withers (a proven site where French researchers showed lowered heart rate more than any other site). Because the base of the withers is so close to the hands of the rider, scratching/caressing is the most useful primary reinforcer. To train the horse to respond to ‘Good Boy’, the words should occur just before and during the scratching. The words and tone should be the same each time. Soon the words come to evoke the relaxation that results from scratching at the base of the withers. The words should be re-associated with the wither scratching from time to time.

PRECISE TIMING

The light aid should be attached to the stronger aid that comes after it. It should not be isolated by a gap in time before the stronger aid. In horse training, the time gap between the light aid and the stronger pressure should be the same as the time gap in between footfalls in the rhythm of the gait – all responses should have occurred by the count of 3. In other words the stronger pressure comes very swiftly after the light aid. Training horses effectively therefore requires skill and speed in decision making and action. In a fraction of a second you have to decide if the horse responded adequately to the light aid, and if not back it up with stronger pressure which is subsequently released the instant the horse gives the correct response. This way the horse optimally learns the light aid, and it rapidly evokes the correct response without any increase in pressure of the aid. This is the aim of all horse training – to transform the pressure-release training rapidly to lighter versions of the pressure, and then later to transform those aids into seat, weight and position aids. Through careful repetitive training, the horse eventually ‘rote learns’ his responses to the light aids so that he can avoid stronger pressures altogether.

Next the Exclusivity Principle, which is about the importance of keeping aids and their responses unique – and what happens when they are not…

Originally published in the September 2004 edition of The Horse Magazine