Training a dog can be a stressful experience for the dog. Ultimately, it’s a guessing game from their standpoint. You know what you want them to do, but they don’t. And you can’t just sit down and explain it to the dog. No one likes to be wrong; that’s stressful! Not knowing what is being asked of you is stressful too. We have lots of training methods to get the message across to our dogs. But they can all be more or less stressful depending on how we work with our dog to teach them.
Learning something new involves stress. If we are comfortable and confident, that stress manifests as the excitement of a new challenge. If we are not so confident, that stress can be less pleasant. When you are training your dog, she will either get it right or she will get it wrong. As a trainer, how do you respond to her efforts? Well, that depends on the trainer. It depends on what they know about teaching dogs. It depends on how easily they get frustrated. It depends on how committed they are to getting things their way. It depends on how far they will go to get results. In short, the results of being right or wrong can be anything from barely noticeable to extreme whether they are good or bad.
Facts, figures, and results
For decades, the “best” dog training was defined by rigorous training based on “discipline” and forceful methods used to correct the dog’s errors when they got the exercise wrong. Trainers like Blanche Saunders and William Koehler popularized training methods that involved physical consequences for the dog when they were wrong. Yanking on a leash attached to a choke chain collar (also called “popping” the leash) served to let the dog know that they had given an incorrect response. Some of these methods also advocated verbal praise as a response when the dog gets it right. More modern methods have gone further to suggest that rewards like food or play should be offered for correct performance.
Regardless of whether punishments like leash yanks or rewards like food are used, dog training is generally judged by results. How fast did the dog learn what they were supposed to do? How quickly do they offer the behaviour after hearing/seeing the cue for it? Will the dog perform as required under a variety of different or difficult situation? Today the debate continues as to whether rewards, punishment, or some combination of the two will be the most effective in getting good training results.
I have seen a lot of talk lately about the necessity of letting a dog know when they get it wrong so that good learning can occur. Advocates for using punishments or markers for letting the dog know they were wrong often cite studies like this one which purport to show that behaviours can be learned more efficiently with a combination of punishing incorrect responses and rewarding correct responses. The difficulty I have with such studies is that they show results in the isolation of only the behaviours used for that study. We live with our dogs for years. What is the effect of all of telling a dog they are wrong on the long term willingness of our dogs to continue working with us?
Comparing apples to growing an orchard
While it is a very human thing to try to quantify progress by measuring things, conclusions about learning that are based on the isolated results of a single behaviour or learning instance don’t tell the whole story. This study on the differential effects of rewards and punishment on learning in a larger context paints a very different picture. The study measured response times, error rates, and resilience to variation in presented cues to quantify the implicit learning of various behaviour sequences. The researchers report their findings as follows:
“The behavioral experiment, therefore, showed that only reward but not punishment enhances the implicit learning of sequences. This finding is at odds with previous studies of reward and punishment on learning, which, however, were generally confined to associative-learning tasks.”
So, while punishments used to indicate errors may increase the efficiency of learning an individual task, subsequent performance of that task under different conditions will actually suffer both in response time and reliability over subjects who were trained using only rewards. That is a significant finding. It is significant because we do not live with our dogs and function in an isolated series of individual moments. We have a continuous relationship with them and we expect the behaviours we teach to continue over the variety of places and situations in our life with our dog.
Is there much value in measuring our success based only on how quickly we can get our dog to comply with our commands? I think it is far better to measure success by the long term durability of not just behaviours trained but the overall willingness of our dogs to engage in the learning process. That is a better indicator of their ability to learn and adapt. It may even be an indicator of their overall intelligence. It appears from the available research that how we go about teaching our dogs can have a tremendous impact on their long term behaviour and relationship with us.
How do you respond to “wrong”?
From an Operant Conditioning perspective, there are only a few responses that are possible when our dog “gets it wrong” during training.
- Do Nothing -Essentially we provide no response to an incorrect attempt. All the dog knows is that they were not rewarded for that response. We attempt to “extinguish” the incorrect response or make it not worth repeating.
- No Reward Marker – We provide some indicator that the response was wrong. “Oops!’, “Good Try!”, “Nope”, or some other indicator that the dog understands as meaning that no reward will be coming. We deny them any reward for their effort. This is a mild punishment.
- Time Out – We end the training session with the dog. They messed up so they are punished by ending any opportunity for further rewards.
- Apply Pressure – We can begin doing something the dog doesn’t enjoy like tugging on the leash or moving into their space. If the dog offers the correct response, we immediately stop the uncomfortable stimulus. This is Negative Reinforcement in that the dog is encouraged to do the behaviour that “shut off” the unpleasant feeling.
- Punish The Error – We can give the dog a “correction.” This could be a verbal or physical punishment to indicate that the response they gave was unacceptable. The dog may learn to do the required behaviour to avoid the punishment next time.
It is inevitable that a dog will offer an incorrect response at some point in any training process. For my own training, I try to choose how I will respond to any errors based on how it will affect my dog’s willingness to engage in further training rather than finding the shortest path to the behaviour I want. Can I make it clear that the response was not what I was looking for and not discourage my dog from trying again?
A novel approach
Animal trainer Ken Ramirez offered a wonderful solution for handling errors in training at a recent seminar I attended. It’s called a “Least Reinforcing Stimulus/Scenario” (LRS) and was introduced in 1991 by Sea World head trainer Mike Scarpuzzi. The LRS is actually a hybrid of the “Do Nothing” response listed above and an invitation to “try again” by offering another opportunity to earn a reward.
The LRS procedure is remarkably simple. Over the course of a training session, my dog will make several attempts at the correct behaviour. When my dog offers a correct response, I use a marker signal and offer a reward (usually a food treat). If my dog offers an incorrect response, I go completely neutral offering no marker, no reward, no punishment, no response for a brief period(3 seconds). After this pause, I re-engage her as before and offer another chance to try the behaviour.
There are several advantages to the LRS. First, it is brief and therefore not overly disruptive to the training process. Second, it gives my dog a few seconds to realize that the last attempt was not productive in terms of earning a reward. Third, it offers an immediate opportunity to re-try for the reward. Lastly, and perhaps most importantly, it is not overly aversive so it does not discourage any future efforts to discover the correct response. The new opportunity to earn a reward overshadows the previous unproductive response.
As a dog trainer, I have always believed that keeping the dog engaged and willing to play the learning game (training) is the most important aspect of what we do in dog training. Without our dogs eager and enthusiastic participation, training becomes drudge work and neither of us enjoy it. If instead we can keep things light and fun, training becomes a game where being “right” is great because of the rewards and being “wrong” is no big deal because there is always another chance to get it “right.”
While using aversives or “corrections” to get the results you want in the short term may seem like a productive way to go, the science of behaviour shows that making errors a non-event provides better long term engagement with your dog. Making the training experience as non-threatening and enjoyable as possible means you get to enjoy a lifetime of training with your dog. And isn’t that what it’s all about?
Until next time, have fun with your dogs!
Be sure to check out our Canine Nation ebooks in the Canine Nation store and join our conversation on Facebook in the Canine Nation Forum!
The first Canine Nation ebooks are now available –
“Dogs: As They Are” & “Teaching Dogs: Effective Learning”
Photo credits –
Worried – Philippe Gilotte 2011 from Flickr
Wrong – bullcitydogs 2009 from Flickr
Smile – K J Payne 2012 from Flickr
good article and nice to see a thorough response to a commenter who differs…. 🙂
Thanks for the comment Janice! Glad you enjoyed the article.
Eric
I suspect that there is not a single right answer for all training situations (or dogs). For instance:
Do Nothing — Dog is learning a new behavior. Everything that doesn’t approximate new behavior is ignored rather than rewarded.
No Reward Marker — Dog is learning a behavior chain. A no reward marker lets them know where they’ve gone astray. For instance, I’m teaching my dog to pivot independently around a stool with her front paws on and her back paws on the floor. After a few steps she starts to bring a back paw up onto the stool. A no reward marker lets her know that all paws up is not what I’m looking for.
Time Out — In a working session the dog decides to wander off and sniff. A time out becomes a reminder that work is a privilege which you can lose by not trying.
Pressure — Dog is learning a sit stay and begins to get up. Pressure of moving toward the dog reminds them to stay in place.
Punish the Error — Dog chooses to do something self-reinforcing and unacceptable, like running off and refusing to come when called when they know the command. A tug on a long line says that this dangerous behavior will not be allowed.
Much of dog training, it seems to me, is figuring out how to respond in the best way in the particular moment, bearing in mind that the goal is always to develop a working relationship that is a pleasure for both the dog and the trainer.
Hi Lynn –
I agree that there is no one single right answer for every training situation and I would go further and say that each dog will have their own limits as well. The variety of possible responses just provides a spectrum to choose from and I believe that they should be employed with full knowledge of the possible outcomes.
The analysis you offer in your comment of these possible responses is interesting. They seem to make some assumptions that I’m not willing to make. For example, in talking about the No Reward Marker you say, “A no reward marker lets her know that all paws up is not what I’m looking for.” Well, I’m sure you INTEND for your dog to take that idea away from it but is that all she is learning? Having done some experiments with my dog years ago on No Reward Markers (NRMs), I learned that a dog can be annoyed by No Reward Markers. Our training sessions became shorter and shorter over a period of weeks as my dog grew tired of being marked for incorrect behaviour. I stopped using them and the duration of our sessions almost immediately went up again. My dog was willing to work longer for me if I stopped nagging her with NRMs.
Under Time Out you offer that “work is a privilege” for a dog. What do you base that on? Why should a dog feel privileged to do things because they are asked to do them? If, instead, you are saying that the opportunity to earn material rewards (e.g., food, play, etc.) is a privilege then we could probably agree. In my experience, a Time Out is a welcome break for a dog that isn’t enjoying training so it could be that the use of a Time Out could actually reward the dog for wandering off and sniffing!
You suggest that Pressure would remind a dog that has been asked to “Stay” to stay in place. That may be true. It may assist you in getting the behaviour. But you may also have taught the dog that you are willing to physically intimidate them if they do not comply with what they were told to do. Is the the best thing for your relationship with your dog? That would be a matter for each owner to decide for themselves.
With regard to your example of Punishing an Error with a dog on a long line, I would suggest that over time the dog will learn when the line is and is not attached. So yes, you will have an effective tool so long as the line is connected. When it is not you are depending on the ignorance of the animal that the line is gone to do the job.
I will leave the whole issue of “self-reinforcing” for another day as I believe that it is a fallacy that trainers rely on to excuse the fact that they can’t find ways to be more reinforcing than the environment that they find themselves in.
As always, thanks for reading and sharing your comments!
Eric
I have to agree, Eric. I personally find it very difficult and annoying when I am trying to learn something and someone is behind me saying, “No”, “That’s not it”, “Not that way”. I really just want to hand over whatever it is to them and let them do it instead. I have inadvertently done this in training with as little as a laugh…that laugh indicated to my dog that he had obviously done something wrong (not right?) and you could actually see him deflate. I did not intend to use a NRM, but I did. He doesn’t like it. Yes, I laugh often when things are good, but there is obviously some subtle difference to my dog that distinguishes one from the other.