Posted
Nov 2010
Reward Now, Punish Later
Punishment alone may not change social norms
Based on the research of Florian Herold
Over the years, economic theory has generally held that self-interest predominantly governs the behavior of individuals in small groups. However, recent experiments have suggested a situation less cut and dried: some experimental subjects, they indicate, are willing to reward good behavior by their peers and to punish bad behavior, even at some cost to themselves. Now Florian Herold, an assistant professor of managerial economics and decision sciences at the Kellogg School of Management, has developed a theoretical model that clarifies those findings. The model, he says, explains why rewarding and punishing behaviors can not only coexist with selfish behavior, but also induce cooperative behavior in small groups.
To illustrate his findings, Herold suggests two ways to fulfill the recommendation of many doctors that people sneeze into their elbows rather than into their hands. Here the carrot and the stick—rewarding and punishing behaviors—can work in concert. “Do you punish everybody who does the wrong thing by sneezing into the hand or reward those who use their elbows?” he asks. “Punishment is not viable in the beginning because you’d be punishing almost everybody. Rewarding works at that time, as few people will be getting rewarded in the beginning. But at some point punishment takes over, as by then few people will be doing the wrong thing, and hence few will be punished.” That shows “that it’s much easier to change an established behavior by offering rewards, rather than threatening with punishments,” he adds. “But once you have established a norm, sustaining it by the threat of punishment is cheap. Only a few people will violate the norm, so you will rarely have to follow through with your threat.”
As that example indicates, while both the carrot and the stick play strong roles in influencing changes in a group’s actions, they operate in different ways. Herold makes the point more formally in a paper on his research. “Rewarders,” the paper states, “enhance the evolution of preferences for punishing.” That means, Herold explains, that “reward may help to get you to a situation in which you can establish punishment.” The model also leads to a conclusion that echoes the much-quoted comment of baseball manager Leo Durocher: “Nice guys finish last.” Says Herold, “People who are always nice have a hard time in my model.”
Three Settings of a Simple Game
Herold has developed his theoretical model to explain experimental observations by other researchers. A specialist in game theory, he bases the model on a formal mathematical treatment of a simple game played by “haystack groups”—small collections of individuals familiar with each other. He applies the treatment to three versions of the game.
Reward may help to establish punishment.
The first version includes rewards offered for cooperation. Individual “proposers” can choose to cooperate, and hence become eligible to receive a reward, or refuse cooperation; the other group, called “seconders,” can present a reward for cooperation or choose not to do so. “The proposer will cooperate if he’s confident that he’ll be rewarded,” Herold says.
In the second game only punishments are available, but the same conditions apply: Proposers can choose to cooperate and hence avoid punishment, or not do so and thereby face the possibility of punishment, while seconders can choose to punish noncooperators or not do so. “Punishment hurts both parties,” Herold says. “If the proposer believes he’ll be punished, he’ll want to cooperate. And if a lot of people punish, in the end everyone will. So you establish a norm in which everybody cooperates because they fear punishment, and you get very little punishment. But another equilibrium involves defection by everybody, and ultimately nobody is punished.”
The third game includes both punishment and reward options. “The model shows that the rewarding option is important to get you out of the situation in which everybody will defect because nobody punishes,” Herold explains. “Rewarders can go in fairly cheaply; they have to do it only when they get cooperation. Once enough rewarders are in most groups, the general environment will become one of cooperation; then punishment becomes cheaper.”
The Key Insight
Herold’s approach, which used mathematical methods from evolutionary biology, is not entirely new. “A couple of economists have previously applied this idea to the evolution of strategies or the evolution of preferences,” Herold recalls. However, he has expanded the field by applying the concepts to a specific setting. What’s new about the research project, he says, “is that it looks at both reward and punishment behavior.” And that, he continues, has provided “the key insight that we can try to exploit the fact that, if you want to change a norm, it’s cheaper to start doing so by rewarding people.”
The research originated in Herold’s interest in the foundations of behavioral economics, which he describes as “one of the interesting developments of the last decade; along with other people, I find it interesting to find additional constraints on behavior.” The project represented an effort to discover whether nonselfish preferences can survive. More broadly, the research “is really about what evolutionary forces are in play to understand how reward and punishment at this level interact,” Herold says. He adds that the term “evolutionary forces” has a different connotation for economists than for life scientists. “It can mean simply successful preferences in certain markets—specifically preferences that are not all selfish,” he says.
Can policy makers and corporate leaders apply the principles revealed by the model when they want to make behavioral changes to their organizations? “I’m careful not to rush to conclusions,” Herold says. “And you want to consider other things when deciding to apply the carrot or the stick. But it’s useful to consider these basic assumptions to change a culture: You start out by rewarding people who move to the culture you want to establish. Then once you have it, you start to punish those who haven’t made the change.” The key to success is timing the transition from reward to punishment. “You have to think in the long run about changing from carrot to stick,” Herold continues. “You have to consider when the carrot is more expensive or the stick is more expensive.”
Related reading on Kellogg Insight:
Healthy Choices: Do people prefer the carrot or the stick?




4 Comments
Nov 17 2010
Hello sir….
I am doing my MBA from Western International University, Phoenix, AR.
I just read the amazing article.
I have a small query about the effects of it. If you say that people would start fearing punishments and thats the reason they start co-operating….is it ethical to induce fear into them indirectly and make them work ? and would they give their best at work under such conditions ? What if they start to react violently (especially in Manufacturing firms if not Service sector) ?
And about rewarding…..would the senior members of the team let go the rewards to their juniors ? They would definitely find a way to make their juniors do mistakes and get punished….so they dont get any rewards, and they dont cooperate, so they are also not rewarded ultimately !
I really apologize if my way of thinking is not correct.
Nov 17 2010
The word “fear” may be a little strong for what I have in mind. It"s more like being aware that if you do not do your fair share in a relationship, i.e., you do not co-operate, the other side will reciprocate with something that has negative consequence to you, i.e., they will punish you. You have some forms of explicit or implicit punishment in almost any society or organization. If you do not follow the law the police or a court will punish you. If you do not show up for work, you will not receive your salary and if you do not contribute to your team’s efforts you are less likely to be promoted. I agree that being punished may lead to negative reactions (some may even start to react violently as you suggest). That is precisely why I think executing a punishment is costly not only for the one who is punished, but also for the one who executes the punishment. Sometimes even the announcement of a punishment for certain behavior may be harmful if it undermines trust in a relationship. (My paper “Contractual Incompleteness as a Signal of Trust” is about this effect.) You are right that in practice there are many subtleties you have to worry about. The key insight however is that if one norm of behavior is not yet well established trying to enforce it by punishments can be very harmful to all people involved and may not be viable. Yet, once a norm of appropriate behavior (i.e. co-operation) is well established and expected from everybody, punishing the few free-riders who try to shirk by not following this generally accepted norm of behavior typically helps to keep the norm in place at relatively low costs.
Nov 17 2010
Here is the full citation and link to the paper Professor Herold mentioned in his comment:
Herold, Florian. 2010. Contractual incompleteness as a signal of trust. Games and Economic Behavior, January, 68(1): 180-191.
Gated link (individual or institutional subscription necessary to access the full text):
http://dx.doi.org/10.1016/j.geb.2009.05.001
Jan 10 2011
The carrott could be having health insurance- the stick could be having medical expenses that are not paid.
The question is who wants the security? Persons in their late teens and twenties tend to be risk takers. The carrott is not the same for everyone. More often then not they change their minds only when one of their friends gets in a car accident and is seriously injured.
Persons on medicare pay about 25% of their pension in health insurance premiums and medical expenses. If they select to opt out of coverage B and D they can be bankrupt in 2 months. Then they will qualify for medicade and will have reasonalble health coverage.
A young man that did not have health insurance and incurred a medical condition that cost tens of thousands to cure had the expense paid for by contributions from friends and his community. He was absolutely against mandatory health insurance. If he became sick again he assumed that chairity would take care of min again. The carrott and stick argument is not the same in his mind as in mine.
The stick might be that if you don’t have insurance medical providers do not have to treat you. This would include life threatening situations. If insurance coverage did not include the treatment that a person wants the person could pay for it or go without.
Health insurance is simply a method of financing medical costs. It separates the buyer and seller from the cost discussion. Insurance companies and government organizatoins have negotiated rates that are to the patient’s advantage. This becomes very evident in medical suits against uninsured persons. The spread between negotiated medical rates is frequently 40-50%. Mandated coverage adds to the medical cost. Grade schools require a number of medical tests and hand the bill to the parent. The parent wants to hand this cost off to the insurance company. These mandates continue to add to the cost of insurance. The cost of insurance is transferred to the employer.
This complex financing system simply confuses people as to what is a carrott and what is a stick. Throughout time sticks have been added with frequency. There are deductibles that are called co-pays that don’t count as deductibles. Then there are deductibles that do count as dedcutibles. Add coinsurance and stop loss to the mix and most people have no idea what their benefit is. There are nets at one level and out of nets at another. Then there are the medical providers that work in network facilities, but are non network.
Over time too many carrotts and too many sticks have been added to an already complicated medical reimbursement system.