Addiction: Decreased Reward Sensitivity and Increased Expectation Sensitivity Conspire to Overwhelm the Brain's Control Circuit (2010)
COMMENTS: A review by the head of the National Institute on Drug Abuse, Nora Volkow, and her team. This review lists the 3 major neurobiological dysfunctions involved in all addictions. Simply stated they are: a) Desensitization: a numbed pleasure response due to a decline in dopamine signaling; b) Sensitization: enhanced dopamine response to addiction cues, triggers or stress; and c) Hypofrontality: weakened self-control circuits due to a decline in the volume and functioning of the frontal cortex. These same brain changes were described by the American Society of Addiction Medicine (ASAM) in its new definition of addiction released in August, 2011.
Volkow ND, Wang GJ, Fowler JS, Tomasi D, Telang F, Baler R. Bioessays. 2010 Sep;32(9):748-55.
National Institute on Drug Abuse, NIH, Bethesda, MD 20892, USA.
Based on brain imaging findings, we present a model according to which addiction emerges as an imbalance in the information processing and integration among various brain circuits and functions.
The dysfunctions reflect:
(a) decreased sensitivity of reward circuits,
(b) enhanced sensitivity of memory circuits to conditioned expectations to drugs and drug cues, stress reactivity, and negative mood, and
(c) a weakened control circuit.
Although initial experimentation with a drug of abuse is largely a voluntary behavior, continued drug use can eventually impair neuronal circuits in the brain that are involved in free will, turning drug use into an automatic compulsive behavior. The ability of addictive drugs to co-opt neurotransmitter signals between neurons (including dopamine, glutamate, and GABA) modifies the function of different neuronal circuits, which begin to falter at different stages of an addiction trajectory. Upon exposure to the drug, drug cues, or stress, this results in unrestrained hyperactivation of the motivation/drive circuit, which produces the compulsive drug intake that characterizes addiction.
The last 25 years of neuroscience research have produced evidence that addiction is a disease of the brain, providing a powerful argument for upholding the same standards of medical care to the addicted individual as those that are common to other diseases with a major public impact, like diabetes. Indeed, research on addiction has started to uncover the sequence of events and long-lasting sequelae that can result from the persistent abuse of an addictive substance. These studies have shown how repeated drug use can target key molecules and brain circuits, and eventually disrupt the higher order processes that underlie emotions, cognition and behavior. We have learned that addiction is characterized by an expanding cycle of dysfunction in the brain. The impairment typically begins in the evolutionarily more primitive areas of the brain that process reward, and then moves on to other areas responsible for more complex cognitive functions. Thus, in addition to reward, addicted individuals can experience severe disruptions in learning (memory, conditioning, habituation), executive function (impulse inhibition, decision making, delayed gratification), cognitive awareness (interoception) and even emotional (mood and stress reactivity) functions.
Drawing largely from the results of brain imaging studies that used positron emission tomography (PET), we introduce the key brain circuits that are affected by the chronic abuse of drugs and then present a coherent model, according to which addiction emerges as the net result of imbalanced information processing in and among these circuits. A thorough understanding of these gradual adaptive (neuroplastic) brain processes, and of the biological and environmental vulnerability factors that influence their likelihood, is critical for the development of more effective prevention and treatment approaches to combat addiction.
High, but brief, bursts of dopamine are required for addiction
Addiction is, first and foremost, a disease of the brain's reward system. This system uses the neurotransmitter dopamine (DA) as its major currency to relay information. Brain DA plays a key role in the processing of information about saliency [1, 2], which is at the heart of its ability to regulate or influence reward [3, 4], reward expectation, motivation, emotions, and the feelings of pleasure. Transient release of DA in the brain's ventral striatum is a necessary, albeit not sufficient, event in the complex processes that engender the sensation of reward: the increase in DA appears to be positively related to the intensity of the “high” that subjects experience. Conditioned responses are only elicited when DA is repeatedly released as sharp, transient surges in response to drugs or drug-associated cues.
Interestingly, directly or indirectly, all addictive drugs work by triggering exaggerated but transient increases in extracellular DA in a key region of the reward (limbic) system [6, 7], specifically, in the nucleus accumbens (Nac) located in the ventral striatum. Such DA surges resemble, and in some instances greatly surpass, the physiological increases triggered by naturally pleasurable stimuli (usually referred to as natural reinforcers or rewards). As we would have expected, human brain imaging studies using positron emission tomography (PET) have clearly shown that the DA increases induced by different classes of drugs (e.g. stimulants (Fig. 1A) [8, 9], nicotine, and alcohol) within the ventral striatum are linked to the subjective experience of euphoria (or high) during intoxication [12, 13, 14]. Since PET studies can be done in awake human subjects, it is also possible to plot the relationship between the subjective reports of drug effects and the relative changes in DA levels. Most studies have reported that those displaying the greatest DA increases following drug exposures [amphetamine, nicotine, alcohol, methylphenidate (MPH)] also report the most intense high or euphoria (Fig. 1B).
Animal and human studies have demonstrated that the speed with which a drug enters, acts upon, and leaves the brain (i.e. its pharmacokinetic profile) plays a fundamental role in determining its reinforcing effects. Indeed, every drug of abuse whose brain pharmacokinetics have been measured with PET (cocaine, MPH, methamphetamine, and nicotine) exhibits the same profile when the administration is intravenous, i.e., peak levels in the human brain are reached within 10 min (Fig. 2A), and this fast uptake is associated with the “high” (Fig. 2B). Based on this association, it follows that making sure that an addictive drug enters the brain as slowly as possible should be an effective way of minimizing its reinforcing potential, hence its abuse liability. We designed an experiment to test precisely this hypothesis with the stimulant drug MPH, which, like cocaine, increases DA by slowing down its transport back into the presynaptic neuron (i.e. by blocking DA transporters), thus magnifying the DA signal. Indeed, we found that, while intravenous administration of MPH is often euphorigenic, orally administered MPH, which also increases DA in the striatum, but with 6- to 12-fold slower pharmacokinetics, is not typically perceived as reinforcing [16, 17]. Thus, the failure of oral MPH (or amphetamine, for that matter) to induce a high is likely the reflection of their slow uptake into the brain. Therefore, it is reasonable to propose the existence of a close correlation between the rate at which a drug of abuse enters the brain, which determines the speed at which DA increases in the ventral striatum, and its reinforcing effects [20, 21, 22]. In other words, for a drug to exert reinforcing effects it has to raise DA abruptly. Why should this be so?
Based on the magnitude and duration of neuronal firing, DA signaling can take one of two basic forms: phasic or tonic. Phasic signaling is characterized by high amplitude and short burst firing, whereas tonic signaling has typically low amplitude and a more protracted or sustained time course. The distinction is important because it turns out that phasic DA signaling is necessary for drugs of abuse to induce “conditioned responses,” which are among the initial neuroadaptations that follow exposure to reinforcing stimuli (including a drug). One of the distinguishing aspects that links phasic signaling with conditioning is the involvement of D2R and glutamate N-methyl-D-aspartic acid (NMDA) receptors. On the other hand, tonic DA signaling plays a role in the modulation of working memory and other executive processes. One feature that distinguishes this mode of signaling from the phasic type is that it operates mostly through lower affinity DA receptors (DA D1 receptors). However, and in spite of the different mechanisms involved, protracted drug exposure (and changes in tonic DA signaling through these receptors) has also been implicated in the neuroplastic changes that ultimately result in conditioning through the modification of NMDA and alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionate (AMPA) glutamate receptors.
The evidence indicates that abrupt drug-induced increases in DA mimic phasic DA cell firing. This helps explain why the chronic use of an addictive substance can engender such powerful conditioned responses to the drug itself, its expectation, and myriad cues (people, things and places) associated with its use. However, while the acute reinforcing effects of drugs of abuse that depend on such fast DA increases are likely “necessary” for the development of addiction, they are clearly not “sufficient.” Repeated drug exposure causes changes in DA brain function that take time to develop because they result from secondary neuroadaptations in other neurotransmitter systems (e.g. glutamate and perhaps also γ-aminobutyric acid (GABA)) that, eventually, affect additional brain circuits modulated by DA. These circuits are the focus of the next sections.
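The pharmacokinetic argument made earlier in this section (faster brain uptake, steeper DA rise, stronger reinforcement) can be illustrated with a toy model. The sketch below is not the authors' analysis: it assumes a simple one-compartment model with first-order uptake and clearance, and every rate constant and function name in it is a hypothetical placeholder chosen only so that the fast route peaks within 10 min and the slow route is tenfold slower, within the 6- to 12-fold range quoted above.

```python
import math

# Toy one-compartment model with first-order uptake and clearance.
# All rate constants are hypothetical placeholders, not values from the review.

def brain_level(t_min, k_in, k_out, dose=1.0):
    """Relative brain drug level at time t_min (Bateman function)."""
    if math.isclose(k_in, k_out):
        # Limiting form when uptake and clearance rates coincide.
        return dose * k_in * t_min * math.exp(-k_in * t_min)
    scale = dose * k_in / (k_in - k_out)
    return scale * (math.exp(-k_out * t_min) - math.exp(-k_in * t_min))

def time_to_peak(k_in, k_out):
    """Minutes until the brain level peaks (uptake balances clearance)."""
    if math.isclose(k_in, k_out):
        return 1.0 / k_in
    return math.log(k_in / k_out) / (k_in - k_out)

def initial_rise_rate(k_in, dose=1.0):
    """Slope of the brain level at t = 0; equals dose * k_in in this model."""
    return dose * k_in

K_OUT = 0.02                 # clearance rate (1/min), hypothetical
K_IN_FAST = 0.5              # fast, intravenous-like uptake (1/min), hypothetical
K_IN_SLOW = K_IN_FAST / 10   # tenfold slower, oral-like uptake, hypothetical

if __name__ == "__main__":
    for label, k_in in (("fast", K_IN_FAST), ("slow", K_IN_SLOW)):
        t_peak = time_to_peak(k_in, K_OUT)
        print(f"{label}: peak at {t_peak:.1f} min, "
              f"peak level {brain_level(t_peak, k_in, K_OUT):.2f}, "
              f"initial slope {initial_rise_rate(k_in):.3f} per min")
```

Under these assumptions the slow route reaches its peak later and its initial rate of rise is ten times lower, which is the quantity the text links to reinforcement. A real analysis would fit measured time-activity curves rather than assume rate constants.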
Chronic drug abuse downregulates dopamine receptors and dopamine production: The “high” is blunted
The fact that drug use must become chronic before addiction takes root is a clear indication that the disease is predicated, in vulnerable individuals, on repeated perturbations of the reward system. These perturbations can eventually lead to neuroadaptations in many other circuits (motivation/drive, inhibitory control/executive function, and memory/conditioning) that are also modulated by DA. Among the neuroadaptations that have been consistently reported in addicted subjects are the significant reductions in the levels of the D2R (high affinity) receptors and in the amount of DA released by DA cells (Fig. 3). Importantly, these deficits are associated with lower regional metabolic activity in areas of the prefrontal cortex (PFC) that are critical for proper executive performance (i.e. anterior cingulate gyrus (CG) and orbitofrontal cortex (OFC)) (Fig. 4A). This observation led us to postulate that this may be one of the mechanisms that connect the drug-induced disruption in DA signaling with the compulsive drug administration and the lack of control over drug intake that characterizes addiction. Also, the resulting hypodopaminergic state would explain an addicted individual's decreased sensitivity to natural rewards (e.g. food, sex) and the perpetuation of drug use as a means to temporarily compensate for this deficit. An important corollary of this knowledge is that addressing these deficits (by increasing striatal D2R levels and increasing DA release in striatum and prefrontal regions) could offer a clinical strategy to ameliorate the impact of addiction. Is there any evidence that reversing the hypodopaminergic state can have a positive impact on substance-abuse-related behaviors? The answer is yes. Our studies show that by forcing the overproduction of D2R, deep inside the reward system of cocaine- or alcohol-experienced rats, we can significantly reduce the self-administration of cocaine or alcohol, respectively.
Moreover, in rodents, as well as in human methamphetamine abusers, a reduced striatal level of D2R is also associated with impulsivity, and in rodents it predicts compulsive patterns of drug self-administration (see below).
Imaging studies have also shown that, in humans, addiction is associated with a decrease in DA release in the ventral striatum and in other regions of the striatum, and with blunted pleasurable responses to the drug in active and in detoxified drug users (Fig. 5). This was an unexpected finding since it had been hypothesized that addiction reflected an enhanced sensitivity to the rewarding (and hence the dopaminergic) responses to drugs. In drug abusers, decreases in DA release could reflect either disrupted neurophysiology within the reward circuitry (i.e. in the DA neurons that release DA in the striatum) or, alternatively, a disrupted feedback regulation of the reward circuit by prefrontal (executive control) or amygdalar (emotional) pathways (prefrontal-striatal, amygdalar-striatal glutamatergic pathways). Since a pure dopaminergic dysfunction in the striatum, as seen in the chronic drug abuser, fails to account for the traits that characterize addictive behaviors, like impulsivity, cravings, and the relapse triggered by drug cues, it is very likely that prefrontal regions (as well as the amygdala) are also involved here, because their disruption would enable or at least influence these behavioral traits.
Lowered dopamine receptor (D2R) levels impair the control of impulsivity by the prefrontal cortex
It has been hypothesized that the impaired control over compulsive drug-taking behaviors that characterizes addiction may be due in part to specific dysfunctions in frontal regions of the brain. There is now a significant amount of evidence that supports this notion, beginning with animal studies that explore the connection between D2R and behavioral control. Experiments with rats clearly show a correlation between low D2R and impulsivity, and between impulsivity and drug self-administration. But what is the connection? As previously mentioned, in drug abusers, lower striatal D2R significantly correlates with lower brain glucose metabolism in key regions of the PFC, such as the OFC (involved with salience attribution and whose disruption results in compulsive behaviors) and in CG (involved with inhibitory control and error monitoring and whose disruption results in impulsivity) (Fig. 4B) [38, 39]. Moreover, in a study we performed in individuals (mean ± SD age, 24 ± 3 years) with a family history of alcoholism, but who were not alcoholics themselves, we also uncovered a significant association between striatal D2R and metabolism in frontal regions (CG, OFC, and dorsolateral PFC) and also in anterior insula (involved in interoception, self-awareness, and drug craving) (Fig. 6). Interestingly, these individuals had higher striatal D2R than matched controls with no family history of alcoholism, although they did not differ in frontal metabolism. Also, in the controls, striatal D2R did not correlate with frontal metabolism. This led us to speculate that the higher than normal striatal D2R in subjects with a high genetic risk for alcoholism protects them against alcoholism in part by strengthening activity in prefrontal regions.
When combined, these data suggest that high levels of D2R in striatum could protect against drug abuse and addiction by keeping impulsivity traits under control, i.e., by regulating circuits involved in inhibiting behavioral responses and in controlling emotions.
Similarly, we hypothesized that the prefrontal regions are also involved in the reduction of striatal DA release (and reinforcement) observed in addicted subjects since they regulate DA cell firing in midbrain and DA release in striatum. To test this hypothesis we assessed the relationship between baseline metabolism in the PFC and the increases in striatal DA induced by the intravenous administration of MPH in controls and in detoxified alcoholics. Consistent with the hypothesis, in alcoholics we failed to detect the normal association between baseline prefrontal metabolism and DA release in striatum, suggesting that the marked decreases in DA release in striatum seen in alcoholics reflect in part improper regulation of brain activity by prefrontal brain regions.
Thus, we have found an association between reduced baseline activity in PFC and reduced striatal D2R in drug-addicted subjects, and between baseline PFC activity and DA release in controls that is not present in addicted individuals. These associations evince the strong connections between neuroadaptations in PFC pathways and downstream dysfunctions in the DA reward and motivational system, likely because of the influence of PFC on impulsivity and compulsivity. However, these fail to account for additional behavioral phenomena, such as the effects of drug-associated cues in triggering craving, which would presumably implicate memory and learning circuits.
Conditioned memories and stereotypic behaviors replace the “high” as the driver
Over-stimulation of DA cells in the ventral striatum eventually establishes new functional connections in the brain between the act of satisfying the urge and the situational events surrounding it (e.g., environment, routine of preparing the drug, etc.), laying down new, powerful learned associations that can trigger behavior. Ultimately, the mere memory or anticipation of the drug can trigger the impulsive behaviors that characterize addicted individuals. With repeated drug use, the firing of DA cells in the striatum begins to change the neurochemistry underlying associative learning. This facilitates the consolidation of maladaptive memory traces connected to the drug, which helps explain the ability of all sorts of drug-associated stimuli (in the learned expectation of receiving the drug reward when exposed to these stimuli) to readily trigger DA cell firing. And because of the role of DA in motivation, these DA increases trigger the motivational drive required to secure the reward. Indeed, when rats are exposed repeatedly to a neutral stimulus that is paired with the drug (conditioned), it can elicit DA increases and reinstate drug self-administration. Such conditioned responses are clinically relevant in substance-use disorders because they are responsible for the high likelihood of an addicted person to relapse even after protracted periods of detoxification. Now, brain imaging techniques allow us to test whether exposure of humans to drug-associated cues can trigger drug craving just as shown in laboratory animals.
This question has been investigated in active cocaine abusers. Using PET and [11C]raclopride, two independent studies showed that exposure to a cocaine-cues video (of subjects smoking cocaine) but not to a neutral video (of nature scenes) increased striatal DA in human subjects addicted to cocaine (Fig. 7) and that the DA increases were associated with subjective reports of drug craving [44, 45]. The higher the DA increases triggered by exposure to the cocaine-cues video, the more intense the drug craving. Moreover, the magnitude of the DA increases was also correlated with addiction severity scores, highlighting the relevance of conditioned responses in the clinical syndrome of addiction.
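In [11C]raclopride PET, DA increases are not measured directly. They are commonly inferred from a drop in tracer binding to D2/D3 receptors, because endogenous DA competes with the tracer for the same sites. A sketch of the usual percent-change measure follows; the symbols are chosen here for illustration, and the exact quantification used in the cited studies is not given in this text:

```latex
\Delta \mathrm{BP}\,(\%) =
  \frac{\mathrm{BP}_{\text{neutral}} - \mathrm{BP}_{\text{cue}}}
       {\mathrm{BP}_{\text{neutral}}} \times 100
```

Here BP_neutral and BP_cue denote the striatal binding measure of [11C]raclopride during the neutral video and the cocaine-cues video, respectively. A positive value indicates reduced tracer binding, interpreted as increased extracellular DA.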
It is important to emphasize, however, that in spite of the presumed strength of these maladaptive associations, we have recently gathered new evidence suggesting that cocaine abusers retain some ability to purposefully inhibit craving. Therefore, strategies to strengthen fronto-striatal regulation may offer potential therapeutic benefits.
Putting it all together
Some of the most pernicious features of drug addiction are the overwhelming craving to take drugs that can reemerge even after years of abstinence, and the severely compromised ability of addicted individuals to inhibit drug seeking once the craving erupts in spite of well-known negative consequences.
We have proposed a model of addiction that explains the multidimensional nature of this disease by proposing a network of four interrelated circuits, whose combined dysfunctional output can explain many of the stereotypic behavioral features of addiction: (a) reward, including several nuclei in the basal ganglia, especially the ventral striatum, whose Nac receives input from the ventral tegmental area and relays the information to the ventral pallidum (VP); (b) motivation/drive, located in the OFC, subcallosal cortex, dorsal striatum and motor cortex; (c) memory and learning, located in the amygdala and the hippocampus; and (d) planning and control, located in the dorsolateral prefrontal cortex, anterior CG and inferior frontal cortex. These four circuits receive direct innervations from DA neurons but are also connected with one another through direct or indirect projections (mostly glutamatergic).
The four circuits in this model work together and their operations change with experience. Each is linked to an important concept, respectively: saliency (reward), internal state (motivation/drive), learned associations (memory, conditioning), and conflict resolution (control). In addition, these circuits also interact with circuits involved with mood (including stress reactivity) and with interoception (which result in awareness of drug craving and mood). We have proposed that the pattern of activity in the four-circuit network outlined here influences how a normal individual makes choices among competing alternatives. These choices are influenced systematically by the reward, memory/conditioning, motivation, and control circuits and these in turn are modulated by circuits that underlie mood and conscious awareness (Fig. 8A).
The response to a stimulus is affected by its momentary saliency, i.e. its expected reward. In turn, reward expectation is processed in part by DA neurons projecting into the ventral striatum and influenced by glutamatergic projections from the OFC (which assigns saliency value as a function of context) and amygdala/hippocampus (which mediate conditioned responses and memory recollections). The value of the stimulus is weighted (compared) against that of other alternative stimuli, but also changes as a function of the internal needs of the individual, which are modulated by mood (including stress reactivity) and interoceptive awareness. In particular, stress exposure enhances the saliency value of drugs while at the same time it decreases prefrontal regulation of the amygdala. In addition, chronic drug exposure is linked to enhanced sensitization to stress responses, which explains why stress so often triggers drug relapse in clinical situations. The stronger the saliency value of the stimulus, in part shaped by previously memorized experiences, the greater the activation of the motivational circuit and the stronger the drive to procure it. The cognitive decision to act (or not) to procure the stimulus is processed in part by the PFC and the CG, which weigh the balance between the immediate positive versus the delayed negative outcomes, and by the inferior frontal cortex (Brodmann Area 44), which works to inhibit the prepotent response to act.
According to this model, in the addicted subject (Fig. 8B), the saliency value of the drug of abuse and its associated cues is enhanced at the expense of other (natural) rewards, whose saliency is markedly reduced. This would explain the increased motivation to seek the drug. However, acute drug exposure also resets reward thresholds, resulting in decreased sensitivity of the reward circuit to reinforcers, which also helps explain the decreasing value of non-drug reinforcers in the addicted person. Another reason for the enhanced saliency of a drug is the lack of habituation of DA responses to drugs of abuse (tolerance) compared with the normal habituation that exists for natural rewards and that results in satiety.
Moreover, exposure to conditioned stimuli is sufficient to increase reward thresholds; thus, we would predict that in an addicted person, exposure to an environment with conditioned cues would further exacerbate their decreased sensitivity to natural rewards. In the absence of competition by other reinforcers, conditioned learning elevates the acquiring of the drug to the level of a main motivational drive for the individual. We hypothesize that drug cues (or stress) result in fast DA increases in the Nac in the ventral striatum and in the dorsal striatum that drive the motivation to take the drug and cannot be properly opposed by a dysfunctional PFC. Thus, upon drug consumption and intoxication the enhancement of the DA signals would result in a corresponding overactivation of the motivational/drive and memory circuits, which deactivate the PFC (prefrontal inhibition occurs with intense amygdala activation), blocking the power of the PFC to control the motivational/drive circuit. Without this inhibitory control, a positive-feedback loop is established, which results in compulsive drug intake. Because the interactions between the circuits are bidirectional, the activation of the network during intoxication serves to further strengthen the saliency value of the drug and the conditioning to drug cues.
In short, we propose a model that accounts for addiction as follows: During addiction, the enhanced value of drug cues in the memory circuit drives reward expectation and enhances the motivation to consume the drug, overcoming the inhibitory control exerted by an already dysfunctional PFC. Although the drug-induced DA increase is markedly attenuated in drug-addicted subjects, the pharmacological effects of the drug become conditioned responses in themselves, further driving the motivation to take the drug and favoring a positive-feedback loop now unopposed, because of the disconnection of the prefrontal control circuit. At the same time, addiction is likely to also recalibrate the circuits that instantiate mood and conscious awareness (represented by darker shades of gray) (Fig. 8B) in ways that, if experimentally corroborated, would further tilt the balance away from inhibitory control and towards craving and compulsive drug taking.
We readily admit that this is a simplified model: we realize that other brain regions must also be involved in these circuits, that one region may contribute to several circuits, and that other circuits are likely to also be involved in addiction. In addition, though this model focuses on DA, it is evident from preclinical studies that modifications in glutamatergic projections mediate many of the adaptations observed in addiction and that we discussed here. It is also evident from preclinical studies that other neurotransmitters, including cannabinoids and opioids, are involved in the reinforcing effects of drugs. Unfortunately, until recently, limited access to radiotracers for PET imaging has restricted the capacity to investigate the involvement of other neurotransmitters in drug reward and in addiction.
Abbreviations
- CG: cingulate gyrus
- D2R: dopamine type 2/3 receptor
- GABA: γ-aminobutyric acid
- HPA: hypothalamic pituitary axis
- Nac: nucleus accumbens
- NMDA: N-methyl-D-aspartic acid
- OFC: orbitofrontal cortex
- PET: positron emission tomography
- PFC: prefrontal cortex
- VP: ventral pallidum