Criminal deterrence in the reduced form: a new perspective on Ehrlich's seminal study.
Andreoni, James
I. INTRODUCTION
Perhaps the best known and most influential econometric analysis of the deterrent effect of criminal penalties is Isaac Ehrlich's [1973] "Participation in Illegitimate Activities: A Theoretical and Empirical Analysis." He reported large and significant marginal deterrent effects of penalties for a wide range of crimes. Ehrlich's work was heavily criticized on methodological grounds, and many subsequent studies have contradicted his results, while still others have not.(1) Nonetheless, Ehrlich's work remains an important hallmark in the study of criminal deterrence.
This note is motivated by recent research that indicates that penalties may have indirect consequences that could reduce their deterrent effect. Higher penalties could, for instance, encourage a criminal to spend more effort avoiding detection and prosecution (Malik [1990], and Snyder [1990]). For instance, as Lott [1987; 1992] shows, higher penalties may increase the amount of money criminals spend on legal defense.(2) This means that higher penalties could reduce the probability of apprehension and conviction. In addition, a feature of the legal system may generate another way for higher penalties to reduce the probability of punishment. This feature is that judges and juries must apply the reasonable doubt test to their decision to convict a defendant. Assume that jurors establish the level of doubt which is "reasonable" by finding the threshold that will minimize expected (social) losses. If juries, like criminals, have a forecast of likely penalties, then as the expected penalty rises, the cost of an incorrect conviction also rises. This means the juror's threshold level of reasonable doubt will rise as well. The effect is to reduce the probability that a given defendant will be convicted (Andreoni [1991]). If either the "avoidance" or the "reasonable doubt" effect is significant, then this will mitigate the direct deterrent effect of penalties on crime rates.
Reexamining Ehrlich's work we see that his prime objective was to identify a single equation in a three-equation simultaneous equations system, that being the equation predicting crime rates. He found significant negative coefficients on penalties, which he took as evidence for his theoretical model of criminal behavior in which criminals could be deterred by higher penalties. If more recent theories are correct in claiming that higher penalties may also decrease the chance of conviction, then Ehrlich's estimation of the single equation will not tell a complete story. In particular, we must estimate a reduced-form model which will capture both the direct and indirect effects of penalties on crime rates.
This paper will address these questions using Ehrlich's original data. Replicating Ehrlich's regressions, we find strong direct deterrent effects of penalties. However, we also find that increased penalties have a significant negative affect on the probability of conviction, as predicted by the theoretical studies. We combine these effects by examining the reduced-form equations for both crime rates and probabilities of conviction. We find that the total effect of penalties on crime rates is approximately zero for all twelve crime categories considered but that the effects of penalties on probabilities of conviction remain negative and significant in the reduced-form equations.
This note allows for two general conclusions. First, we provide some evidence for theoretical models that predict that penalties may reduce the probability that criminals will be convicted. Second, and more importantly, despite one's views on the many widely discussed methodological issues surrounding Ehrlich's seminal work, when generalized to account for possibilities that higher penalties may reduce the probability of conviction, his data generate much different policy implications than those widely adopted at the time of his paper's publication.
II. SPECIFICATION
This section discusses the specification of the structural equations for Ehrlich's original study. Every attempt is made to honor Ehrlich's original specifications, data, and intentions. The original data set used by Ehrlich was published in Vandaele [1978]. This data is a cross section of forty-seven states for 1960.(3) Table I lists the variables used, along with their definitions and means. All of the crime rate, probability, and sentence length variables are available for a total of twelve crime categories.
Ehrlich outlined a simultaneous equations model with three endogenous variables: crime rates CrimeRate, probabilities of apprehension and conviction Prob, and police expenditures Expend. He clearly specified the crime rate equation in his (4.1) as(4)
(1) In CrimeRate = [[Alpha].sub.0] + [[Alpha].sub.1] ln Prov + [[Alpha].sub.2] ln Sentence + [[Alpha].sub.3] ln Income + [[Alpha].sub.4] ln PctPoor + [[Alpha].sub.5] ln PctNon White + [[Epsilon].sub.1].
The primary predictions are that the coefficients on the probability, ln Prob, and the time served in prison, ln Sentence, should be negative. Predictions for the other coefficients can be found in Ehrlich [1973].
Ehrlich specified one equation for the aggregate "effectiveness of law enforcement" in his (4.5). This equation is
(2) ln Prob = [[Beta].sub.0]+ [[Beta].sub.1] ln CrimeRate + [[Beta].sub.2] ln Expend + [[Beta].sub.3] ln Population + [[Beta].sub.4] ln SMSA + [[Beta].sub.5] ln Pct Poor + [[Beta].sub.6] ln PctNonWhite + [[Beta].sub.7] ln Edu + [[Beta].sub.8] ln [Age.sub.14-24] + [[Beta].sub.9]South + [[Epsilon].sub.2].
This is the only specification Ehrlich gives for the structural equation on Prob, and will be taken as his intended specification for this equation. Several recent studies indicate that perhaps (2) should also include the potential penalties Sentence. One reason, suggested by Lott [1987] and Malik [1990], is that higher penalties may lead some criminals to spend more resources on "avoidance," that is evading detection, or pursuing more vigorous criminal defense. Andreoni [1991] argues that higher penalties may also lead judges and jurors to increase the standards of proof required to convict "beyond a reasonable doubt," leading to fewer convictions.(5) For this reason we also estimate an equation which includes In Sentence on the right-hand side:
(2[prime]) ln Prob = [[Beta].sub.0] + [[Beta].sub.1] ln CrimeRate + [[Beta].sub.2] ln Expend + [[Beta].sub.3] ln Population + [[Beta].sub.4] ln SMSA + [[Beta].sub.5] ln PctPoor + [[Beta].sub.6] ln PctNonWhite + [[Beta].sub.7] ln Edu + [[Beta].sub.8] ln [Age.sub.14-24] + [[Beta].sub.9]South + [[Beta].sub.10] ln Sentence + [[Epsilon].sub.2[prime]],
where the coefficient on ln Sentence is predicted to be negative.
The final equation, on expenditures, was not explicitly specified by Ehrlich. In his equation (3.8) Ehrlich presented a partial adjustment model of expenditures that, in log form, is
(3) ln Expend = Z[Gamma] + [[Gamma].sup.*] ln CrimeRate + (1-[[Gamma].sup.*]) ln [Expend.sub.t - 1] + [[Epsilon].sub.3]
where Z is a vector of exogenous variables intended to represent, in part, the expected loss due to crime. Ehrlich (footnote 34) indicates that one way to proxy for this is to include lagged crime rates, [CrimeRate.sub.t - 1], in Z. There are two remaining exogenous variables used in Ehrlich's two-stage least-squares estimates that have not yet entered any structural equation, the unemployment rate of males, [Unemp.sub.35-39], and the percentage of the population that is male, PctMales. We must assume that Ehrlich also meant to include these in the expenditures equation. However, Ehrlich made no attempt to estimate equation (3) due to a lack of variables needed to fully specify this equation. Since we have no additional data here, we also will not attempt to estimate this equation.
[TABULAR DATA FOR TABLE I OMITTED]
III. RESULTS
Equations (1), (2) and (2[prime]) were estimated for all twelve crime categories. Space constraints will not allow all of the regression results to be reported, but for illustration the structural parameters for these three equations for the "all crimes" category are listed in Table II.(6) The first thing to note from Table II is that the coefficient on the penalty ln Sentence in equation (1) is -1.063 and is highly significant (t = -4.49). This indicates a large direct deterrent effect of penalties. Turning to equations (2) and (2[prime]), we see that the coefficient on ln Sentence in (2[prime]) is -.858, and is also highly significant (t = -4.21). Moreover, the addition of ln Sentence to equation (2) significantly increases the [R.sup.2] of the regressio.(7) This is consistent with a substantial avoidance or reason-able-doubt effect of penalties on probabilities of conviction. Hence, it is reasonable to expect that the structural parameter on In Sentence reported from equation (1) will overstate the total deterrent effect of penalties. TABLE II Structural Coefficients and Standard Errors for the All Crime Category
Equation (1) (2) (2[prime]) Variable ln CrimeRate ln Prob ln Prob
Constant -2.169 1.489 4.702(*) (3.581) (2.477) (2.191)
ln CrimeRate -.854(**) -.674(**) (.226) (.192)
ln Prob -.937(**) (.156)
ln Expend .219 -.169 (.358) (.311)
ln Sentence -1.063(**) -.858(**) (.237) (.204)
ln Income 1.383(**) (.471)
ln PctPoor 1.779(**) 1.094 .724 (.408) (.623) (.525)
ln PctNonWhite .261(**) .267(**) .257(**) (.050) (.092) (.077)
ln Population -.226(**) -.063 (.076) (.074)
ln SMSA -.059 -.045 (.039) (.033)
ln Edu 2.370(*) 1.498 (.898) (.774)
ln [Age.sub.14-24] -1.074 -1.036 (.817) (.678)
South .428(*) .298 (.189) (.160)
[R.sup.2] .736 .787 .858
** Significant beyond the .01 level * Significant beyond the .05 level
Table III reports the parameters on In Sentence in the structural equations (1) and (2[prime]) as well as in the reduced-form equations for crime rates, ln CrimeRate, and probabilities of conviction ln Prob. The reduced-form coefficients are obtained from the ordinary-least-squares regressions [TABULAR DATA FOR TABLE III OMITTED] conducted as the first stage to the two-stage least-squares procedure. We see from the first column of Table III that Ehrlich's main finding is replicated. For seven of the twelve categories there is a significant direct deterrent effect of penalties. However, we also see from column 2 that in nine of twelve categories the penalty has a significantly negative effect on the probability of conviction. While the data is not precise enough to clearly identify the avoidance or reasonable doubt models, this coefficient is consistent with their predictions. In the final two columns we can see the total effect of penalties in the reduced-form equations. In all of the twelve categories the total effect of penalties on crime rates collapses to almost zero. The point estimate is actually positive in seven of the categories, although in no category is the effect significant. By contrast, the coefficients on In Sentence in the ln Prob (probability of conviction) equations remain negative and largely significant in the reduced form.
Although the coefficient on ln Sentence was never seen to have a significant effect on crime rates, other variables were often significant in the reduced-form equation. These include ln [CrimeRate.sub.t-1], as well as demographic variables such as ln Income, ln Population, ln PctPoor, ln PctNonWhite, ln PctMales and South. Surprisingly, lagged expenditures on law enforcement, ln [Expend.sub.t-1], was only significant in the reduced-form equation for larceny, with a coefficient of -0.292 (s.e. = 0.135). It is likely, however, that the lack of significance is due to measurement problems and the difficulty of making meaningful comparisons across states.
IV. CONCLUSION
Ehrlich's [1973] paper remains one of the best known and most influential studies of criminal deterrence. This paper notes that Ehrlich only discussed the partial effect of penalties by estimating just one of the structural equations of his three-equation model. Penalties that affect crime rates directly may have indirect effects via other endogenous variables. To measure the total effect, we reconsider Ehrlich's data but estimate the reduced-form coefficients.
In reanalyzing Ehrlich's model, we find that the parameters in the structural equations are broadly consistent with economic models of crime. First, penalties have a direct and significant negative effect on crime. This is consistent with a direct deterrent effect of penalties. Second, penalties have a direct and significant negative effect on probabilities of conviction. This suggests the existence of possible "avoidance" or "reasonable doubt" effects in which players in the legal process, such as judges and jurors, make decisions that mitigate the impact of sentencing policies. Combining these effects in the reduced-form parameters yields marginal deterrent effects of approximately zero for all twelve crime categories, and none of the effects is significant. Nonetheless, the reduced-form effects of penalties on probabilities of conviction remain negative and significant.
This paper now puts Ehrlich's seminal work in a new light. When generalized to account for the fact that penalties may reduce the probability of conviction, Ehrlich's data generate much different policy implications than those widely adopted at the time of its publication.
1. Since Blumstein, Cohen and Nagin [1978] researchers have emphasized that studies based on individuals (or that are aggregated at a low level, such as by city, community, or neighborhood) will avoid difficult identification problems inherent in highly aggregated data, such as that used by Ehrlich. For recent work in this area see Bursik et al. [1990], Grogger [1991], Klepper and Nagin [1989], Myers [1983], Schmidt and Witte [1984], Votey [1984], and Witte [1980].
2. Lott goes on to argue that this has the beneficial effect of increasing the penalty (which includes defense expenditures) even though it may reduce the probability of conviction.
3. While Ehrlich also considered data from 1940 and 1950, those data were not available for this study. This is not constraining, since Ehrlich was primarily concerned with the 1960 data.
4. Ehrlich also tested for the effects of unemployment, labor-force participation and age composition. He augmented his equation (4.1) to become
ln CrimeRate = [[Alpha].sub.0] + [[Alpha].sub.1] lnProb + [[Alpha].sub.2] ln Sentence + [[Alpha].sub.3] ln Income + [[Alpha].sub.4] ln PctPoor + [[Alpha].sub.5] ln PctNonWhite + [[Alpha].sub.6] ln [Unemph.sub.4-24] + [[Alpha].sub.7] ln [L.sub.14-24] + [[Alpha].sub.8] ln [Age.sub.14-24] + [[Epsilon].sub.1[prime]],
which is his equation (4.4). We also estimate systems with equation (1[prime]) and find results similar to those reported in this paper. For brevity we do not present these here. Tables containing these results are available from the author upon request.
5. See Snyder [1990] for empirical evidence of this effect in the enforcement of antitrust laws.
6. One should note that computational errors made by Ehrlich were discovered during Vandaele's [1978] replication. These errors do not change Ehrlich's qualitative finding. However, they do imply that the coefficients we report on the crime rate equations will not be identical to those published by Ehrlich, but instead will match those found by Vandaele [1978].
7. A standard F-test for the significance of additional regressors is F(1,37) = 18.5, which is highly significant.
REFERENCES
Andreoni, James. "Reasonable Doubt and the Optimal Magnitude of Fines: Should the Penalty Fit the Crime?" RAND Journal of Economics, 22(3), Autumn 1991, 385-95.
Blumstein, A. J. Cohen, and D. Nagin, eds. Deterrence and Incapacitation: Estimating the Effects of Criminal Sanctions on Crime Rates. Washington, D.C.: National Academy of Sciences, 1978.
Brier, Stephen S., and Stephen E. Fienberg, "Recent Econometric Modeling of Crime and Punishment." Evaluation Review, 4, 1980, 147-91.
Bursik, Robert J., Harold Grasmick, and Mitchell Chamlin, "The Effect of Longitudinal Arrest Patterns on the Development of Robbery Trends at the Neighborhood Level." Criminology, 28, 1990, 431-50.
Ehrlich, Isaac. "Participation in Illegitimate Activities: A Theoretical and Empirical Investigation." Journal of Political Economy, 81(3), May-June 1973, 521-64.
Grogger, Jeffrey. "Certainty versus Severity of Punishment." Economic Inquiry, 29(2), April 1991, 297-309.
Klepper, Steven, and Daniel Nagin. "The Deterrent Effect of Perceived Certainty and Severity of Punishment Revisited." Criminology, 27, 1989, 721-46.
Lott, John R. "Should the Wealthy Be Able to 'Buy Justice'." Journal of Political Economy, 95(6), December 1987, 1302-15.
_____. "Do We Punish High Income Criminals Too Heavily?" Economic Inquiry, 30(4), October 1992, 583-608.
Malik, Arun. "Avoidance, Screening and Optimum Enforcement." RAND Journal of Economics, 21(3), Autumn 1990, 341-53.
Myers, Samuel L. "Estimating the Economic Model of Crime: Employment versus Punishment Effects." Quarterly Journal of Economics, 98(1), February 1983, 157-66.
Pilivian, Irving, Craig Thornton, Rosemary Gartner, and Ross L. Matsueda, "Crime, Deterrence, and Rational Choice." American Sociological Review, 51, 1986, 101-19.
Schmidt, Peter, and Anne D. Witte. An Economic Analysis of Crime and Justice: Theory, Methods, and Applications. Orlando: Academic Press, 1984.
Snyder, Edward A. "The Effect of Higher Criminal Penalties on Antitrust Enforcement." Journal of Law and Economics, 33(2), October 1990, 439-62.
Vandaele, Walter. "Participation in Illegitimate Activities: Ehrlich Revisited," in Deterrence and Incapacitation: Estimating the Effects of Criminal Sanctions on Crime Rates, edited by A. Blumstein, J. Cohen and D. Nagin. Washington, D.C.: National Academy of Sciences, 1978, 270-315.
Votey, Harold L. Jr. "Crimes in Sweden: Causality, Control Effects, and Economic Efficiency." Advances in Applied Micro-Economics, 3, 1984, 229-65.
Witte, Ann Dryden. "Estimating the Economic Model of Crime with Individual Data." Quarterly Journal of Economics, 94(1), February 1980, 57-84.
JAMES ANDREONI, Associate Professor, University of Wisconsin-Madison. I am grateful to some anonymous referees for helpful comments, and to Soren Hauge for excellent research assistance. Financial support from the Alfred P. Sloan Foundation Research Fellowship and the National Science Foundation, grant SES-9110821, are also gratefully acknowledged.