Using elasticities to derive optimal income tax rates, Emmanuel Saez (PowerPoint presentation)

SLIDE 1

Using elasticities to derive optimal income tax rates

Emmanuel Saez (2001)

Tax and transfer policies M2 PPD Nicholas McSpedden-Brown

SLIDE 2

Introduction

How much progressivity should there be in tax schedules? ⟹ Equity-efficiency trade-off: redistribution vs incentives

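Optimal tax rate: the tax rate that maximises social welfare; it coincides with the rate that collects the most revenue (the Laffer rate) when the government puts no weight on the welfare of the taxpayers concerned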

Original model: Mirrlees (1971)

Saez’s goal: to show clearly that there is a simple link between optimal tax formulas and elasticities of earnings.

SLIDE 3

Plan

1. Optimal marginal tax rate for top incomes

2. General non-linear optimal tax rates for any tax bracket

3. Numerical simulations of optimal tax schedules

SLIDE 4

1. HIGH INCOME OPTIMAL TAX RATES

SLIDE 5

Base specifications

Maximisation of a utility function $u = u(c, z)$

where $u_c = \partial u / \partial c > 0$ and $u_z = \partial u / \partial z < 0$ (with earnings $z = wl$),

subject to the budget constraint $c = z(1 - \tau) + R$, where:

$\tau$ is the top marginal tax rate

$R$ is virtual (non-labour) income: the post-tax income an individual would get if he supplied zero labour and was allowed to stay on the “virtual” linear schedule

SLIDE 6

For those who failed/skipped/forgot Micro 101…

Substitution effect: if the price of a good increases relative to another, then people will consume relatively more of the other good.

⟹ If the tax rate goes up, leisure becomes more attractive because the ‘price’ paid for it (after-tax income forgone by not working) has fallen.

Income effect: if total income is reduced, then people will cut back on the consumption of all normal goods (goods whose demand rises with income).

⟹ If the tax rate goes up, I have less income, and therefore I ‘consume’ less leisure, i.e. I work more.

SLIDE 7

Elasticity concepts

Uncompensated elasticity of earnings: $\zeta^u = \frac{1-\tau}{z}\,\frac{\partial z}{\partial(1-\tau)}$ (uncompensated, because it does not compensate for a change in income)

Income effects (= the marginal propensity to earn out of non-labour income): $\eta = (1-\tau)\,\frac{\partial z}{\partial R} \le 0$, since leisure is assumed not to be an inferior good.

Compensated elasticity of earnings: $\zeta^c = \frac{1-\tau}{z}\,\left.\frac{\partial z}{\partial(1-\tau)}\right|_{u = \text{const}}$ (purely substitution effects, since it compensates for a change in income)

Slutsky equation: $\zeta^c = \zeta^u - \eta \ge 0$

SLIDE 8

Deriving the high income optimal tax rate

Government sets top marginal rate $\tau$ for incomes above $\bar z$

Population with income above $\bar z$ normalised to 1

$h(z)$: density of the earnings distribution at the optimum tax regime

Consider a small increase $d\tau$ in the top tax rate $\tau$ for incomes above $\bar z$

SLIDE 9

High income tax rate perturbation

SLIDE 10

Decomposing the change in total taxes paid

Total taxes paid at an income $z$ above $\bar z$ = marginal rate for incomes above $\bar z$ × income in excess of $\bar z$ + total taxes paid at income $\bar z$

⟹ $T(z) = \tau\,(z - \bar z) + T(\bar z)$

⟹ $dT(z) = (z - \bar z)\,d\tau + \tau\,dz$

⟹ $\int_{\bar z}^{\infty} dT(z)\,h(z)\,dz = M + B$

Total taxes paid therefore change for two reasons: a mechanical effect ($M$) and behavioural responses ($B$)

SLIDE 11

The mechanical effect

Mechanical effect: the increase in tax receipts if there were no behavioural responses.

A taxpayer with income $z > \bar z$ pays $(z - \bar z)\,d\tau$ in additional taxes.

Summing over the population with $z > \bar z$, the total mechanical effect on tax receipts is $M = (z_m - \bar z)\,d\tau$, where $z_m$ is the mean income of taxpayers above $\bar z$.

SLIDE 12

Behavioural responses

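As $z = z(1 - \tau, R)$, the total differential is:

$$dz = -\frac{\partial z}{\partial(1-\tau)}\,d\tau + \frac{\partial z}{\partial R}\,dR$$

Let’s express this in terms of the income effect and the uncompensated elasticity:

$\eta = (1-\tau)\,\frac{\partial z}{\partial R} \Rightarrow \frac{\partial z}{\partial R} = \frac{\eta}{1-\tau}$

$\zeta^u = \frac{1-\tau}{z}\,\frac{\partial z}{\partial(1-\tau)} \Rightarrow \frac{\partial z}{\partial(1-\tau)} = \frac{\zeta^u z}{1-\tau}$

And as $dR = \bar z\,d\tau$ (overall increase in virtual income), therefore:

$$dz = -(\zeta^u z - \eta\,\bar z)\,\frac{d\tau}{1-\tau}$$

This is the reduction in the earnings of an individual with income $z$ due to behavioural responses.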

SLIDE 13

Reduction in tax receipts due to behaviour responses

As we saw, a reduction in earnings of $dz$ implies a reduction in tax receipts of $\tau\,dz$ for one individual.

This implies that the total change in tax receipts due to behavioural responses is:

$$B = -\int_{\bar z}^{\infty} (\zeta^u z - \eta\,\bar z)\,\frac{\tau\,d\tau}{1-\tau}\,h(z)\,dz = -(\bar\zeta^u z_m - \bar\eta\,\bar z)\,\frac{\tau\,d\tau}{1-\tau}$$

where $\bar\zeta^u$ is the income-weighted average of the uncompensated elasticity, and $\bar\eta$ the average income effect.

SLIDE 14

Obtaining the optimal tax rate

Need to equalise the revenue effect (the sum of the mechanical effect and the behavioural response) to the welfare effect.

Compute the welfare effect: let $\bar g$ = marginal social utility of money for top-bracket taxpayers divided by the marginal value of public funds for the government. Thus each additional dollar raised by the government as a result of the tax reduces the social welfare of the top bracket by $\bar g$ on average.

Hence the total welfare loss due to the tax reform is $\bar g M$.

Revenue effect = Welfare effect ⇔ $M + B = \bar g M$

SLIDE 15

Interpretation

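Result:

$$\frac{\tau}{1-\tau} = \frac{(1-\bar g)\,(z_m/\bar z - 1)}{\bar\zeta^u\, z_m/\bar z - \bar\eta}$$

Decreasing function of $\bar g$ and $\bar\zeta^u$, and increasing in $\bar\eta$.

When $\bar z$ is close to the top, $z_m/\bar z$ tends to 1 ⟹ $\tau$ tends to zero. This is because M is negligible compared to B near the top.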
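
The result above can be evaluated numerically. The following is a minimal sketch, not part of the original slides: the function name `top_rate` and the parameter values in the loop are illustrative assumptions. It shows how the rate falls to zero as the ratio of mean income above the threshold to the threshold itself approaches 1.

```python
def top_rate(g_bar, zeta_u_bar, eta_bar, ratio):
    """Top rate solving tau/(1-tau) = (1-g)(r-1) / (zeta_u*r - eta), with r = z_m / z_bar.

    Assumes zeta_u_bar * ratio - eta_bar > 0 (eta_bar <= 0 by the sign convention of the slides).
    """
    if ratio < 1:
        raise ValueError("z_m / z_bar cannot be below 1")
    odds = (1 - g_bar) * (ratio - 1) / (zeta_u_bar * ratio - eta_bar)
    # Convert tau / (1 - tau) back into tau.
    return odds / (1 + odds)


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only for illustration.
    for ratio in (2.0, 1.5, 1.1, 1.01, 1.0):
        print(ratio, round(top_rate(0.0, 0.25, -0.1, ratio), 3))
```

With these illustrative inputs the printed rate shrinks as the ratio moves towards 1, which is the zero-top-rate result stated on the slide.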

SLIDE 16

$z_m/\bar z$ for the U.S. in 1992/93: constant for high incomes ⟹ the zero top rate result has no practical interest

SLIDE 17

Pareto distributions

Distributions with a constant $z_m/\bar z$ ratio are exactly Pareto distributions.

A Pareto distribution is such that: $\text{Prob}(\text{Income} > z) = (\bar z / z)^a$

We have $E(z) = z_m = \frac{a\,\bar z}{a-1} \Rightarrow \frac{z_m}{\bar z} = \frac{a}{a-1}$.

For $z_m/\bar z = 2$, $a = 2$.

The higher $a$, the thinner the tail of the income distribution
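
The constant-ratio property can be checked by simulation. This is a sketch under stated assumptions rather than anything from the slides: incomes are drawn with Python's standard `random.paretovariate` (minimum income 1), the helper names are hypothetical, and the tail parameter 3 is used in the example only because the sample mean converges faster than with 2.

```python
import random


def pareto_mean_ratio(a):
    """Closed form z_m / z_bar = a / (a - 1), valid for a > 1."""
    if a <= 1:
        raise ValueError("the mean above the threshold is infinite for a <= 1")
    return a / (a - 1)


def simulated_mean_ratio(a, z_bar, n=200_000, seed=0):
    """Monte Carlo estimate of E[z | z > z_bar] / z_bar for Pareto incomes with minimum 1."""
    rng = random.Random(seed)
    draws = [rng.paretovariate(a) for _ in range(n)]
    above = [z for z in draws if z > z_bar]
    return sum(above) / len(above) / z_bar


if __name__ == "__main__":
    a = 3.0
    print("closed form:", pareto_mean_ratio(a))
    for z_bar in (1.5, 2.0, 3.0):
        # The estimate should stay near a / (a - 1) whatever the threshold.
        print(z_bar, simulated_mean_ratio(a, z_bar))
```

The simulated ratio should be roughly the same at every threshold, which is what makes the Pareto tail convenient for the top-rate formula.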

SLIDE 18

Rewriting the optimal marginal tax as a limiting tax for high incomes

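From $\frac{\tau}{1-\tau} = \frac{(1-\bar g)\,(z_m/\bar z - 1)}{\bar\zeta^u\, z_m/\bar z - \bar\eta}$, with $\frac{z_m}{\bar z} = \frac{a}{a-1}$ and the Slutsky equation $\bar\eta = \bar\zeta^u - \bar\zeta^c$:

$$\tau = \frac{1-\bar g}{1-\bar g+\bar\zeta^u+\bar\zeta^c\,(a-1)}$$

Decreasing function of $a$: a thinner tail implies a lower top rate

Role of elasticity effects vs income effects is visible

$\bar g = 0$, $\bar\zeta^u = \bar\zeta^c$ gives the Laffer rate $\tau = \frac{1}{1+\bar\zeta^c a}$.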
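
The limiting formula is easy to tabulate. The sketch below is illustrative only: the function names and the elasticity values in the loop are assumptions, not figures from the presentation.

```python
def asymptotic_top_rate(g_bar, zeta_u_bar, zeta_c_bar, a):
    """tau = (1 - g) / (1 - g + zeta_u + zeta_c * (a - 1)) for a Pareto tail with parameter a."""
    return (1 - g_bar) / (1 - g_bar + zeta_u_bar + zeta_c_bar * (a - 1))


def laffer_rate(zeta_c_bar, a):
    """Special case g = 0 and zeta_u = zeta_c, i.e. tau = 1 / (1 + zeta_c * a)."""
    return asymptotic_top_rate(0.0, zeta_c_bar, zeta_c_bar, a)


if __name__ == "__main__":
    a = 2.0  # Pareto parameter used on the slides for the U.S.
    # Hypothetical elasticities, for illustration only.
    for zeta in (0.2, 0.5, 0.8):
        print(f"elasticity {zeta}: Laffer rate {laffer_rate(zeta, a):.3f}")
```

Separating the uncompensated and compensated elasticities as two arguments keeps the role of income effects visible: holding the uncompensated elasticity fixed, a larger compensated elasticity lowers the rate.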

SLIDE 19

Optimal tax rates for high earners (using asymptotic rate formula)

SLIDE 20

2. OPTIMAL NON-LINEAR INCOME TAX RATES FOR ANY TAX BRACKET

SLIDE 21

Initial specifications

$H(z)$: cumulative income distribution function, i.e. the number of people with earnings below $z$ (total population normalised to 1)

$h(z)$: density of the income distribution at $z$, i.e. the number of people earning $z$

$h^*(z)$: virtual density, i.e. the density of the income distribution at $z$ that would exist if the tax schedule were replaced by the linear tax schedule tangent to it at $z$

$g(z)$: social marginal value of consumption for taxpayers with income $z$, at the optimum

SLIDE 22

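Formula for optimal tax rate at level $\bar z$

$$\frac{T'(\bar z)}{1 - T'(\bar z)} = \frac{1}{\zeta^c(\bar z)} \times \frac{1 - H(\bar z)}{\bar z\, h^*(\bar z)} \times \int_{\bar z}^{\infty} \big(1 - g(z)\big) \exp\left[\int_{\bar z}^{z} \left(1 - \frac{\zeta^u(z')}{\zeta^c(z')}\right) \frac{dz'}{z'}\right] \frac{h(z)}{1 - H(\bar z)}\, dz$$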

SLIDE 23

An increase in the marginal rate for $[\bar z, \bar z + d\bar z]$

SLIDE 24

Mechanical effect net of welfare loss

Every taxpayer with income $z > \bar z$ pays $d\tau\,d\bar z$ in additional taxes, which are valued $(1 - g(z))\,d\tau\,d\bar z$ by the government.

Therefore the overall mechanical effect net of welfare loss is:

$$M = d\tau\,d\bar z \int_{\bar z}^{\infty} \big(1 - g(z)\big)\,h(z)\,dz$$

SLIDE 25

Elasticity effect

Two components:

Direct compensated elasticity effect due to the exogenous increase $d\tau$

Indirect effect due to the shift of the taxpayer on the tax schedule by $dz$, inducing an endogenous additional change in marginal rates equal to $dT' = T''\,dz$

$$dz = -\zeta^c\,\bar z\,\frac{d\tau + dT'}{1 - T'}$$

Using the virtual density and summing:

⇒ $E = -\zeta^c\,\bar z\,\frac{T'}{1-T'}\,h^*(\bar z)\,d\tau\,d\bar z$

SLIDE 26

Income effect

A taxpayer with income $z > \bar z$ pays $-dR = d\tau\,d\bar z$ in additional taxes

⟹ Taxpayers above the bracket $[\bar z, \bar z + d\bar z]$ are induced to work more through income effects, which reinforces the mechanical effect.

Direct income effect: $\eta\,\frac{dR}{1 - T'}$

Indirect elastic effect due to the endogenous change in marginal rates $dT' = T''\,dz$

$$dz = -\zeta^c z\,\frac{dT'}{1-T'} - \eta\,\frac{d\tau\,d\bar z}{1-T'}$$

Using the virtual density and summing:

⇒ $I = d\tau\,d\bar z \int_{\bar z}^{\infty} -\eta\,\frac{T'}{1-T'}\,h^*(z)\,dz$

SLIDE 27

Total effect of tax reform

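Revenue effect = Welfare effect, therefore $M + E + I = 0$ (with $M$ already net of the welfare loss), giving the differential equation:

$$\frac{T'}{1-T'} = \frac{1}{\zeta^c}\,\frac{1-H(\bar z)}{\bar z\,h^*(\bar z)} \left[\int_{\bar z}^{\infty} \big(1 - g(z)\big)\,\frac{h(z)}{1-H(\bar z)}\,dz + \int_{\bar z}^{\infty} -\eta\,\frac{T'}{1-T'}\,\frac{h^*(z)}{1-H(\bar z)}\,dz\right]$$

By integration:

$$\frac{T'(\bar z)}{1 - T'(\bar z)} = \frac{1}{\zeta^c(\bar z)}\,\frac{1 - H(\bar z)}{\bar z\, h^*(\bar z)} \int_{\bar z}^{\infty} \big(1 - g(z)\big) \exp\left[\int_{\bar z}^{z} \left(1 - \frac{\zeta^u(z')}{\zeta^c(z')}\right) \frac{dz'}{z'}\right] \frac{h(z)}{1 - H(\bar z)}\, dz$$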

SLIDE 28

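Formula for optimal tax rate at level $\bar z$

$$\frac{T'(\bar z)}{1 - T'(\bar z)} = \frac{1}{\zeta^c(\bar z)} \times \frac{1 - H(\bar z)}{\bar z\, h^*(\bar z)} \times \int_{\bar z}^{\infty} \big(1 - g(z)\big) \exp\left[\int_{\bar z}^{z} \left(1 - \frac{\zeta^u(z')}{\zeta^c(z')}\right) \frac{dz'}{z'}\right] \frac{h(z)}{1 - H(\bar z)}\, dz$$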
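
As a consistency check, the general formula can be evaluated numerically in a special case and compared with the high-income rate of Part 1. The sketch below is not from the slides and rests on loud assumptions: a Pareto tail with parameter a, constant elasticities, a constant welfare weight g, and a virtual density equal to the actual density (as it would be if the schedule were linear in the tail). All function names are hypothetical.

```python
def exp_term(x, zeta_u, zeta_c):
    """Exponential term for constant elasticities, with x = z / z_bar."""
    return x ** (1 - zeta_u / zeta_c)


def rate_from_general_formula(g, zeta_u, zeta_c, a, n=200_000):
    """Evaluate T'/(1-T') at z_bar by midpoint quadrature, then convert it to T'."""
    # The substitution t = z_bar / z maps the integral over [z_bar, inf) onto (0, 1].
    total = 0.0
    for i in range(n):
        t = (i + 0.5) / n
        x = 1.0 / t                      # x = z / z_bar
        density = a * x ** (-(a + 1))    # Pareto density above z_bar, in units of x
        total += (1 - g) * exp_term(x, zeta_u, zeta_c) * density / (t * t) / n
    hazard = 1.0 / a                     # (1 - H) / (z h) for a Pareto tail
    odds = hazard * total / zeta_c
    return odds / (1 + odds)


def asymptotic_rate(g, zeta_u, zeta_c, a):
    """High-income rate from Part 1: (1 - g) / (1 - g + zeta_u + zeta_c * (a - 1))."""
    return (1 - g) / (1 - g + zeta_u + zeta_c * (a - 1))


if __name__ == "__main__":
    # Illustrative parameter values only.
    print(rate_from_general_formula(0.25, 0.2, 0.5, 2.0))
    print(asymptotic_rate(0.25, 0.2, 0.5, 2.0))
```

Under these assumptions the two numbers should agree closely, which illustrates that the top-rate formula is the Pareto-tail limit of the general one.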

SLIDE 29

Interpretation

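Three elements determine optimal tax rates at $\bar z$:

shape of the income distribution: $\frac{1-H(\bar z)}{\bar z\,h^*(\bar z)}$

substitution/income effects: $\frac{1}{\zeta^c(\bar z)}$ and $\exp\left[\int_{\bar z}^{z} \left(1 - \frac{\zeta^u(z')}{\zeta^c(z')}\right) \frac{dz'}{z'}\right]$

and social marginal weights: $1 - g(z)$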

SLIDE 30

Shape of income distribution

The shape of the income distribution: $\frac{1-H(\bar z)}{\bar z\,h^*(\bar z)}$

The elastic distortion at $\bar z$ induced by a marginal rate increase is proportional to income at that level times the number of people at that level: $\bar z\,h(\bar z)$.

The gain in tax receipts is proportional to the number of people above $\bar z$: $1 - H(\bar z)$

⟹ Government should apply high marginal rates at levels where the density of taxpayers is low compared to the number of taxpayers with higher income

SLIDE 31

Further explanation

This is clearly the case at the bottom: $\bar z\,h(\bar z)$ is close to 0 while $1 - H(\bar z)$ is close to 1

At the top, assuming a Pareto distribution of parameter $a$, $\frac{1-H(\bar z)}{\bar z\,h(\bar z)} = 1/a$

For the U.S., $a = 2$ ⟹ $1/a = 0.5$
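
The Pareto value of this ratio can be verified directly from the survival function and the density. A minimal sketch, assuming a Pareto distribution with minimum income `z_min` (a hypothetical parameter, set to 1 by default); the function names are illustrative.

```python
def pareto_survival(z, a, z_min=1.0):
    """1 - H(z) for a Pareto distribution with tail parameter a and minimum income z_min."""
    return (z_min / z) ** a


def pareto_density(z, a, z_min=1.0):
    """h(z) = a * z_min**a / z**(a + 1)."""
    return a * z_min ** a / z ** (a + 1)


def hazard_ratio(z, a, z_min=1.0):
    """(1 - H(z)) / (z * h(z)), the distribution term in the optimal tax formula."""
    return pareto_survival(z, a, z_min) / (z * pareto_density(z, a, z_min))


if __name__ == "__main__":
    for z in (1.5, 10.0, 1000.0):
        # The ratio does not depend on z: it equals 1 / a at every income level.
        print(z, hazard_ratio(z, 2.0))
```

Because the ratio is flat under a Pareto tail, the shape term alone pushes neither up nor down on marginal rates as income rises at the top.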

SLIDE 32

Variations of $\frac{1-H(z)}{z\,h(z)}$ across incomes

SLIDE 33

Substitution and income effects

Behavioural effects enter the formula in two ways:

Compensated response from taxpayers (substitution effect) via the compensated elasticity term $\frac{1}{\zeta^c(\bar z)}$

Increase in the tax burden of taxpayers above $\bar z$, inducing them to work more (via the exponential term, which is larger than 1 when income effects are present)

SLIDE 34

Social marginal welfare weights

Represented by the term $1 - g(z)$.

$g(z)$: the relative value for the government of an additional dollar of consumption at income $z$.

If $g(z)$ decreases with $z$, then the government has redistributive tastes.

SLIDE 35

3. NUMERICAL SIMULATIONS

SLIDE 36

Methodology

Aim: to simulate the importance of substitution vs income effects and utilitarian vs Rawlsian social welfare weights

Two utility functions:

Type 1: $u = \log\left(c - \frac{l^{1+k}}{1+k}\right)$, no income effects

Type 2: $u = \log c - \log\left(1 + \frac{l^{1+k}}{1+k}\right)$, with income effects.

In both cases, constant compensated elasticity = $1/k$

Use of the skill distribution as an exogenous measure of the income distribution
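
For the Type 1 utility function the elasticity claim can be checked in closed form. The sketch below is an illustration, not the simulation code behind the slides: it assumes a linear budget c = w·l·(1 − τ) + R, for which the first-order condition is w(1 − τ) = l^k, and the function names and parameter values are hypothetical.

```python
import math


def type1_earnings(w, tau, k):
    """Earnings z = w * l chosen under u = log(c - l**(1+k)/(1+k)) on a linear budget.

    The first-order condition w * (1 - tau) = l**k does not involve virtual
    income R, which is why this utility function has no income effects.
    """
    labour = (w * (1 - tau)) ** (1 / k)
    return w * labour


def earnings_elasticity(w, tau, k, step=1e-4):
    """Numerical elasticity of earnings with respect to the net-of-tax rate 1 - tau."""
    net = 1 - tau
    high = type1_earnings(w, 1 - net * (1 + step), k)
    low = type1_earnings(w, 1 - net * (1 - step), k)
    # Log-difference of earnings divided by log-difference of the net-of-tax rate.
    return (math.log(high) - math.log(low)) / (math.log(1 + step) - math.log(1 - step))


if __name__ == "__main__":
    # Illustrative wage, tax rate and curvature parameter.
    print(earnings_elasticity(w=20.0, tau=0.3, k=2.0))  # close to 1 / k
```

The numerical elasticity should come out at 1/k regardless of the wage or the tax rate chosen, in line with the constant-elasticity statement on the slide.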

SLIDE 37

Results : optimal non-linear & linear rates according to wage income

SLIDE 38

Results

In all four cases optimal rates are U-shaped: close to actual tax schedules

High rates for low w correspond to the phasing-out of guaranteed income levels

Income effects increase rates

Higher compensated elasticity decreases rates

Rawlsian criterion leads to higher rates, but the difference between Rawlsian and utilitarian decreases for higher incomes

SLIDE 39

GENERAL CONCLUSIONS

Elasticity estimates from the empirical literature suggest that top marginal rates should not be below 50% and can go as high as 80%.

The elasticity method is fruitful as it cleanly separates the individual impacts of the shape of the income distribution, substitution and income effects, and redistributive tastes on the optimal marginal tax rate.