[PPT] - A Rational Approach to Outcome Measurement Duncan Raistrick Leeds PowerPoint Presentation

SLIDE 1

A Rational Approach to Outcome Measurement

Duncan Raistrick Leeds Addiction Unit

Leeds and York Collaborations for Leadership in Applied Health Research and Care (CLAHRC) Funded by the NIHR Payment by Results Conference Friday 28th September 2012

SLIDE 2

NIHR Nine CLAHRCs approx £10million each CLAHRC for Leeds/York/Bradford Addiction Research in Acute Settings (ARiAS) 1 of 5 themes ARiAS outcome measurement 1 of 6 strands CLAHRC research group: Duncan Raistrick, Gillian Tober, Christine Godfrey, Charlie Lloyd, Steve Parrot, Jude Watson, Veronica Dale Co-opted Expert Group convened March 2011: Owen Bowden-Jones, Alex Copello, Ed Day, Eilish Gilvarry, Don Lavoie, Damian Mitchell, Julia Sinclair, John Strang, and Alex Whincup

SLIDE 3

Summary

Views and expectations of different stakeholders
Service user and carers
Politicians and commissioners
Practitioners
Substance use and scales
Self report
Biological measures
Scales EQ5D LDQ CORE SSQ
Societal Impact Measures
Further interesting things to find out
Relationship between measures
Clinically significant change

SLIDE 4

Outcome measures answer the question “how do I know this person is getting better?” and they also...

summarise complex information in a clinically meaningful and ‘real world’ way communicate complex information in a clear and simple way using minimal data can be integrated into routine practice in a clinically useful way but.............

SLIDE 5

......different stakeholders have different ideas of ‘getting better’ ...... the same stakeholders have different needs

Service users and carers – abstinence
Politicians and commissioners – costs and benefits
Practitioners
Health workers – physical and mental health
Criminal justice workers – offending behaviour
Social workers – safeguarding children

......so, there is no ideal measure, just the mix of measures that best suits the purpose.

SLIDE 6

Service user and carer views on what

utcomes are important

Agency Type Type of Activity SU F&F

FG1

NHS Treatment 7 4

FG2

NHS/3rd sector DRR 3 2

FG3

NHS/3rd sector Harm reduction

FG4

SMART group Recovery mutual aid 7 n/a

FG5

3rd sector Recovery SU only 7 n/a

FG6

3rd sector Recovery F&F only n/a 6 TOTAL 24 12

Source: unpublished CLAHRC study 2012

Six focus groups (FGs) were held to elicit service user (SU) views and views of Family and Friends (F&F) on what constitutes a good outcome.

SLIDE 7

Analysis of Focus Groups

FG1 recovery is abstinence, social life, support for family and friends, relationships, lifestyle with partner, being in control, life style changes during recovery process, physical and mental health, confidence FG2 positive environment to maintain abstinence and move away from drugs and alcohol, no good

utcome – it’s with you for life, not using methadone or buprenorphine, social support, doing

something to fill time, safe accommodation, children back from care, confidence, physical health and appearance FG4 physical and mental health, ability to distract and not act upon cravings, self-worth, energy, not feeling isolated, personal responsibility, living a normal life, engaging in treatment, improved relationships FG5 self esteem, eating healthily, abstinence, sleep, physical health, relationships, social networks, feeling valued, new interests and skills, courses. FG6 employment, safe and supportive social networks, something to do during the day, self-worth, physical and psychological well-being, having a good addiction, not feeling ashamed, financial situation

There was a distinct view that abstinence or perhaps moderation is the first step but the social support to maintain change is all important. Other gains such as health and relationships were seen as good but not the essence of a good outcome.

Source: unpublished CLAHRC study 2012

SLIDE 8

Substance Use as the Primary Outcome

SLIDE 9

Justification for substance use as the primary outcome measure.......

Viewed by service users as most important
Viewed by careers as most important
Most convincing for general public and policy

makers

Substance misuse is the condition
Correlation with societal costs

SLIDE 10

Different Ways to Measure Substance Use

Frequency Categories ASSIST & AUDIT Objective Testing blood tests toxicology Episodes

f Use

OTI Quantity / Frequency MAP & TOP Actual Frequency ASI

Composite measures are restrictive:

always a compromise – too much/too little detail eg ASI
typically rater completed eg TOP, ASI, MAP
item selection bias eg TOP
predetermined categories not suited to particular needs eg AUDIT
not easy to update/modify and revalidate eg TOP, ASI

SLIDE 11

10 20 30 40 50 60 % of sample in drinking volume group Interview year

Abstinent Sensible (<14/21 units) Hazardous (14-35 / 21-50) Heavy (35-70 / 50-100) Very heavy (70+ / 100+)

Birmingham Heavy Drinkers 10yr Follow-up

Uses UNITS of alcohol grouped in ranges

n=259 followed at 10yrs: >50% reported major life events in previous 2yrs: typically health, employment, shift of attitude. 18 known deaths: 3 CVS, 4 liver, 3 cancer, 1 suicide, 2 diabetes. Abstinence increased from 0-10%, weekly units decreased from 90-59 (m) and 60-36 (f)

Source: Rolfe et al.(2009) Report to DH

SLIDE 12

UK Alcohol Treatment Trial

FREQUENCY and UNITS of alcohol

20 40 60 80 100 Percent of Days Abstinent Baseline 12 Months MET SBNT 5 10 15 20 25 30 Drinks per Drinking Day Baseline 12 months MET SBNT Source: UKATT Research Team, BMJ (2005)

SLIDE 13

Problems of the substance itself as an

utcome measure……
Is it how much (quantity) or how often (frequency) that

matters? How reliably can these be measured?

Is it the range of substances used or just the presenting

main substance that should be the outcome? Does this include prescribed drugs (eg methadone, diazepam)?

Are some drugs associated with more harmful routes of

administration than others (eg heroin)?

Does substance use itself matter or is it the related harms?

Are some harms clearly attributable to particular substances (eg alcohol) some much less so (eg heroin)?

How to measure it?

SLIDE 14

72-hour detection window (drugs that can be detected up to 72hrs)

Frequency of drug use Frequency of testing 2x /wk 1x /wk 2x /mth 1x /mth 8x /yr Every day

3±2 7±2 15±10 30±13 46±40

Every other day

4±3 8±3 18±12 35±17 51±50

1x /wk

5±4 11±7 23±18 48±31 71±66

1x /wk

9±9 18±14 40±33 80±64 118±106

2x /mth

19±21 39±35 91±88 160±124 272±260

1x /mth

36±42 71±66 150±141 306±283 560±598

Toxicology Screening as an Indirect Measure

Days (mean ± sd) before a positive test would be expected given different drug use frequency and different testing frequency

Source: Crosby et al., (2003) J Addictive Diseases

SLIDE 15

Correlations with Societal Costs

example data for units of alcohol and health status

10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60

ne2one

increase decrease weekly units of alcohol health as % of best possible health

moderate drinking not associated with major health problems health may not recover

nce damage is done

SLIDE 16

Correlations with Substance Use.... Baseline Follow-up EQ5D LDQ CORE SSQ EQ5D LDQ CORE SSQ Alcohol freq

0.32** 0.48**

0.28**

0.08*
0.30** 0.62**

0.39**

0.20**

Alcohol units

0.38** 0.47**

0.25**

0.19** -0.30** 0.62**

0.38**

0.20**

Heroin freq

0.11

0.39** 0.09

0.10
0.21

0.40 0.40

0.60**

**p<.001 *p<.01 Source: unpublished CLAHRC clinical sample

but....

substance use is difficult to measure – LDQ good proxy
other scales are needed to paint a picture of outcome

Conclusion

There is a scientific and political case for the primacy of substance use as an outcome measure....

SLIDE 17

Choosing Scales as Secondary Outcome Measures

SLIDE 18

Types of Outcome Scales

Generic Measures (Health)

Is treatment cost effective? How ill are people with addiction problems compared to other users of health care? How complex are the health problems? What is the illness profile of people with addictions?

Dimension Measures (Addiction)

How severe is the addiction? How difficult is treatment likely to be? How good is

ne addiction service compared to another? Do problems persist?

Condition Specific Measures (Depression, Pregnancy)

How severe is the specific condition? How do services targeting the condition compare? How effective is treatment for this specific problem?

Personal Goal Measures (Me)

The personal outcome goals agreed with each service user.

Source: Fitzpatrick et al. (1998) Health Technology Assessment

SLIDE 19

Societal Impact Measures - SIMs

Treating the individual has benefits at the societal

level – SIMs are service level

Political interest is in societal costs (a research

exercise) – SIMs are a headline contribution

SIMs need to be: objective, easy to collect, capable
f showing variability, reflect the impact of the service

in the fewest possible measures...............

Pregnancy and Parenting i) birth weight ii) child with mother at 12months Hospital In-reach i) A&E attendances ii) in-patient admissions in last 12months Detoxification i) % completing ii) % supervised disulfiram

SLIDE 20

A Quality Framework for Outcome Scales

Self completion scales are the gold standard Scales evaluated by scoring for:

Evidence Base (including independent evaluations)
Psychometric Properties
Normative Data
Availability (free and supported by website)
Ease of Use
Universality (all substances and all socio-economic

groups)

Service User Evaluations

Source: NIMHE (2009)

SLIDE 21

Scales that best match quality criteria...

Evidence (0-2) Number of validation publications Psychometric properties (0-6) Availability (0-2) Practicality (0-3) Universality (0-2) Population norms (0-2)

EQ-5D health

2 tbc 6 2 2 2 2

LDQ dependence

2 3 5 2 3 2 1

CORE-10 mental health

2 2 5 2 3 2 2

SSQ satisfaction

1 1 5 2 3 2 1

APQ problems

2 2 6 2 3

HoNOS mental health

2 tbc 4 2 3 2 tbc

PHQ9 depression

2 tbc tbc 2 3 2 tbc

GAD7 anxiety

2 tbc tbc 2 3 2 tbc

IRS impulsivity

2 5 2 2 2

FMQ family coping

tbc tbc tbc 2 tbc 2

PCQ parenting

1 1 2 2 3 2

SLIDE 22

EQ5D weighted scores baseline and 3mth

higher score = better health weighted score max = 1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

All Alcohol Heroin & Opiates Methadone Other drug Population 25-35

Baseline Follow up Gen Pop

Source: unpublished clinical data

SLIDE 23

Frequency of scores at assessment and 3mth

LDQ CORE SSQ

Source: unpublished clinical data

SLIDE 24

Conclusion

The EQ5D LDQ CORE SSQ are clinically useful, used in routine practice, have predictive value and paint a comprehensive picture....

20 40 60 80 100 5 10 15 20 25 30

Your score now… …this is what happens in treatment

SLIDE 25

Interesting Things to Find Out

SLIDE 26

Domains (or components) of Addiction

psychological well being dependence social well being substance use

SLIDE 27

Factor analysis with #4 factor solution EQ5D LDQ CORE-10 SSQ

F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 .77 .13 .07

.09

.10 .80 .25

.10

.19 .36 .71

.12

.11

.04
.01

.66 .67

.02

.15

.08

.12 .81 .24

.15
.15
.01

.15

.40
.01
.05
.04

.70 .57 .28 .31

.15

.12 .80 .27

.10

.02 .26 .44

.22
.15
.19
.03

.57 .68 .22 .08

.07

.11 .83 .20

.11

.04 .19 .60

.14
.05
.22
.06

.59 .25 .31 .69

.12

.18 .65 .20

.11

.28 .25 .75

.10
.17
.13
.30

.56 .15 .75 .18

.13

.34 .14 .57

.20
.11
.07
.30

.60 .04 .75 .25

.08

.26 .30 .55

.11
.06
.05
.20

.64 .04 .66 .22

.12

.22 .31 .77

.22
.02
.11
.25

.57 .09 .78 .22

.12

.20 .30 .73

.23

.15 .74 .35

.13

.27 .22 .69

.17

8% of variance 23% of variance 17% of variance 9% of variance

SLIDE 28

Relationship between EQ5D LDQ CORE SSQ

LDQ and CORE close to each other. Predicts if dependence is treated then mental health improves. SSQ is most separated. Predicts it will change most differently to other measures. EQ5D is generic but independent and between the other

measures. Has less

influence than LDQ on mental health more on social satisfaction.

SLIDE 29

Clinically Significant Change - ‘Gold Standard’

Reliable Change Score Well Functioning Population LDQ >= 4 < 12 CORE-10 >= 6 < 14 SSQ >= 4 > 10

Source: CLAHRC submitted

Jacobson et al. (1999) proposed that in order to take account of baseline scores and measuring error, clinically significant change should a) be statistically reliable b) end scores be in a well functioning population range

retest e

r 1 S S

1

 

2 e diff

) 2(S S 

RC = reliable change 95% probability if RC >=1.28 Sdiff = standard error of difference between means of LDQ scores Se = standard error of measurement of LDQ S1 = standard deviation of mean1 rretest = test/retest reliabilty of LDQ

SLIDE 30

Clinically Significant Change - example

10 20 30 40 50 60 70 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 LDQ scores

reliable change >=4 if pushed into well functioning population range then Clinically Significant Change achieved well functioning population is 2standard deviation above general population mean LDQ<12

SLIDE 31

Reliable (RC) and clinically significant (CS) change (n-925).... n=925

LDQ % CORE % SSQ % RC improved 61.0 46.0 36.7 RC worse 4.6 5.7 12.0 Too small for RC 14.3 6.1 5.2 CS improvement 50.1 30.5 31.9 clinically significant change at 3mth (drinking n=396)..... CS change % Yes No Yes No Yes No Drinking 22.0 31.1 13.4 39.6 16.7 36.4 Abstinent 35.4 11.6 24.2 22.7 17.7 29.3

P<.001 P<.001 n.s. 95% probability

f real

change social situation most difficult to recover abstinence associated with the most

CSC. LDQ

works across drinking

utcomes

CSC is a tough test of improvement The GOLD standard

SLIDE 32

Thank you. Questions?

Conclusion

multiple measures are needed to answer different stakeholder

questions

always include substance misuse
avoid composite measures – tailor to suit the agency
choose scales that meet Quality Framework criteria
choose clinically useful measures
add Societal Impact Measures
check that the package is ethical and easy for routine use

A Rational Approach to Outcome Measurement

Duncan Raistrick Leeds Addiction Unit

Leeds and York Collaborations for Leadership in Applied Health Research and Care (CLAHRC) Funded by the NIHR Payment by Results Conference Friday 28th September 2012

Summary

Outcome measures answer the question “how do I know this person is getting better?” and they also...

summarise complex information in a clinically meaningful and ‘real world’ way communicate complex information in a clear and simple way using minimal data can be integrated into routine practice in a clinically useful way but.............

......different stakeholders have different ideas of ‘getting better’ ...... the same stakeholders have different needs

......so, there is no ideal measure, just the mix of measures that best suits the purpose.

Service user and carer views on what

NHS Treatment 7 4

NHS/3rd sector DRR 3 2

NHS/3rd sector Harm reduction

SMART group Recovery mutual aid 7 n/a

3rd sector Recovery SU only 7 n/a

3rd sector Recovery F&F only n/a 6 TOTAL 24 12

Six focus groups (FGs) were held to elicit service user (SU) views and views of Family and Friends (F&F) on what constitutes a good outcome.

Analysis of Focus Groups

There was a distinct view that abstinence or perhaps moderation is the first step but the social support to maintain change is all important. Other gains such as health and relationships were seen as good but not the essence of a good outcome.

Substance Use as the Primary Outcome

Justification for substance use as the primary outcome measure.......

makers

Different Ways to Measure Substance Use

Composite measures are restrictive:

Birmingham Heavy Drinkers 10yr Follow-up

Uses UNITS of alcohol grouped in ranges

n=259 followed at 10yrs: >50% reported major life events in previous 2yrs: typically health, employment, shift of attitude. 18 known deaths: 3 CVS, 4 liver, 3 cancer, 1 suicide, 2 diabetes. Abstinence increased from 0-10%, weekly units decreased from 90-59 (m) and 60-36 (f)

UK Alcohol Treatment Trial

FREQUENCY and UNITS of alcohol

Problems of the substance itself as an

matters? How reliably can these be measured?

main substance that should be the outcome? Does this include prescribed drugs (eg methadone, diazepam)?

administration than others (eg heroin)?

Are some harms clearly attributable to particular substances (eg alcohol) some much less so (eg heroin)?

Toxicology Screening as an Indirect Measure

Days (mean ± sd) before a positive test would be expected given different drug use frequency and different testing frequency

Correlations with Societal Costs

example data for units of alcohol and health status

10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60

increase decrease weekly units of alcohol health as % of best possible health

Correlations with Substance Use.... Baseline Follow-up EQ5D LDQ CORE SSQ EQ5D LDQ CORE SSQ Alcohol freq

0.28**

0.39**

Alcohol units

0.25**

0.38**

Heroin freq

0.39** 0.09

0.40** 0.40**

but....

Conclusion

There is a scientific and political case for the primacy of substance use as an outcome measure....

Choosing Scales as Secondary Outcome Measures

Types of Outcome Scales

Generic Measures (Health)

Is treatment cost effective? How ill are people with addiction problems compared to other users of health care? How complex are the health problems? What is the illness profile of people with addictions?

Dimension Measures (Addiction)

How severe is the addiction? How difficult is treatment likely to be? How good is

Condition Specific Measures (Depression, Pregnancy)

How severe is the specific condition? How do services targeting the condition compare? How effective is treatment for this specific problem?

Personal Goal Measures (Me)

The personal outcome goals agreed with each service user.

Societal Impact Measures - SIMs

level – SIMs are service level

exercise) – SIMs are a headline contribution

in the fewest possible measures...............

Pregnancy and Parenting i) birth weight ii) child with mother at 12months Hospital In-reach i) A&E attendances ii) in-patient admissions in last 12months Detoxification i) % completing ii) % supervised disulfiram

A Quality Framework for Outcome Scales

Self completion scales are the gold standard Scales evaluated by scoring for:

groups)

Scales that best match quality criteria...

EQ5D weighted scores baseline and 3mth

higher score = better health weighted score max = 1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Frequency of scores at assessment and 3mth

LDQ CORE SSQ

Conclusion

The EQ5D LDQ CORE SSQ are clinically useful, used in routine practice, have predictive value and paint a comprehensive picture....

Interesting Things to Find Out

Domains (or components) of Addiction

psychological well being dependence social well being substance use

0.40 0.40