A Rational Approach to Outcome Measurement Duncan Raistrick Leeds - - PowerPoint PPT Presentation

a rational approach to outcome measurement
SMART_READER_LITE
LIVE PREVIEW

A Rational Approach to Outcome Measurement Duncan Raistrick Leeds - - PowerPoint PPT Presentation

A Rational Approach to Outcome Measurement Duncan Raistrick Leeds Addiction Unit Leeds and York Collaborations for Leadership in Applied Health Research and Care (CLAHRC) Funded by the NIHR Payment by Results Conference Friday 28 th September


slide-1
SLIDE 1

A Rational Approach to Outcome Measurement

Duncan Raistrick Leeds Addiction Unit

Leeds and York Collaborations for Leadership in Applied Health Research and Care (CLAHRC) Funded by the NIHR Payment by Results Conference Friday 28th September 2012

slide-2
SLIDE 2

NIHR Nine CLAHRCs approx £10million each CLAHRC for Leeds/York/Bradford Addiction Research in Acute Settings (ARiAS) 1 of 5 themes ARiAS outcome measurement 1 of 6 strands CLAHRC research group: Duncan Raistrick, Gillian Tober, Christine Godfrey, Charlie Lloyd, Steve Parrot, Jude Watson, Veronica Dale Co-opted Expert Group convened March 2011: Owen Bowden-Jones, Alex Copello, Ed Day, Eilish Gilvarry, Don Lavoie, Damian Mitchell, Julia Sinclair, John Strang, and Alex Whincup

slide-3
SLIDE 3

Summary

  • Views and expectations of different stakeholders
  • Service user and carers
  • Politicians and commissioners
  • Practitioners
  • Substance use and scales
  • Self report
  • Biological measures
  • Scales EQ5D LDQ CORE SSQ
  • Societal Impact Measures
  • Further interesting things to find out
  • Relationship between measures
  • Clinically significant change
slide-4
SLIDE 4

Outcome measures answer the question “how do I know this person is getting better?” and they also...

summarise complex information in a clinically meaningful and ‘real world’ way communicate complex information in a clear and simple way using minimal data can be integrated into routine practice in a clinically useful way but.............

slide-5
SLIDE 5

......different stakeholders have different ideas of ‘getting better’ ...... the same stakeholders have different needs

  • Service users and carers – abstinence
  • Politicians and commissioners – costs and benefits
  • Practitioners
  • Health workers – physical and mental health
  • Criminal justice workers – offending behaviour
  • Social workers – safeguarding children

......so, there is no ideal measure, just the mix of measures that best suits the purpose.

slide-6
SLIDE 6

Service user and carer views on what

  • utcomes are important

Agency Type Type of Activity SU F&F

FG1

NHS Treatment 7 4

FG2

NHS/3rd sector DRR 3 2

FG3

NHS/3rd sector Harm reduction

FG4

SMART group Recovery mutual aid 7 n/a

FG5

3rd sector Recovery SU only 7 n/a

FG6

3rd sector Recovery F&F only n/a 6 TOTAL 24 12

Source: unpublished CLAHRC study 2012

Six focus groups (FGs) were held to elicit service user (SU) views and views of Family and Friends (F&F) on what constitutes a good outcome.

slide-7
SLIDE 7

Analysis of Focus Groups

FG1 recovery is abstinence, social life, support for family and friends, relationships, lifestyle with partner, being in control, life style changes during recovery process, physical and mental health, confidence FG2 positive environment to maintain abstinence and move away from drugs and alcohol, no good

  • utcome – it’s with you for life, not using methadone or buprenorphine, social support, doing

something to fill time, safe accommodation, children back from care, confidence, physical health and appearance FG4 physical and mental health, ability to distract and not act upon cravings, self-worth, energy, not feeling isolated, personal responsibility, living a normal life, engaging in treatment, improved relationships FG5 self esteem, eating healthily, abstinence, sleep, physical health, relationships, social networks, feeling valued, new interests and skills, courses. FG6 employment, safe and supportive social networks, something to do during the day, self-worth, physical and psychological well-being, having a good addiction, not feeling ashamed, financial situation

There was a distinct view that abstinence or perhaps moderation is the first step but the social support to maintain change is all important. Other gains such as health and relationships were seen as good but not the essence of a good outcome.

Source: unpublished CLAHRC study 2012

slide-8
SLIDE 8

Substance Use as the Primary Outcome

slide-9
SLIDE 9

Justification for substance use as the primary outcome measure.......

  • Viewed by service users as most important
  • Viewed by careers as most important
  • Most convincing for general public and policy

makers

  • Substance misuse is the condition
  • Correlation with societal costs
slide-10
SLIDE 10

Different Ways to Measure Substance Use

Frequency Categories ASSIST & AUDIT Objective Testing blood tests toxicology Episodes

  • f Use

OTI Quantity / Frequency MAP & TOP Actual Frequency ASI

Composite measures are restrictive:

  • always a compromise – too much/too little detail eg ASI
  • typically rater completed eg TOP, ASI, MAP
  • item selection bias eg TOP
  • predetermined categories not suited to particular needs eg AUDIT
  • not easy to update/modify and revalidate eg TOP, ASI
slide-11
SLIDE 11

10 20 30 40 50 60 % of sample in drinking volume group Interview year

Abstinent Sensible (<14/21 units) Hazardous (14-35 / 21-50) Heavy (35-70 / 50-100) Very heavy (70+ / 100+)

Birmingham Heavy Drinkers 10yr Follow-up

Uses UNITS of alcohol grouped in ranges

n=259 followed at 10yrs: >50% reported major life events in previous 2yrs: typically health, employment, shift of attitude. 18 known deaths: 3 CVS, 4 liver, 3 cancer, 1 suicide, 2 diabetes. Abstinence increased from 0-10%, weekly units decreased from 90-59 (m) and 60-36 (f)

Source: Rolfe et al.(2009) Report to DH

slide-12
SLIDE 12

UK Alcohol Treatment Trial

FREQUENCY and UNITS of alcohol

20 40 60 80 100 Percent of Days Abstinent Baseline 12 Months MET SBNT 5 10 15 20 25 30 Drinks per Drinking Day Baseline 12 months MET SBNT Source: UKATT Research Team, BMJ (2005)

slide-13
SLIDE 13

Problems of the substance itself as an

  • utcome measure……
  • Is it how much (quantity) or how often (frequency) that

matters? How reliably can these be measured?

  • Is it the range of substances used or just the presenting

main substance that should be the outcome? Does this include prescribed drugs (eg methadone, diazepam)?

  • Are some drugs associated with more harmful routes of

administration than others (eg heroin)?

  • Does substance use itself matter or is it the related harms?

Are some harms clearly attributable to particular substances (eg alcohol) some much less so (eg heroin)?

  • How to measure it?
slide-14
SLIDE 14

72-hour detection window (drugs that can be detected up to 72hrs)

Frequency of drug use Frequency of testing 2x /wk 1x /wk 2x /mth 1x /mth 8x /yr Every day

3±2 7±2 15±10 30±13 46±40

Every other day

4±3 8±3 18±12 35±17 51±50

1x /wk

5±4 11±7 23±18 48±31 71±66

1x /wk

9±9 18±14 40±33 80±64 118±106

2x /mth

19±21 39±35 91±88 160±124 272±260

1x /mth

36±42 71±66 150±141 306±283 560±598

Toxicology Screening as an Indirect Measure

Days (mean ± sd) before a positive test would be expected given different drug use frequency and different testing frequency

Source: Crosby et al., (2003) J Addictive Diseases

slide-15
SLIDE 15

Correlations with Societal Costs

example data for units of alcohol and health status

10 20 30 40 50 60 70 80 90 100 10 20 30 40 50 60

  • ne2one

increase decrease weekly units of alcohol health as % of best possible health

moderate drinking not associated with major health problems health may not recover

  • nce damage is done
slide-16
SLIDE 16

Correlations with Substance Use.... Baseline Follow-up EQ5D LDQ CORE SSQ EQ5D LDQ CORE SSQ Alcohol freq

  • 0.32** 0.48**

0.28**

  • 0.08*
  • 0.30** 0.62**

0.39**

  • 0.20**

Alcohol units

  • 0.38** 0.47**

0.25**

  • 0.19** -0.30** 0.62**

0.38**

  • 0.20**

Heroin freq

  • 0.11

0.39** 0.09

  • 0.10
  • 0.21

0.40** 0.40**

  • 0.60**

**p<.001 *p<.01 Source: unpublished CLAHRC clinical sample

but....

  • substance use is difficult to measure – LDQ good proxy
  • other scales are needed to paint a picture of outcome

Conclusion

There is a scientific and political case for the primacy of substance use as an outcome measure....

slide-17
SLIDE 17

Choosing Scales as Secondary Outcome Measures

slide-18
SLIDE 18

Types of Outcome Scales

Generic Measures (Health)

Is treatment cost effective? How ill are people with addiction problems compared to other users of health care? How complex are the health problems? What is the illness profile of people with addictions?

Dimension Measures (Addiction)

How severe is the addiction? How difficult is treatment likely to be? How good is

  • ne addiction service compared to another? Do problems persist?

Condition Specific Measures (Depression, Pregnancy)

How severe is the specific condition? How do services targeting the condition compare? How effective is treatment for this specific problem?

Personal Goal Measures (Me)

The personal outcome goals agreed with each service user.

Source: Fitzpatrick et al. (1998) Health Technology Assessment

slide-19
SLIDE 19

Societal Impact Measures - SIMs

  • Treating the individual has benefits at the societal

level – SIMs are service level

  • Political interest is in societal costs (a research

exercise) – SIMs are a headline contribution

  • SIMs need to be: objective, easy to collect, capable
  • f showing variability, reflect the impact of the service

in the fewest possible measures...............

Pregnancy and Parenting i) birth weight ii) child with mother at 12months Hospital In-reach i) A&E attendances ii) in-patient admissions in last 12months Detoxification i) % completing ii) % supervised disulfiram

slide-20
SLIDE 20

A Quality Framework for Outcome Scales

Self completion scales are the gold standard Scales evaluated by scoring for:

  • Evidence Base (including independent evaluations)
  • Psychometric Properties
  • Normative Data
  • Availability (free and supported by website)
  • Ease of Use
  • Universality (all substances and all socio-economic

groups)

  • Service User Evaluations

Source: NIMHE (2009)

slide-21
SLIDE 21

Scales that best match quality criteria...

Evidence (0-2) Number of validation publications Psychometric properties (0-6) Availability (0-2) Practicality (0-3) Universality (0-2) Population norms (0-2)

EQ-5D health

2 tbc 6 2 2 2 2

LDQ dependence

2 3 5 2 3 2 1

CORE-10 mental health

2 2 5 2 3 2 2

SSQ satisfaction

1 1 5 2 3 2 1

APQ problems

2 2 6 2 3

HoNOS mental health

2 tbc 4 2 3 2 tbc

PHQ9 depression

2 tbc tbc 2 3 2 tbc

GAD7 anxiety

2 tbc tbc 2 3 2 tbc

IRS impulsivity

2 5 2 2 2

FMQ family coping

tbc tbc tbc 2 tbc 2

PCQ parenting

1 1 2 2 3 2

slide-22
SLIDE 22

EQ5D weighted scores baseline and 3mth

higher score = better health weighted score max = 1

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

All Alcohol Heroin & Opiates Methadone Other drug Population 25-35

Baseline Follow up Gen Pop

Source: unpublished clinical data

slide-23
SLIDE 23

Frequency of scores at assessment and 3mth

LDQ CORE SSQ

Source: unpublished clinical data

slide-24
SLIDE 24

Conclusion

The EQ5D LDQ CORE SSQ are clinically useful, used in routine practice, have predictive value and paint a comprehensive picture....

20 40 60 80 100 5 10 15 20 25 30

Your score now… …this is what happens in treatment

slide-25
SLIDE 25

Interesting Things to Find Out

slide-26
SLIDE 26

Domains (or components) of Addiction

psychological well being dependence social well being substance use

slide-27
SLIDE 27

Factor analysis with #4 factor solution EQ5D LDQ CORE-10 SSQ

F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 F#1 F#2 F#3 F#4 .77 .13 .07

  • .09

.10 .80 .25

  • .10

.19 .36 .71

  • .12

.11

  • .04
  • .01

.66 .67

  • .02

.15

  • .08

.12 .81 .24

  • .15
  • .15
  • .01

.15

  • .40
  • .01
  • .05
  • .04

.70 .57 .28 .31

  • .15

.12 .80 .27

  • .10

.02 .26 .44

  • .22
  • .15
  • .19
  • .03

.57 .68 .22 .08

  • .07

.11 .83 .20

  • .11

.04 .19 .60

  • .14
  • .05
  • .22
  • .06

.59 .25 .31 .69

  • .12

.18 .65 .20

  • .11

.28 .25 .75

  • .10
  • .17
  • .13
  • .30

.56 .15 .75 .18

  • .13

.34 .14 .57

  • .20
  • .11
  • .07
  • .30

.60 .04 .75 .25

  • .08

.26 .30 .55

  • .11
  • .06
  • .05
  • .20

.64 .04 .66 .22

  • .12

.22 .31 .77

  • .22
  • .02
  • .11
  • .25

.57 .09 .78 .22

  • .12

.20 .30 .73

  • .23

.15 .74 .35

  • .13

.27 .22 .69

  • .17

8% of variance 23% of variance 17% of variance 9% of variance

slide-28
SLIDE 28

Relationship between EQ5D LDQ CORE SSQ

LDQ and CORE close to each other. Predicts if dependence is treated then mental health improves. SSQ is most separated. Predicts it will change most differently to other measures. EQ5D is generic but independent and between the other

  • measures. Has less

influence than LDQ on mental health more on social satisfaction.

slide-29
SLIDE 29

Clinically Significant Change - ‘Gold Standard’

Reliable Change Score Well Functioning Population LDQ >= 4 < 12 CORE-10 >= 6 < 14 SSQ >= 4 > 10

Source: CLAHRC submitted

Jacobson et al. (1999) proposed that in order to take account of baseline scores and measuring error, clinically significant change should a) be statistically reliable b) end scores be in a well functioning population range

retest e

r 1 S S

1

 

2 e diff

) 2(S S 

RC = reliable change 95% probability if RC >=1.28 Sdiff = standard error of difference between means of LDQ scores Se = standard error of measurement of LDQ S1 = standard deviation of mean1 rretest = test/retest reliabilty of LDQ

slide-30
SLIDE 30

Clinically Significant Change - example

10 20 30 40 50 60 70 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 LDQ scores

reliable change >=4 if pushed into well functioning population range then Clinically Significant Change achieved well functioning population is 2standard deviation above general population mean LDQ<12

slide-31
SLIDE 31

Reliable (RC) and clinically significant (CS) change (n-925).... n=925

LDQ % CORE % SSQ % RC improved 61.0 46.0 36.7 RC worse 4.6 5.7 12.0 Too small for RC 14.3 6.1 5.2 CS improvement 50.1 30.5 31.9 clinically significant change at 3mth (drinking n=396)..... CS change % Yes No Yes No Yes No Drinking 22.0 31.1 13.4 39.6 16.7 36.4 Abstinent 35.4 11.6 24.2 22.7 17.7 29.3

P<.001 P<.001 n.s. 95% probability

  • f real

change social situation most difficult to recover abstinence associated with the most

  • CSC. LDQ

works across drinking

  • utcomes

CSC is a tough test of improvement The GOLD standard

slide-32
SLIDE 32

Thank you. Questions?

Conclusion

  • multiple measures are needed to answer different stakeholder

questions

  • always include substance misuse
  • avoid composite measures – tailor to suit the agency
  • choose scales that meet Quality Framework criteria
  • choose clinically useful measures
  • add Societal Impact Measures
  • check that the package is ethical and easy for routine use